Preprints, Working Papers, ...

Reweighting samples under covariate shift using a Wasserstein distance criterion

Abstract : Given two random variables with different laws, to which we only have access through finite-size i.i.d. samples, we address how to reweight the first sample so that its empirical distribution converges towards the true law of the second sample as the size of both samples goes to infinity. We study an optimal reweighting that minimizes the Wasserstein distance between the empirical measures of the two samples and leads to an expression of the weights in terms of Nearest Neighbors. Consistency and some asymptotic convergence rates in terms of expected Wasserstein distance are derived; they do not require absolute continuity of one random variable with respect to the other. These results have applications in Uncertainty Quantification for decoupled estimation and in bounding the generalization error of Nearest Neighbor Regression under covariate shift.
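
As an illustration of what "weights in terms of Nearest Neighbors" can look like, here is a minimal sketch, not the authors' code. It assumes the Euclidean distance and assumes that the weight of a point in the first sample is the fraction of points in the second sample whose nearest neighbor it is. The function name `nn_weights` and the toy data are hypothetical.

```python
import math
from typing import List, Sequence


def nn_weights(
    source: Sequence[Sequence[float]],
    target: Sequence[Sequence[float]],
) -> List[float]:
    """Assumed nearest-neighbor weights for the first (source) sample.

    Each target point is assigned to its nearest source point in
    Euclidean distance; the weight of a source point is the share of
    target points assigned to it. Ties go to the lowest index.
    """
    counts = [0] * len(source)
    for y in target:
        # Index of the source point closest to y.
        nearest = min(range(len(source)), key=lambda i: math.dist(source[i], y))
        counts[nearest] += 1
    return [c / len(target) for c in counts]


# Toy one-dimensional example (made-up data).
source_sample = [(0.0,), (1.0,), (5.0,)]
target_sample = [(0.9,), (1.2,), (0.1,), (1.1,)]
weights = nn_weights(source_sample, target_sample)
print(weights)
```

In this toy example the source point at 5.0 receives weight zero because no target point lies closest to it. The brute-force search costs one distance evaluation per pair of points; a spatial index would be the natural replacement for larger samples.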

Cited literature [22 references]

https://hal.archives-ouvertes.fr/hal-02968059
Contributor : Touboul Adrien
Submitted on : Friday, October 16, 2020 - 12:44:41 PM
Last modification on : Tuesday, October 20, 2020 - 3:28:44 AM

Files

Reweighting_samples_under_cova...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02968059, version 1
  • ARXIV : 2010.09267

Citation

Julien Reygner, Adrien Touboul. Reweighting samples under covariate shift using a Wasserstein distance criterion. 2020. ⟨hal-02968059⟩