
Conditional Variance Penalties and Domain Shift Robustness

C. Heinze-Deml and N. Meinshausen (2017). arXiv:1710.11469.

Abstract

When training a deep neural network for image classification, one can broadly distinguish between two types of latent features of images that will drive the classification. We can divide latent features into (i) "core" or "conditionally invariant" features $X^{\text{core}}$ whose distribution $X^{\text{core}} \vert Y$, conditional on the class $Y$, does not change substantially across domains and (ii) "style" features $X^{\text{style}}$ whose distribution $X^{\text{style}} \vert Y$ can change substantially across domains. Examples for style features include position, rotation, image quality or brightness but also more complex ones like hair color, image quality or posture for images of persons. Our goal is to minimize a loss that is robust under changes in the distribution of these style features. In contrast to previous work, we assume that the domain itself is not observed and hence a latent variable. We do assume that we can sometimes observe a typically discrete identifier or "$\mathrm{ID}$ variable". In some applications we know, for example, that two images show the same person, and $\mathrm{ID}$ then refers to the identity of the person. The proposed method requires only a small fraction of images to have $\mathrm{ID}$ information. We group observations if they share the same class and identifier $(Y,\mathrm{ID})=(y,\mathrm{id})$ and penalize the conditional variance of the prediction or the loss if we condition on $(Y,\mathrm{ID})$. Using a causal framework, this conditional variance regularization (CoRe) is shown to protect asymptotically against shifts in the distribution of the style variables. Empirically, we show that the CoRe penalty improves predictive accuracy substantially in settings where domain changes occur in terms of image quality, brightness and color while we also look at more complex changes such as changes in movement and posture.
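The grouping-and-penalty idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `lam` weight, the use of `None` to mark observations without $\mathrm{ID}$ information, and the choice to average the per-group variances are all assumptions made here for illustration. The sketch groups observations that share the same $(Y,\mathrm{ID})$ pair and adds the within-group variance of the predictions to an average loss.

```python
import numpy as np

def conditional_variance_penalty(preds, labels, ids):
    """Average within-group variance of predictions, grouping by (label, id).

    Hypothetical helper: observations whose id is None (no ID information)
    and groups with a single member contribute nothing to the penalty.
    """
    # One row per observation; scalar predictions become a single column.
    preds = np.asarray(preds, dtype=float).reshape(len(labels), -1)
    groups = {}
    for i, (y, identifier) in enumerate(zip(labels, ids)):
        if identifier is None:
            continue
        groups.setdefault((y, identifier), []).append(i)
    # Variance per output dimension within each group, summed over dimensions.
    variances = [
        preds[idx].var(axis=0).sum() for idx in groups.values() if len(idx) > 1
    ]
    return float(np.mean(variances)) if variances else 0.0

def penalized_objective(losses, preds, labels, ids, lam=1.0):
    """Mean loss plus lam times the conditional variance penalty (assumed form)."""
    return float(np.mean(losses)) + lam * conditional_variance_penalty(
        preds, labels, ids
    )
```

In this sketch only the observations with a known identifier enter the penalty, which mirrors the abstract's remark that only a small fraction of images needs $\mathrm{ID}$ information; penalizing the variance of the loss instead of the prediction would amount to passing the per-observation losses as `preds`.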

Description

[1710.11469] Conditional Variance Penalties and Domain Shift Robustness

Links and resources

BibTeX key: heinzedeml2017conditional
entry type: article
year: 2017
url: http://arxiv.org/abs/1710.11469
note: cite arxiv:1710.11469

Cite this publication

%0 Journal Article
%1 heinzedeml2017conditional
%A Heinze-Deml, Christina
%A Meinshausen, Nicolai
%D 2017
%K causal-analysis invariance
%T Conditional Variance Penalties and Domain Shift Robustness
%U http://arxiv.org/abs/1710.11469
%X When training a deep neural network for image classification, one can broadly distinguish between two types of latent features of images that will drive the classification. We can divide latent features into (i) "core" or "conditionally invariant" features $X^{\text{core}}$ whose distribution $X^{\text{core}} \vert Y$, conditional on the class $Y$, does not change substantially across domains and (ii) "style" features $X^{\text{style}}$ whose distribution $X^{\text{style}} \vert Y$ can change substantially across domains. Examples for style features include position, rotation, image quality or brightness but also more complex ones like hair color, image quality or posture for images of persons. Our goal is to minimize a loss that is robust under changes in the distribution of these style features. In contrast to previous work, we assume that the domain itself is not observed and hence a latent variable. We do assume that we can sometimes observe a typically discrete identifier or "$\mathrm{ID}$ variable". In some applications we know, for example, that two images show the same person, and $\mathrm{ID}$ then refers to the identity of the person. The proposed method requires only a small fraction of images to have $\mathrm{ID}$ information. We group observations if they share the same class and identifier $(Y,\mathrm{ID})=(y,\mathrm{id})$ and penalize the conditional variance of the prediction or the loss if we condition on $(Y,\mathrm{ID})$. Using a causal framework, this conditional variance regularization (CoRe) is shown to protect asymptotically against shifts in the distribution of the style variables. Empirically, we show that the CoRe penalty improves predictive accuracy substantially in settings where domain changes occur in terms of image quality, brightness and color while we also look at more complex changes such as changes in movement and posture.

@article{heinzedeml2017conditional,
  abstract = {When training a deep neural network for image classification, one can broadly distinguish between two types of latent features of images that will drive the classification. We can divide latent features into (i) "core" or "conditionally invariant" features $X^{\text{core}}$ whose distribution $X^{\text{core}} \vert Y$, conditional on the class $Y$, does not change substantially across domains and (ii) "style" features $X^{\text{style}}$ whose distribution $X^{\text{style}} \vert Y$ can change substantially across domains. Examples for style features include position, rotation, image quality or brightness but also more complex ones like hair color, image quality or posture for images of persons. Our goal is to minimize a loss that is robust under changes in the distribution of these style features. In contrast to previous work, we assume that the domain itself is not observed and hence a latent variable. We do assume that we can sometimes observe a typically discrete identifier or "$\mathrm{ID}$ variable". In some applications we know, for example, that two images show the same person, and $\mathrm{ID}$ then refers to the identity of the person. The proposed method requires only a small fraction of images to have $\mathrm{ID}$ information. We group observations if they share the same class and identifier $(Y,\mathrm{ID})=(y,\mathrm{id})$ and penalize the conditional variance of the prediction or the loss if we condition on $(Y,\mathrm{ID})$. Using a causal framework, this conditional variance regularization (CoRe) is shown to protect asymptotically against shifts in the distribution of the style variables. Empirically, we show that the CoRe penalty improves predictive accuracy substantially in settings where domain changes occur in terms of image quality, brightness and color while we also look at more complex changes such as changes in movement and posture.},
  added-at = {2019-08-06T11:38:17.000+0200},
  author = {Heinze-Deml, Christina and Meinshausen, Nicolai},
  biburl = {https://www.bibsonomy.org/bibtex/2d94122ae577cd0b5d01bf81a33c832c0/kirk86},
  description = {[1710.11469] Conditional Variance Penalties and Domain Shift Robustness},
  interhash = {e9b2bfe942cba0f043a3da3ae512b095},
  intrahash = {d94122ae577cd0b5d01bf81a33c832c0},
  keywords = {causal-analysis invariance},
  note = {cite arxiv:1710.11469},
  timestamp = {2019-08-06T11:38:17.000+0200},
  title = {Conditional Variance Penalties and Domain Shift Robustness},
  url = {http://arxiv.org/abs/1710.11469},
  year = 2017
}

BibSonomy
