Shortcut detection with variational autoencoders
For real-world applications of machine learning (ML), it is essential that models make predictions based on well-generalizing features rather than spurious correlations in the data. The identification of such spurious correlations, also known as shortcuts, is a challenging problem and has so far been scarcely addressed. In this work, we present a novel approach to detect shortcuts in image and audio datasets by leveraging variational autoencoders (VAEs). The disentanglement of features in the latent space of VAEs allows us to discover feature-target correlations in datasets and semi-automatically evaluate them for ML shortcuts. We demonstrate the applicability of our method on several real-world datasets and identify shortcuts that have not been discovered before.
Shortcut detection with variational autoencoders
Accepted at the ICML 2023 Workshop on Spurious Correlations, Invariance and Stability
Authors: | Nicolas M. Mueller, Simon Roschmann, Shahbaz Khan, Philip Sperl, and Konstantin Boettinger |
Year/month: | 2023/2 |
Booktitle: | Accepted at the ICML 2023 Workshop on Spurious Correlations, Invariance and Stability |
Fulltext: | click here |
Abstract |
|
For real-world applications of machine learning (ML), it is essential that models make predictions based on well-generalizing features rather than spurious correlations in the data. The identification of such spurious correlations, also known as shortcuts, is a challenging problem and has so far been scarcely addressed. In this work, we present a novel approach to detect shortcuts in image and audio datasets by leveraging variational autoencoders (VAEs). The disentanglement of features in the latent space of VAEs allows us to discover feature-target correlations in datasets and semi-automatically evaluate them for ML shortcuts. We demonstrate the applicability of our method on several real-world datasets and identify shortcuts that have not been discovered before. |
Bibtex:
@inproceedings {author = { Nicolas M. Mueller and Simon Roschmann and Shahbaz Khan and Philip Sperl and Konstantin Boettinger},
title = { Shortcut detection with variational autoencoders },
year = { 2023 },
month = { Febuary },
booktitle = { Accepted at the ICML 2023 Workshop on Spurious Correlations, Invariance and Stability },
url = { https://doi.org/10.48550/arXiv.2302.04246 },
}