Invertible Residual Networks

2019-05-18

Jens Behrmann, Will Grathwohl, Ricky T. Q. Chen, David Duvenaud, Jörn-Henrik Jacobsen

arXiv_AI

arXiv_AI Classification

Abstract
Abstract (translated by Google)
URL
PDF

Abstract

We show that standard ResNet architectures can be made invertible, allowing the same model to be used for classification, density estimation, and generation. Typically, enforcing invertibility requires partitioning dimensions or restricting network architectures. In contrast, our approach only requires adding a simple normalization step during training, already available in standard frameworks. Invertible ResNets define a generative model which can be trained by maximum likelihood on unlabeled data. To compute likelihoods, we introduce a tractable approximation to the Jacobian log-determinant of a residual block. Our empirical evaluation shows that invertible ResNets perform competitively with both state-of-the-art image classifiers and flow-based generative models, something that has not been previously achieved with a single architecture.

Abstract (translated by Google)

URL

http://arxiv.org/abs/1811.00995

PDF

http://arxiv.org/pdf/1811.00995

Invertible Residual Networks

Abstract

Abstract (translated by Google)

URL

PDF

Similar Posts

Comments