Learning actions from data using invertible neural networks
Claudia Merger in joint work with Alexandre René, Kirsten Fischer and Peter Bouss
supervised by Moritz Helias and Carsten Honerkamp
in collaboration with David Dahmen and Christian Keup
Invertible neural networks on unsupervised learning tasks
data
INN
Invertible neural networks on unsupervised learning tasks
choose output
data distribution
data
INN
Invertible neural networks on unsupervised learning tasks
data
INN
sample
Gaussian
What has the INN learned?
Invertible neural networks on unsupervised learning tasks
Examples
- NICE (Dinh et. al., 2015 )
- RealNVP (Dinh et. al., 2017)
- GLOW (Kingma et. al. , 2018)
- https://openai.com/blog/glow/
data
INN
sample
INN must learn a approximate representation of data distribution.
How can we:
- extract the representation?
- use it to understand how the network learns?
Extract the representation
Extracting representation
- Simplify architecture
- Construct layer transform such that action stays polynomial
- Write down transform rules for coefficients
Application
Application
Application
Order in polynomial increases exponentially.
Coefficients become large.
Order in polynomial increases exponentially.
Coefficients become large.
Strategy:
- Store higher rank coefficients implicitly
- Truncate polynomial at specific order and include normalizing diagonal term
Order in polynomial increases exponentially.
Coefficients become large.
Strategy:
- Store higher rank coefficients implicitly
- Truncate polynomial at specific order and include normalizing diagonal term
Affine coupling must also be able to decrease the order.
Affine coupling must also be able to decrease the order.
2nd order action
4th order action
sign change
Summary and outlook
- Extract representation in the form of polynomials
- Test SimpleFLOW architecture on simple tasks
- Implement action mapping via Coefficient transforms
Further investigations:
- Explore lower order actions, using example systems from stat. physics
- Trunctaion scheme
- How do higher order terms contribute to lower orders? (Train with and without bias )
Learning actions from data using invertible neural networks
By merger
Learning actions from data using invertible neural networks
- 72