Generative Models in Astrophysics
Carol Cuesta-Lazaro
IAIFI Fellow - Yale Data Science Seminar
Initial Conditions of the Universe
Laws of gravity
3-D distribution of galaxies
Which are the ICs of OUR Universe?
Primordial non-Gaussianity?
Probe Inflation
Galaxy formation
3-D distribution of dark matter
Is GR modified on large scales?
How do galaxies form?
Neutrino mass hierarchy?
Dall-E 3
A 2D animation of a folk music band composed of anthropomorphic autumn leaves, each playing traditional bluegrass instruments, amidst a rustic forest setting dappled with the soft light of a harvest moon
"Sparks of Artificial General Intelligence: Early experiments with GPT-4" Bubeck et al
Produce Javascript code that creates a random graphical image that looks like a painting of Kandinsky
Draw a unicorn in TikZ
ChatGPT
Emulation
Model uncertainties
Likelihood estimation
Complex priors
lensing
arxiv:2206.14820
Data
A parametric PDF
Maximise the likelihood
(or something similar)
Explicit Density
Implicit Density
Tractable Density
Approximate Density
Normalising flows
Variational Autoencoders
Diffusion models
Generative Adversarial Networks
The zoo of generative models
The backbone of vision generative models
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
A person half Yoda half Gandalf
Diffusion Models
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
Cosmology
Diffusion on point clouds
"Diffusion generative modeling for galaxy surveys: emulating clustering for inference at the field level" Carolina Cuesta-Lazaro, Siddarth Mishra-Sharma
Modelling galaxy surveys
Halo Mass Function
Velocity
Mean pairwise velocity
Node features
Edge features
edge embedding
node embedding
Input
noisy halo properties
Output
noise prediction
Tight constraints with only 5000 positions!
Prior loss
Diffusion loss
Reconstruction loss
Be a true Bayesian: Always maximise the likelihood
arxiv:2107.00630
arxiv:2303.00848
Maximum Likelihood = Denoising
(Probabilistic) reconstruction of Dark Matter
"Probabilistic Reconstruction of Dark Matter fields from galaxies using diffusion models" Victoria Ono, Core Francisco Park, Nayantara Mudur, Yueying Ni, Carolina Cuesta-Lazaro (in prep)
Hybrid hydro simulators?
Hybrid simulators
Nbody
Slow
Non-differentiable
Particle mesh
Accurate
Fast
Differentiable
Missing small scales
Nbodyify
Fast
Differentiable
Accurate
"Nbodyify: adaptive mesh corrections for PM simulations" Carolina Cuesta-Lazaro, Chirag Modi (in prep)
Gravitational evolution ODE
Particle-mesh
Hybrid Simulator
Low Resolution
CNN learned features
Emulation
Likelihood estimation
Model uncertainties
Hybrid simulators?
Evidence Lower Bound
Distance to true posterior
Find
1. ELBO is a lower bound of the evidence
2. Maximising ELBO = Minimising KL
Maximise ELBO to maximise ev/likelihood
Maximise ELBO to approximate true posterior
deck
By carol cuesta
deck
- 268