MLizing cosmology
florpi
The What? How? And Why?
And really, Why?
https://florpi.github.io/
IAIFI Fellow
Carol Cuesta-Lazaro
Initial Conditions of the Universe
Laws of gravity
3-D distribution of galaxies
Which are the ICs of OUR Universe?
Primordial non-Gaussianity?
Probe Inflation
Galaxy formation
3-D distribution of dark matter
Is GR modified on large scales?
How do galaxies form?
Neutrino mass hierarchy?
The Why: The limits of analytical models
But... Access to tones of data!
simulated data
Complex non-Gaussian distributions
Non-linear forward models
(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)
A 2D animation of a folk music band composed of anthropomorphic autumn leaves, each playing traditional bluegrass instruments, amidst a rustic forest setting dappled with the soft light of a harvest moon
ML has solved high-dimensional inference
#1 Field-level likelihoods for galaxy surveys
Siddharth Mishra-Sharma
#2 Probabilistic reconstruction of the cosmic web
Core Park
Nayantara Mudur
Victoria Ono
Yueying Ni
#3 Faster, invertible and more accurate simulators
Sarah Jeffreson
Chirag Modi
Generative Models 101
Samples
Parametric PDF
Maximize the likelihood of the training samples
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
A person half Yoda half Gandalf
Diffusion generative models
Score
#1 Modelling galaxy surveys
"A point cloud approach to generative modeling for galaxy surveys at the field level"
arXiv:2311.17141
Carolina Cuesta-Lazaro and Siddharth Mishra-Sharma
2PCF
Mean pairwise
velocity
kNN
Emulating cosmic variance
2PCF
kNN
Diffusion models approximate the likelihood
arxiv:2107.00630
arxiv:2208.11970
Maximum Likelihood = Denoising
Tight constraints with only 5000 positions!
"Probabilistic Reconstruction of Dark Matter fields from galaxies using diffusion models"
arXiv:2311.08558
Core Francisco Park, Victoria Ono, Nayantara Mudur, Yueying Ni, Carolina Cuesta-Lazaro
#2 Solving inverse problems:
Probabilistic reconstruction of the cosmic web
1 to Many:
#3 AI Powered simulators
Fast and differentiable N-body sims
Inverting dynamics
More accurate hydro sims: Resolving star formation rates
~10-100 pc
Molecular clouds
where stars form
Hydro sims:
A matrioska of scales
~10-50 kpc
Galaxies
where clouds form
TNG50 ~50 Mpc
Cosmic web
(where galaxies form)
MXXL ~ 4 Gpc
Adapted from: "Learning the learning the Universe" by Jake Bennet
Current subgrid models of the ISM
High Res Sim
Springel and Hernquist 03
Learning subgrid models from high resolution isolated galaxies
Gas Surface Density
SFR Surface Density
Hybrid simulators
Nbody
Slow
Non-differentiable
Particle mesh
Accurate
Fast
Differentiable
Missing small scales
Nbodyify
Fast
Differentiable
Accurate
"Nbodyify: adaptive mesh corrections for PM simulations"
Carolina Cuesta-Lazaro and Chirag Modi (in prep)
Gravitational evolution ODE
Particle-mesh
Hybrid Simulator
#2 Probabilistic reconstruction of the cosmic web
#1 Field-level likelihoods for galaxy surveys
#3 Faster, invertible and more accurate simulators
cuestalz@mit.edu