Will ML shape the future of cosmology?
A journey through obstacles and potential approaches to overcoming them
Carolina Cuesta-Lazaro
IAIFI Fellow (MIT/CfA)
DESI: Dark Energy Spectroscopic Instrument
~35 Million spectra!
(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)
(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)
Neutrino mass hierarchy
Primordial Non-gaussianity
Galaxy formation
Large scale modifications of gravity
using growth to detect the existence of fifth forces
LSS might provide the most accurate measurement
to probe the physics of inflation (single/multi field, particle content)
not only a nuisance to margnilize over!
1
Observe galaxies
4
Pick your favourite analytical likelihood (Gaussian!)
5
Compute ~1Million times to get posterior
Constrain cosmology 101
Missing Information!
Perturbation Theory inaccurate / hard to compute
Is it always true?
3
Work on your analytical theory
2
Count pairs as a function of distance
ML for Large Scale Structure:
A wish list
Generative models
Learn p(x)
Sample simulations with different parameter values quickly
3
Evaluate their likelihood the field level
1
Do not make assumptions on the likelihood's form
2
Siddharth Mishra-Sharma
Image Credit: https://lilianweng.github.io/posts/2021-07-11-diffusion-models/
Latent Generative Models: Normalising flows
(Image Credit: Phillip Lippe)
Should we be using CNNs?
Galaxy positions
+ Magnitudes, velocities ...
Dark matter density field
(Image Credit: SIMBIG)
Diffusion Models
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
Neural network
Equivalent to noise prediction!
Diffusion on sets
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
Graph Neural Network
+ Galaxy formation
+ Observational systematics (Cut-sky, Fiber collisions)
+ Lightcone, Redshift Space Distortions....
Forward Model
N-body simulations
Observations
SIMBIG arXiv:2211.00723
Adversarial examples
We can simulate the observable Universe, we just need hydrodynamical simulations
What are subresolution models?
Super massive black hole seeding in dark matter halos
BH feedback impacts galactic scales
Black holes can also growth through mergers
Effective models of astrophysical processes needed due to limited numerical resolutions or limited physical models
they can even teleport!
We can't model galaxy formation, how do we make our models robust?
Robust Summarisation
Summariser (neural net)
Increase the evidence of the observations
Sims misspecified
What ML can do for cosmology
- ML to accelerate non-linear predictions and density estimation
- Can ML extract **all** the information that there is at the field-level in the non-linear regime?
- Compare data and simulations, point us to the missing pieces?
cuestalz@mit.edu
Introducing homogeneity and isotropy
Redshift Space Distortions break isotropy
(Credit: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials)
arXiv:2202.05282
Normalising flows as generative models
arXiv:2202.05282
- Fast theory models
- Do all these summaries combined get all info?
- Optimal information extraction
- Density estimation
- Improving current simulations?
- How can we deal with model mispecification?
ML to the rescue for
Invariance vs Equivariance
Invariance
Scalar interactions
Equivariance
What can we do with vectors?
Tensor products
Mario Geiger
Siddharth Mishra-Sharma
Making homogeneous and isotropic universes
Invariant
to rotations and translations
Equivariant
Invariant
Invariant
arXiv:2107.00630
Gaussian,
fully known
Also Gaussian,
but learned mean
Copy of Copy of deck
By carol cuesta
Copy of Copy of deck
- 296