Will ML shape the future of cosmology?
A journey through obstacles and potential approaches to overcoming them
Carolina Cuesta-Lazaro
IAIFI Fellow (MIT/CfA)
The golden days of Cosmology:
A five parameter Universe
Initial Conditions
(Inflation)
Dynamics
Dark energy
Dark matter
Ordinary matter
Amplitude initial density field
Scale dependence
t = 400,000 years
DESI: Dark Energy Spectroscopic Instrument
~35 Million spectra!
(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)
(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)
Neutrino mass hierarchy
Primordial Non-gaussianity
Galaxy formation
Large scale modifications of gravity
using growth to detect the existence of fifth forces
LSS might provide the most accurate measurement
to probe the physics of inflation (single/multi field, particle content)
not only a nuisance to margnilize over!
1
Observe galaxies
4
Pick your favourite analytical likelihood (Gaussian!)
5
Compute ~1Million times to get posterior
Constrain cosmology 101
Missing Information!
Perturbation Theory inaccurate / hard to compute
Is it always true?
3
Work on your analytical theory
2
Count pairs as a function of distance
Enrique Paillas
Waterloo
arXiv:2209.04310
How much information do we lose?
Bispectrum
Wavelet Scattering Transform
arXiv:1909.11107
arXiv:2204.13717
Marked Power Spectrum
arXiv:2206.01709
ML for Large Scale Structure:
A wish list
Generative models
Learn p(x)
Sample simulations with different parameter values quickly
1
Evaluate their likelihood the field level
2
Do not make assumptions on the likelihood's form
3
Latent Generative Models: Normalising flows
(Image Credit: Phillip Lippe)
Diffusion Models
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
Siddharth Mishra-Sharma
Should we be using CNNs?
Galaxy positions
+ Magnitudes, velocities ...
Dark matter density field
(Image Credit: SIMBIG)
Diffusion on sets
Reverse diffusion: Denoise previous step
Forward diffusion: Add Gaussian noise (fixed)
Set
Set
Neural network
Credit: Siddharth Mishra-Sharma
+ Galaxy formation
+ Observational systematics (Cut-sky, Fiber collisions)
+ Lightcone, Redshift Space Distortions....
Forward Model
N-body simulations
Observations
SIMBIG arXiv:2211.00723
We can simulate the observable Universe, we just need hydrodynamical simulations
What are subresolution models?
Super massive black hole seeding in dark matter halos
BH feedback impacts galactic scales
Black holes can also growth through mergers
Effective models of astrophysical processes needed due to limited numerical resolutions or limited physical models
they can even teleport!
We can't model galaxy formation, how do we make our models robust?
Robust Summarisation
Summariser (neural net)
Increase the evidence of the observations
Sims misspecified
What ML can do for cosmology
- ML to accelerate non-linear predictions and density estimation
- Can ML extract **all** the information that there is at the field-level in the non-linear regime?
- Compare data and simulations, point us to the missing pieces?
cuestalz@mit.edu
Graph Neural Networks in a nutshell
edge embedding
node embedding
Introducing homogeneity and isotropy
Redshift Space Distortions break isotropy
(Credit: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials)
arXiv:2202.05282
Normalising flows as generative models
arXiv:2202.05282
- Fast theory models
- Do all these summaries combined get all info?
- Optimal information extraction
- Density estimation
- Improving current simulations?
- How can we deal with model mispecification?
ML to the rescue for
arXiv:2107.00630
Gaussian,
fully known
Also Gaussian,
but learned mean
Invariance vs Equivariance
Invariance
Scalar interactions
Equivariance
What can we do with vectors?
Tensor products
Mario Geiger
Siddharth Mishra-Sharma
Making homogeneous and isotropic universes
Invariant
to rotations and translations
Equivariant
Invariant
Invariant
Copy of deck
By carol cuesta
Copy of deck
- 285