Will ML shape the future of cosmology?

A journey through obstacles and potential approaches to overcoming them

Carolina Cuesta-Lazaro

IAIFI Fellow

The golden days of Cosmology:

A five parameter Universe

Dynamics

\Omega_m: Dark matter

\Omega_b: Ordinary matter

\Omega_\Lambda: Dark energy

Initial Conditions (Inflation)

A_s: Amplitude of the initial density field

n_s: Scale dependence
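To make the roles of A_s and n_s concrete, here is a minimal sketch of the usual power-law primordial spectrum, A_s (k / k_pivot)^(n_s - 1). The function name, the pivot scale of 0.05 Mpc^-1, and the default parameter values are illustrative assumptions, not numbers taken from these slides.

```python
import numpy as np

def primordial_power(k, A_s=2.1e-9, n_s=0.965, k_pivot=0.05):
    """Dimensionless primordial power spectrum A_s * (k / k_pivot)^(n_s - 1).

    k and k_pivot are in 1/Mpc. The default values are illustrative assumptions.
    """
    k = np.asarray(k, dtype=float)
    return A_s * (k / k_pivot) ** (n_s - 1.0)

# A_s sets the overall amplitude, n_s the tilt with scale.
k = np.array([0.005, 0.05, 0.5])
print(primordial_power(k))
```

With n_s below 1 the spectrum has slightly less power on small scales (large k) than on large scales.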

t = 400,000 years

DESI: Dark Energy Spectroscopic Instrument

~35 Million spectra!

(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)

(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)

Neutrino mass hierarchy: LSS might provide the most accurate measurement

Primordial non-Gaussianity: to probe the physics of inflation (single/multi field, particle content)

Galaxy formation: not only a nuisance to marginalize over!

Large-scale modifications of gravity: using growth to detect the existence of fifth forces

(\vec{\theta}_i, z_i)

z_i = z_{\mathrm{Cosmological}} + z_{\mathrm{Doppler}}

\chi(z) = \int_0^z \frac{dz'}{H(z')} + \frac{v_{\mathrm{pec}}}{aH(a)}

\chi_i
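The mapping from an observed redshift to a distance can be sketched numerically. This is a minimal example assuming a flat LCDM background with radiation neglected; the function names and the values of H0 and omega_m are hypothetical choices for illustration. The slide sets c = 1, while the code restores c so that distances come out in Mpc.

```python
import numpy as np

C_KM_S = 299792.458  # speed of light in km/s

def hubble(z, H0=70.0, omega_m=0.3):
    """H(z) in km/s/Mpc for a flat LCDM model (radiation neglected)."""
    return H0 * np.sqrt(omega_m * (1.0 + z) ** 3 + (1.0 - omega_m))

def comoving_distance(z, n_steps=2048, **cosmo):
    """chi(z) = c * int_0^z dz' / H(z') in Mpc, using the trapezoid rule."""
    zs = np.linspace(0.0, z, n_steps)
    integrand = C_KM_S / hubble(zs, **cosmo)
    return np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(zs))

def redshift_space_distance(z, v_pec, **cosmo):
    """chi(z) + v_pec / (a H(a)), with v_pec the line-of-sight velocity in km/s."""
    a = 1.0 / (1.0 + z)
    return comoving_distance(z, **cosmo) + v_pec / (a * hubble(z, **cosmo))

# A galaxy at cosmological redshift 0.5 moving away from us at 300 km/s
print(comoving_distance(0.5), redshift_space_distance(0.5, 300.0))
```

The second term is the redshift-space distortion: a peculiar velocity of a few hundred km/s shifts the inferred position by a few Mpc along the line of sight.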

Constrain cosmology 101

Observe galaxies

Count pairs as a function of distance (Missing information!)

Work on your analytical theory (Perturbation theory inaccurate / hard to compute)

Pick your favourite analytical likelihood (Gaussian!) (Is it always true?)

Evaluate it ~1 million times to get the posterior
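Two of the steps in this recipe are easy to sketch: counting pairs as a function of separation, and evaluating a Gaussian likelihood. The example below is a toy under stated assumptions: a brute-force O(N^2) pair count on a random catalogue (real analyses use tree-based pair counters and survey randoms), and a Gaussian likelihood with a parameter-independent covariance. All function names are hypothetical.

```python
import numpy as np

def pair_counts(positions, r_edges):
    """Count unique pairs of points in bins of separation (brute force, O(N^2))."""
    diff = positions[:, None, :] - positions[None, :, :]
    dist = np.sqrt(np.sum(diff ** 2, axis=-1))
    i, j = np.triu_indices(len(positions), k=1)  # visit each pair once
    counts, _ = np.histogram(dist[i, j], bins=r_edges)
    return counts

def gaussian_log_likelihood(data, model, covariance):
    """log L = -0.5 (d - m)^T C^{-1} (d - m), up to an additive constant."""
    residual = data - model
    return -0.5 * residual @ np.linalg.solve(covariance, residual)

rng = np.random.default_rng(0)
galaxies = rng.uniform(0.0, 100.0, size=(200, 3))  # toy catalogue in a box
edges = np.linspace(0.0, 50.0, 11)
dd = pair_counts(galaxies, edges)
print(dd)
```

A sampler would call `gaussian_log_likelihood` once per proposed set of cosmological parameters, which is where the large number of theory evaluations comes from.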

\bar{\xi}(R_s)

R_s

Enrique Paillas

Waterloo

arXiv:2209.04310

How much information do we lose?

Bispectrum

Wavelet Scattering Transform

arXiv:1909.11107

arXiv:2204.13717

Siddharth Mishra-Sharma

Should we be using CNNs?

Galaxy positions

+ Magnitudes, velocities ...

Dark matter density field

z_T

z_{0}

z_{1}

Diffusion on sets

Reverse diffusion: Denoise your data

Forward diffusion: Add noise (fixed)

z_{2}

p_\theta(z_{t-1}|z_t)

q(z_t|z_{t-1})

p_\theta(z_{t-1}|z_t) = \mathcal{N}(z_{t-1}|\mu_\theta(z_t, t), \sigma_t^2 \mathcal{I})
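The two transition kernels can be sketched in a few lines. The forward kernel below uses the common variance-preserving choice q(z_t|z_{t-1}) = N(sqrt(1 - beta_t) z_{t-1}, beta_t I), which is an assumption since the slides only say the forward noise is fixed. `placeholder_mu` is a hypothetical stand-in for the trained network \mu_\theta, and the linear noise schedule is likewise illustrative.

```python
import numpy as np

def forward_step(z_prev, beta_t, rng):
    """Sample q(z_t | z_{t-1}) = N(sqrt(1 - beta_t) z_{t-1}, beta_t I)."""
    noise = rng.standard_normal(z_prev.shape)
    return np.sqrt(1.0 - beta_t) * z_prev + np.sqrt(beta_t) * noise

def reverse_step(z_t, t, mu_theta, sigma_t, rng):
    """Sample p_theta(z_{t-1} | z_t) = N(mu_theta(z_t, t), sigma_t^2 I)."""
    noise = rng.standard_normal(z_t.shape)
    return mu_theta(z_t, t) + sigma_t * noise

def placeholder_mu(z_t, t):
    """Stand-in for the trained network: it only shrinks its input."""
    return 0.9 * z_t

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)      # assumed noise schedule
z = rng.uniform(0.0, 1.0, size=(64, 3))    # a toy set of 64 points in 3D
for beta in betas:                          # forward diffusion: data -> noise
    z = forward_step(z, beta, rng)
z_prev = reverse_step(z, len(betas) - 1, placeholder_mu, np.sqrt(betas[-1]), rng)
```

After the full forward chain the points are close to standard Gaussian noise; generation runs `reverse_step` from t = T down to t = 0 with the learned mean in place of the placeholder.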

Set

\mu_\theta

Neural network

z_{t-1}

z_{t}

Credit: Siddharth Mishra-Sharma

arXiv:2107.00630

- \log p(x) \leq -\mathrm{VLB}(x) = \purple{\mathrm{Prior Loss}} + \gray{\mathrm{Recon Loss}} + \blue{\mathrm{Diffusion Loss}}

\purple{\mathrm{Prior Loss} = D_\mathrm{KL} \left[ q(z_T|x) || p(z_T) \right]}

\gray{\mathrm{Recon Loss} = \mathbb{E}_{q(z_0|x)} [ -\log p(x|z_0)]}

\blue{\mathrm{Diffusion Loss} = \sum_{t} D_\mathrm{KL} \left[ q(z_{t-1}|z_{t},x) || p_\theta (z_{t-1}|z_{t}) \right]}

q(z_{t-1}|z_{t},x): Gaussian, fully known

p_\theta(z_{t-1}|z_{t}): also Gaussian, but learned mean
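Because both distributions in each diffusion-loss term are Gaussian, the KL divergence has a closed form. The sketch below assumes isotropic covariances; the function name is hypothetical. When the two standard deviations are equal, the expression reduces to the squared distance between the means divided by 2 sigma^2, so each term becomes a weighted mean-squared error on the learned mean.

```python
import numpy as np

def kl_isotropic_gaussians(mu_q, sigma_q, mu_p, sigma_p):
    """KL[ N(mu_q, sigma_q^2 I) || N(mu_p, sigma_p^2 I) ] in closed form."""
    d = mu_q.size
    sq_dist = np.sum((mu_q - mu_p) ** 2)
    return (
        d * np.log(sigma_p / sigma_q)
        + (d * sigma_q ** 2 + sq_dist) / (2.0 * sigma_p ** 2)
        - 0.5 * d
    )

mu_q = np.array([1.0, 2.0, 3.0])     # mean of the known posterior q
mu_p = np.array([0.0, 2.0, 3.0])     # mean predicted by the network
print(kl_isotropic_gaussians(mu_q, 0.5, mu_p, 0.5))
```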

Mario Geiger

Siddharth Mishra-Sharma

Making homogeneous and isotropic universes

p(z_T): invariant to rotations and translations

\mu_\theta(z_t,t): equivariant

p(x): invariant
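A quick numerical check of the two symmetry properties. The update rule in `equivariant_mean` is a hypothetical toy (each point is displaced along relative vectors, weighted by a function of distance), not the network used in this work; it only illustrates what equivariance of \mu_\theta means in practice.

```python
import numpy as np

def pairwise_distances(z):
    """Invariant features: distances between all pairs of points."""
    return np.linalg.norm(z[:, None, :] - z[None, :, :], axis=-1)

def equivariant_mean(z, scale=0.1):
    """Toy mu(z): shift each point along relative vectors with invariant weights."""
    rel = z[:, None, :] - z[None, :, :]                   # (N, N, 3) relative vectors
    weights = np.exp(-pairwise_distances(z))[..., None]   # depend on distances only
    return z + scale * np.sum(weights * rel, axis=1)

def random_rotation(rng):
    """Random proper rotation from the QR decomposition of a Gaussian matrix."""
    q, r = np.linalg.qr(rng.standard_normal((3, 3)))
    q = q * np.sign(np.diag(r))   # fix the sign ambiguity of each column
    if np.linalg.det(q) < 0:      # enforce det = +1
        q[:, 0] = -q[:, 0]
    return q

rng = np.random.default_rng(1)
z = rng.standard_normal((16, 3))
R, shift = random_rotation(rng), rng.standard_normal(3)
z_moved = z @ R.T + shift   # rotate and translate the whole point set

invariant_ok = np.allclose(pairwise_distances(z_moved), pairwise_distances(z))
equivariant_ok = np.allclose(equivariant_mean(z_moved), equivariant_mean(z) @ R.T + shift)
print(invariant_ok, equivariant_ok)
```

Distances do not change when the point set is moved (invariance), while the predicted mean moves along with the input (equivariance).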

+ Galaxy formation

+ Observational systematics (Cut-sky, Fiber collisions)

+ Lightcone, Redshift Space Distortions....

Forward Model

N-body simulations

Observations

SIMBIG arXiv:2211.00723

We can simulate the observable Universe; we just need hydrodynamical simulations

25 \, h^{-1}\mathrm{Mpc}

What are subresolution models?

10^{10} - 10^{11} M_\odot

Supermassive black hole seeding in dark matter halos

BH feedback impacts galactic scales

Black holes can also grow through mergers

Effective models of astrophysical processes are needed due to limited numerical resolution or limited physical models

they can even teleport!

We can't model galaxy formation, how do we make our models robust?

Robust Summarisation

\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims}) + \blue{\lambda p_\mathrm{sims}(S(x_\mathrm{obs}))}

Summariser (neural net)

(\theta_\mathrm{sims}, x_\mathrm{sims})

x_\mathrm{obs}

\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims})

Increase the evidence of the observations

Sims misspecified

What ML can do for cosmology

ML to accelerate non-linear predictions and density estimation

Can ML extract **all** the information that there is at the field-level in the non-linear regime?

Compare data and simulations, point us to the missing pieces?

cuestalz@mit.edu

Introducing homogeneity and isotropy

Redshift Space Distortions break isotropy

(Credit: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials)

Latent Generative Models: Normalising flows

x = f(z), \, z = f^{-1}(x)

p(\mathbf{x}) = p_z(f^{-1}(\mathbf{x})) \left\vert \det J(f^{-1}) \right\vert
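The change-of-variables formula can be checked on the simplest possible flow. This is a minimal sketch using a one-dimensional affine map x = scale * z + shift with a standard normal base density; real flows stack many learned invertible layers, and the function names here are hypothetical.

```python
import numpy as np

def standard_normal_logpdf(z):
    """log of the base density p_z(z) = N(0, 1)."""
    return -0.5 * (z ** 2 + np.log(2.0 * np.pi))

def affine_flow_logpdf(x, scale, shift):
    """log p(x) for x = f(z) = scale * z + shift with z ~ N(0, 1)."""
    z = (x - shift) / scale                    # z = f^{-1}(x)
    log_det_inverse = -np.log(np.abs(scale))   # log |det J(f^{-1})|
    return standard_normal_logpdf(z) + log_det_inverse

def affine_flow_sample(n, scale, shift, rng):
    """Draw samples by pushing base samples through f."""
    return scale * rng.standard_normal(n) + shift

rng = np.random.default_rng(0)
samples = affine_flow_sample(5, scale=2.0, shift=1.5, rng=rng)
print(affine_flow_logpdf(samples, scale=2.0, shift=1.5))
```

The same two operations, exact density evaluation and direct sampling, are what make flows usable both as likelihood or posterior estimators and as generative models.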

No assumptions on the likelihood (likelihoods rarely Gaussian!)

No expensive MCMC chains needed to estimate posterior

(Image Credit: Phillip Lippe)

arXiv:2202.05282

Normalising flows as generative models

arXiv:2202.05282

ML to the rescue for

- Fast theory models
- Optimal information extraction: do all these summaries combined get all the information?
- Density estimation
- Improving current simulations?
- How can we deal with model misspecification?

Invariance vs Equivariance

\theta

\phi

Invariance

Scalar interactions

Equivariance

What can we do with vectors?

Tensor products

v_i

v_j
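One way to see what tensor products offer beyond scalar interactions: the outer product of two vectors splits into a scalar, a vector, and a symmetric traceless matrix, each transforming in its own way under rotations. The sketch below is illustrative, with a hypothetical function name, and it considers proper rotations only (under reflections the cross product picks up an extra sign).

```python
import numpy as np

def tensor_product_parts(v_i, v_j):
    """Split the outer product of v_i and v_j into rotation-irreducible pieces."""
    outer = np.outer(v_i, v_j)
    scalar = np.trace(outer)                     # v_i . v_j, invariant
    vector = np.cross(v_i, v_j)                  # antisymmetric part, rotates as a vector
    sym = 0.5 * (outer + outer.T)
    traceless = sym - scalar / 3.0 * np.eye(3)   # symmetric traceless part
    return scalar, vector, traceless

angle = 0.7  # rotation about the z axis
R = np.array([
    [np.cos(angle), -np.sin(angle), 0.0],
    [np.sin(angle), np.cos(angle), 0.0],
    [0.0, 0.0, 1.0],
])
v_i, v_j = np.array([1.0, 2.0, 3.0]), np.array([-0.5, 0.4, 2.0])

s, v, t = tensor_product_parts(v_i, v_j)
s_rot, v_rot, t_rot = tensor_product_parts(R @ v_i, R @ v_j)
print(np.isclose(s_rot, s), np.allclose(v_rot, R @ v), np.allclose(t_rot, R @ t @ R.T))
```

A network restricted to scalar interactions only ever sees the first piece; equivariant architectures can also propagate the vector and higher-order pieces.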
