Will ML shape the future of cosmology?

 

A journey through obstacles and potential approaches to overcoming them

 

Carolina Cuesta-Lazaro

IAIFI Fellow (MIT/CfA)

DESI: Dark Energy Spectroscopic Instrument

~35 Million spectra!

(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)

(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)

Neutrino mass hierarchy: LSS might provide the most accurate measurement

Primordial Non-Gaussianity: to probe the physics of inflation (single/multi field, particle content)

Galaxy formation: not only a nuisance to marginalize over!

Large scale modifications of gravity: using growth to detect the existence of fifth forces

Constrain cosmology 101

1. Observe galaxies

2. Count pairs as a function of distance

3. Work on your analytical theory

4. Pick your favourite analytical likelihood (Gaussian!)

5. Compute ~1 million times to get the posterior

Missing information!

Perturbation theory inaccurate / hard to compute

Is it always true?
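The pair-counting step of this pipeline can be sketched in a few lines. This is a minimal illustration, assuming a small non-periodic box and a brute-force O(N^2) distance matrix; the function name `pair_counts`, the box size and the binning are hypothetical choices, and survey-scale catalogues would need tree-based counting and random catalogues to turn counts into a correlation function.

```python
import numpy as np

def pair_counts(positions, bin_edges):
    """Count unique pairs of points whose separation falls in each distance bin."""
    # All pairwise separation vectors, shape (N, N, 3)
    diff = positions[:, None, :] - positions[None, :, :]
    dist = np.linalg.norm(diff, axis=-1)
    # Keep each pair once: upper triangle, excluding self-pairs on the diagonal
    upper = np.triu_indices(len(positions), k=1)
    counts, _ = np.histogram(dist[upper], bins=bin_edges)
    return counts

# Hypothetical mock catalogue: 200 random points in a box of side 100 (arbitrary units)
rng = np.random.default_rng(0)
positions = rng.uniform(0.0, 100.0, size=(200, 3))
bin_edges = np.linspace(0.0, 50.0, 11)
counts = pair_counts(positions, bin_edges)

# Tiny hand-checkable case: three points on a line with separations 1, 2 and 3
line = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [3.0, 0.0, 0.0]])
line_counts = pair_counts(line, np.array([0.0, 1.5, 2.5, 3.5]))
```

Each pair is counted once, so the total over all bins can never exceed N(N-1)/2.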

ML for Large Scale Structure:

A wish list

Generative models: learn p(x)

1. Evaluate their likelihood at the field level

2. Do not make assumptions on the likelihood's form

3. Sample simulations with different parameter values quickly

Siddharth Mishra-Sharma

Image Credit: https://lilianweng.github.io/posts/2021-07-11-diffusion-models/

p(x) = \int dz \, p(x|z) \, p(z)

Latent Generative Models: Normalising flows

x = f(z), \, z = f^{-1}(x)
p(\mathbf{x}) = p_z(f^{-1}(\mathbf{x})) \left\vert \det J(f^{-1}) \right\vert
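The change-of-variables formula can be checked on the simplest possible flow. The sketch below assumes a one-dimensional affine map x = f(z) = shift + scale * z with a standard normal base density; the names `affine_flow_logprob`, `shift` and `log_scale` are illustrative, and real flows stack many learned invertible layers.

```python
import numpy as np

def standard_normal_logpdf(z):
    """Log density of the base distribution p_z = N(0, 1)."""
    return -0.5 * (z**2 + np.log(2.0 * np.pi))

def affine_flow_logprob(x, shift, log_scale):
    """log p(x) for x = f(z) = shift + exp(log_scale) * z via change of variables."""
    z = (x - shift) * np.exp(-log_scale)   # z = f^{-1}(x)
    log_det_inverse = -log_scale           # log |det J(f^{-1})| for a 1D affine map
    return standard_normal_logpdf(z) + log_det_inverse

x = np.array([-1.0, 0.5, 2.0])
shift, log_scale = 1.0, np.log(2.0)
flow_logp = affine_flow_logprob(x, shift, log_scale)

# The same density written directly as a Gaussian with mean 1 and standard deviation 2
sigma = 2.0
direct_logp = -0.5 * ((x - shift) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2.0 * np.pi)
```

Both expressions agree because an affine transformation of a Gaussian is again a Gaussian; the Jacobian term is what keeps the density normalised.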

(Image Credit: Phillip Lippe)

Should we be using CNNs?

Galaxy positions

+ Magnitudes, velocities ...

Dark matter density field

(Image Credit: SIMBIG)

Diffusion Models

(Figure: chain of latents z_{0}, z_{1}, z_{2}, ..., z_T)

Forward diffusion: Add Gaussian noise (fixed)

q(z_t|z_{t-1}) = \mathcal{N}(z_t; \sqrt{1 - \beta_t} z_{t-1}, \beta_t \mathbf{I})

Reverse diffusion: Denoise previous step

p_\theta(z_{t-1}|z_t) = \mathcal{N}(z_{t-1}; \mu_\theta(z_t, t), \sigma_t^2 \mathbf{I})
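The fixed forward process needs no training and can be simulated directly. The sketch below assumes a linear noise schedule with 1000 steps (a common but here purely illustrative choice) and starts every sample from the same value, so that the loss of memory of the starting point is easy to see; `forward_step` is a hypothetical helper name.

```python
import numpy as np

def forward_step(z_prev, beta_t, rng):
    """Draw z_t ~ N(sqrt(1 - beta_t) * z_prev, beta_t * I)."""
    noise = rng.standard_normal(z_prev.shape)
    return np.sqrt(1.0 - beta_t) * z_prev + np.sqrt(beta_t) * noise

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)   # assumed linear schedule, not from the talk
z = np.full(10_000, 3.0)                # 10,000 copies of the same starting value
for beta_t in betas:
    z = forward_step(z, beta_t, rng)

# Fraction of the original signal amplitude (squared) surviving after all steps
alpha_bar_T = np.prod(1.0 - betas)
final_mean, final_std = z.mean(), z.std()
```

After the last step the samples are close to a standard normal regardless of where they started, which is why sampling can begin from pure noise.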

Neural network

Equivalent to noise prediction!

\mu_\theta(z_t, t) = \frac{1}{\sqrt{1-\beta_t}} \left( z_t - \frac{\beta_t}{\sqrt{1-\bar{\alpha}_t}} \epsilon_\theta(z_t,t) \right)
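The equivalence with noise prediction can be verified numerically: if the network returned the exact noise, the mean above would coincide with the mean of the true Gaussian posterior over the previous step given the clean sample. The sketch assumes the standard relations \alpha_t = 1 - \beta_t and \bar{\alpha}_t = \prod_{s \le t} \alpha_s; the function names, the schedule and the variable `z_clean` (the noise-free sample) are illustrative.

```python
import numpy as np

def mean_from_noise(z_t, eps, beta_t, alpha_bar_t):
    """Reverse-step mean written in terms of the (predicted) noise eps."""
    return (z_t - beta_t / np.sqrt(1.0 - alpha_bar_t) * eps) / np.sqrt(1.0 - beta_t)

def mean_from_clean(z_t, z_clean, beta_t, alpha_bar_t, alpha_bar_prev):
    """Mean of the true posterior q(z_{t-1} | z_t, z_clean)."""
    alpha_t = 1.0 - beta_t
    coef_clean = np.sqrt(alpha_bar_prev) * beta_t / (1.0 - alpha_bar_t)
    coef_t = np.sqrt(alpha_t) * (1.0 - alpha_bar_prev) / (1.0 - alpha_bar_t)
    return coef_clean * z_clean + coef_t * z_t

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)   # assumed linear schedule
alpha_bars = np.cumprod(1.0 - betas)
t = 500                                  # arbitrary intermediate step

z_clean = rng.standard_normal(8)
eps = rng.standard_normal(8)
# Noisy sample at step t, built from the clean sample and the known noise
z_t = np.sqrt(alpha_bars[t]) * z_clean + np.sqrt(1.0 - alpha_bars[t]) * eps

mu_noise = mean_from_noise(z_t, eps, betas[t], alpha_bars[t])
mu_clean = mean_from_clean(z_t, z_clean, betas[t], alpha_bars[t], alpha_bars[t - 1])
```

The two means agree to floating-point precision, so training a network to predict the noise is the same as training it to predict the reverse-step mean.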
Diffusion on sets

(Figure: chain of latents z_{0}, z_{1}, z_{2}, ..., z_T)

Forward diffusion: Add Gaussian noise (fixed)

Reverse diffusion: Denoise previous step

Graph Neural Network
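A graph neural network suits sets of galaxies because its output does not depend on the order in which the points are listed. The sketch below is a single, untrained message-passing layer written in plain NumPy; it is not the architecture used in the talk, and the linking radius, feature sizes and weight names are hypothetical.

```python
import numpy as np

def message_passing_layer(features, positions, weight_self, weight_msg, radius):
    """One layer: each node sums its neighbours' features, then applies a nonlinearity."""
    dist = np.linalg.norm(positions[:, None, :] - positions[None, :, :], axis=-1)
    # Connect points closer than `radius`, excluding self-connections
    adjacency = (dist < radius) & ~np.eye(len(positions), dtype=bool)
    messages = adjacency.astype(float) @ features   # sum over neighbours
    return np.tanh(features @ weight_self + messages @ weight_msg)

rng = np.random.default_rng(0)
n_points, n_features = 6, 4
positions = rng.uniform(0.0, 10.0, size=(n_points, 3))
features = rng.standard_normal((n_points, n_features))
weight_self = rng.standard_normal((n_features, n_features))
weight_msg = rng.standard_normal((n_features, n_features))

out = message_passing_layer(features, positions, weight_self, weight_msg, radius=5.0)

# Relabel the points and run the layer again
perm = rng.permutation(n_points)
out_perm = message_passing_layer(
    features[perm], positions[perm], weight_self, weight_msg, radius=5.0
)
```

Shuffling the input points only shuffles the output rows in the same way (permutation equivariance), which is the property a denoiser acting on a set needs.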

+ Galaxy formation

+ Observational systematics (Cut-sky, Fiber collisions)

+ Lightcone, Redshift Space Distortions....

Forward Model

N-body simulations

Observations

 SIMBIG arXiv:2211.00723

Adversarial examples

We can simulate the observable Universe; we just need hydrodynamical simulations

25 \, h^{-1}\mathrm{Mpc}

What are subresolution models?

10^{10} - 10^{11} M_\odot

Supermassive black hole seeding in dark matter halos

BH feedback impacts galactic scales

Black holes can also grow through mergers

Effective models of astrophysical processes are needed because of limited numerical resolution or incomplete physical models

they can even teleport!

We can't model galaxy formation, how do we make our models robust?

Robust Summarisation

\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims}) + \blue{\lambda p_\mathrm{sims}(S(x_\mathrm{obs}))}
S: Summariser (neural net)

(\theta_\mathrm{sims}, x_\mathrm{sims})
x_\mathrm{obs}
\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims})

Increase the evidence of the observations

Sims misspecified

What ML can do for cosmology

  • ML to accelerate non-linear predictions and density estimation

 

  • Can ML extract **all** the information that there is at the field-level in the non-linear regime?

 

  • Compare data and simulations, point us to the missing pieces?

cuestalz@mit.edu

Introducing homogeneity and isotropy

Redshift Space Distortions break isotropy

(Credit: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials)

 

arXiv:2202.05282

 

Normalising flows as generative models

arXiv:2202.05282

 

  • Fast theory models
  • Do all these summaries combined get all info?
    • Optimal information extraction
  • Density estimation
  • Improving current simulations?
  • How can we deal with model misspecification?

 

ML to the rescue for

Invariance vs Equivariance

r
\theta
\phi

Invariance

Scalar interactions

r

Equivariance

What can we do with vectors? 

Tensor products

v_i
v_j
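The distinction can be made concrete with two vectors and a random rotation. In the sketch below, distances and dot products are invariant (unchanged by the transformation), while the cross product is equivariant (it rotates along with its inputs). The dot and cross products are the scalar and vector pieces of the tensor product of two vectors; the helper names are illustrative.

```python
import numpy as np

def random_rotation(rng):
    """Draw a proper rotation matrix (determinant +1) from a QR decomposition."""
    q, r = np.linalg.qr(rng.standard_normal((3, 3)))
    q = q * np.sign(np.diag(r))       # fix the sign ambiguity of the decomposition
    if np.linalg.det(q) < 0:
        q[:, 0] *= -1.0               # flip one axis to turn a reflection into a rotation
    return q

def pairwise_distances(x):
    return np.linalg.norm(x[:, None, :] - x[None, :, :], axis=-1)

rng = np.random.default_rng(1)
points = rng.standard_normal((5, 3))
v_i, v_j = rng.standard_normal(3), rng.standard_normal(3)
rotation = random_rotation(rng)
shift = rng.standard_normal(3)

# Rotate and translate the whole point set
moved = points @ rotation.T + shift

distances_invariant = np.allclose(pairwise_distances(moved), pairwise_distances(points))
dot_invariant = np.isclose((rotation @ v_i) @ (rotation @ v_j), v_i @ v_j)
cross_equivariant = np.allclose(
    np.cross(rotation @ v_i, rotation @ v_j), rotation @ np.cross(v_i, v_j)
)
```

A network built only from distances is invariant by construction; keeping vector-valued features such as the cross product lets it carry directional information while still transforming consistently.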

Mario Geiger

Siddharth Mishra-Sharma

Making homogeneous and isotropic universes

p(z_T): Invariant

\mu_\theta(z_t,t): Equivariant

p(x): Invariant to rotations and translations

(Figure: p evaluated on a galaxy configuration equals p evaluated on its rotated and translated copies.)
(\vec{\theta}_i, z_i)
z_i = z_{\mathrm{Cosmological}} + z_{\mathrm{Doppler}}
\chi(z) = \int_0^z \frac{dz'}{H(z')} + \frac{v_{\mathrm{pec}}}{aH(a)}
\chi_i
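The size of the line-of-sight shift is easy to estimate numerically. The sketch below assumes a flat LCDM background with H_0 = 70 km/s/Mpc and \Omega_m = 0.3 (illustrative values, not taken from the talk) and restores the factor of c that the formula above leaves implicit, so distances come out in Mpc.

```python
import numpy as np

C_KM_S = 299792.458  # speed of light in km/s

def hubble(z, h0=70.0, omega_m=0.3):
    """H(z) in km/s/Mpc for an assumed flat LCDM cosmology."""
    return h0 * np.sqrt(omega_m * (1.0 + z) ** 3 + 1.0 - omega_m)

def comoving_distance(z, n_steps=2048):
    """chi(z) = c * int_0^z dz' / H(z') in Mpc, using the trapezoid rule."""
    zs = np.linspace(0.0, z, n_steps)
    integrand = C_KM_S / hubble(zs)
    dz = zs[1] - zs[0]
    return dz * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))

def redshift_space_distance(z, v_pec):
    """Distance inferred from the observed redshift, shifted by the peculiar velocity."""
    a = 1.0 / (1.0 + z)
    return comoving_distance(z) + v_pec / (a * hubble(z))

z_cosmological = 0.5
v_pec = 300.0  # line-of-sight peculiar velocity in km/s, a typical order of magnitude

chi_true = comoving_distance(z_cosmological)
chi_observed = redshift_space_distance(z_cosmological, v_pec)
shift = chi_observed - chi_true
```

For these numbers the galaxy is displaced by a few Mpc along the line of sight only, which is what singles out a preferred direction and breaks isotropy.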

arXiv:2107.00630

- \log p(x) \leq -\mathrm{VLB}(x) = \gray{\mathrm{Recon\ Loss}} + \blue{\mathrm{Diffusion\ Loss}}
\gray{\mathrm{Recon\ Loss} = \mathbb{E}_{q(z_0|x)} [ -\log p(x|z_0)]}
\blue{\mathrm{Diffusion\ Loss} = \sum_{t} D_\mathrm{KL} \left[ q(z_{t-1}|z_{t},x) \,\|\, p_\theta (z_{t-1}|z_{t}) \right]}

q(z_{t-1}|z_{t},x): Gaussian, fully known

p_\theta(z_{t-1}|z_{t}): Also Gaussian, but learned mean
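Because both distributions are Gaussian, each KL term has a closed form. The sketch below assumes the two share the same isotropic variance, so that only the means differ; in that case the KL divergence reduces to a squared distance between means, which is compared against the general formula for diagonal Gaussians. Function names and numbers are illustrative.

```python
import numpy as np

def kl_same_variance(mu_q, mu_p, sigma):
    """KL[N(mu_q, sigma^2 I) || N(mu_p, sigma^2 I)] = |mu_q - mu_p|^2 / (2 sigma^2)."""
    return np.sum((mu_q - mu_p) ** 2) / (2.0 * sigma**2)

def kl_diagonal_gaussians(mu_q, sigma_q, mu_p, sigma_p):
    """General KL divergence between two Gaussians with diagonal covariance."""
    return np.sum(
        np.log(sigma_p / sigma_q)
        + (sigma_q**2 + (mu_q - mu_p) ** 2) / (2.0 * sigma_p**2)
        - 0.5
    )

rng = np.random.default_rng(0)
mu_true = rng.standard_normal(16)                       # mean of the known posterior q
mu_learned = mu_true + 0.1 * rng.standard_normal(16)    # stand-in for the network output
sigma = 0.3

kl_short = kl_same_variance(mu_true, mu_learned, sigma)
kl_general = kl_diagonal_gaussians(
    mu_true, np.full(16, sigma), mu_learned, np.full(16, sigma)
)
kl_perfect = kl_same_variance(mu_true, mu_true, sigma)
```

Under this assumption the diffusion loss is a weighted mean-squared error on the predicted mean, and it vanishes when the learned mean matches the true one.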