Will ML shape the future of cosmology?

 

A journey through obstacles and potential approaches to overcoming them

 

Carolina Cuesta-Lazaro

IAIFI Fellow

The golden days of Cosmology:

A five parameter Universe

\Omega_m
\Omega_b
\Omega_\Lambda
A_s
n_s

Initial Conditions

(Inflation)

Dynamics

Dark energy

Dark matter 

Ordinary matter 

Amplitude initial density field

Scale dependence

t = 400,000 years

DESI: Dark Energy Spectroscopic Instrument

~35 Million spectra!

Text

(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)

(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)

Neutrino mass hierarchy 

Primordial Non-gaussianity 

Galaxy formation 

Large scale modifications of gravity 

using growth to detect the existence of fifth forces

LSS might provide the most accurate measurement

to probe the physics of inflation (single/multi field, particle content)

not only a nuisance to margnilize over!

(\vec{\theta}_i, z_i)
z_i = z_{\mathrm{Cosmological} }
+ z_{\mathrm{Doppler}}
\chi(z) = \int_0^z \frac{dz'}{H(z')}
+ \frac{v_{\mathrm{pec}}}{aH(a)}
\chi_i

1

Observe galaxies

4

Pick your favourite analytical likelihood (Gaussian!)

5

Compute ~1Million times to get posterior

Constrain cosmology 101

Missing Information!

Perturbation Theory inaccurate / hard to compute

Is it always true?

3

Work on your analytical theory

2

Count pairs as a function of distance

\bar{\xi}(R_s)
R_s
1
1
1
2
2
4
5
5
5
3

Enrique Paillas

Waterloo

arXiv:2209.04310

How much information do we lose?

Bispectrum

Wavelet Scattering Transform

arXiv:1909.11107

arXiv:2204.13717

Siddharth Mishra-Sharma

Should we be using CNNs?

Galaxy positions

+ Magnitudes, velocities ...

Dark matter density field

z_T
z_{0}
z_{1}

Diffusion on sets

Reverse diffusion: Denoise your data

Forward diffusion: Add noise (fixed)

z_{2}
p_\theta(z_{t-1}|z_t)
q(z_t|z_{t-1})
p_\theta(z_{t-1}|z_t) = \mathcal{N}(z_{t-1}|\mu_\theta(z_t, t), \sigma_t^2 \mathcal{I})

Set

Set

\mu_\theta

Neural network

z_{t-1}
z_{t}

Credit: Siddharth Mishra-Sharma

arXiv:2107.00630

- \log p(x) \leq -\mathrm{VLB}(x) =
\purple{\mathrm{Prior Loss}} + \gray{\mathrm{Recon Loss}} + \blue{\mathrm{Diffusion Loss}}
\purple{\mathrm{Prior Loss} = D_\mathrm{KL} (q(z_T|x)|p(z_T))}
\gray{\mathrm{Recon Loss} = \mathbb{E}_{q(z_0|x)} [ -\log p(x|z_0)]}
\blue{\mathrm{Diffusion Loss} = }
\blue{\sum_{t} D_\mathrm{KL} \left[ q(z_{t-1}|z_{t},x) || p_\theta (z_{t-1}|z_{t}) \right]}

Gaussian,

fully known

Also Gaussian,

but learned mean

Mario Geiger

Siddharth Mishra-Sharma

Making homogeneous and isotropic universes

p(x)
\mu_\theta(z_t,t)
p(z_T)

Invariant

to rotations and translations

Equivariant

Invariant

=
p(
)
p(
p(
)

Invariant

=
p(
)
p(
=
p(
)
p(

+ Galaxy formation

+ Observational systematics (Cut-sky, Fiber collisions)

+ Lightcone, Redshift Space Distortions....

Forward Model

N-body simulations

Observations

 SIMBIG arXiv:2211.00723

We can simulate the observable Universe, we just need hydrodynamical simulations

25 \, h^{-1}\mathrm{Mpc}

What are subresolution models?

10^{10} - 10^{11} M_\odot

Super massive black hole seeding in dark matter halos

BH feedback impacts galactic scales

Black holes can also growth through mergers

Effective models of astrophysical processes needed due to limited numerical resolutions or limited physical models

they can even teleport!

We can't model galaxy formation, how do we make our models robust?

Robust Summarisation

\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims}) + \blue{\lambda p_\mathrm{sims}(S(x_\mathrm{obs}))}
S:

Summariser (neural net)

(\theta_\mathrm{sims}, x_\mathrm{sims})
x_\mathrm{obs}
\mathcal{L} = I(S(x_\mathrm{sims})|\theta_\mathrm{sims})

Increase the evidence of the observations

Sims misspecified

What ML can do for cosmology

  • ML to accelerate non-linear predictions and density estimation

 

  • Can ML extract **all** the information that there is at the field-level in the non-linear regime?

 

  • Compare data and simulations, point us to the missing pieces?

cuestalz@mit.edu

Introducing homogeneity and isotropy

Redshift Space Distortions break isotropy

(Credit: E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials)

 

Latent Generative Models: Normalising flows

x = f(z), \, z = f^{-1}(x)
p(\mathbf{x}) = p_z(f^{-1}(\mathbf{x})) \left\vert \det J(f^{-1}) \right\vert

No assumptions on the likelihood (likelihoods rarely Gaussian!)

 

No expensive MCMC chains needed to estimate posterior

(Image Credit: Phillip Lippe)

arXiv:2202.05282

 

Normalising flows as generative models

arXiv:2202.05282

 

  • Fast theory models
  • Do all these summaries combined get all info?
    • Optimal information extraction
  • Density estimation
  • Improving current simulations?
  • How can we deal with model mispecification?

 

ML to the rescue for

Invariance vs Equivariance

r
\theta
\phi

Invariance

Scalar interactions

r

Equivariance

What can we do with vectors? 

Tensor products

v_i
v_j

LSS vs CMB

(Image Credit: Julian Bautista at Aix-Marseille University)

deck

By carol cuesta

deck

  • 312