Big Data Cosmology meets AI
IAIFI Fellow
Carol Cuesta-Lazaro
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11147457/pasted-from-clipboard.png)
Boston University - 27 February 2024
Video Credit: N-body simulation Francisco Villaescusa-Navarro
![](https://galaxies-cosmology-2015.wdfiles.com/local--files/baryon-acoustic-oscillations/baryon_acoustic_peak.png)
![](https://d3i71xaburhd42.cloudfront.net/7be5f2f28a985c181d7bdcd3a01bb8dce1d86297/3-Figure1-1.png)
![](https://www.darkenergysurvey.org/wp-content/uploads/2021/07/y1bao_powerspectrum_figure2.png)
![](https://astro.ucla.edu/~wright/DL-vs-z-26Mar2015.gif)
The era of Big Data Cosmology
![](https://upload.wikimedia.org/wikipedia/commons/3/3c/Ilc_9yr_moll4096.png)
![](https://hubblesite.org/files/live/sites/hubble/files/home/resource-gallery/articles/_images/hs-article-0720a-2400x1840.jpg?t=tn2400)
![](https://news.fnal.gov/wp-content/uploads/2024/01/1-Deep-image-SN-only.jpg)
1-Dimensional
![](https://www.ligo.caltech.edu/system/pages/images/24/page/Gravity_Waves_StillImage.jpg?1699659823)
![](https://upload.wikimedia.org/wikipedia/commons/3/3c/Ilc_9yr_moll4096.png)
![](https://news.fnal.gov/wp-content/uploads/2024/01/1-Deep-image-SN-only.jpg)
![](https://www.ligo.caltech.edu/system/pages/images/24/page/Gravity_Waves_StillImage.jpg?1699659823)
![](https://hubblesite.org/files/live/sites/hubble/files/home/resource-gallery/articles/_images/hs-article-0720a-2400x1840.jpg?t=tn2400)
Machine Learning
Secondary anisotropies
Galaxy formation
Intrinsic alignments
Dust
xAstrophysics
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11147279/pasted-from-clipboard.png)
DESI, DESI-II, Spec-S5
Euclid
LSST
Simons Observatory
CMB-S4
Ligo
Einstein
LSST
Early Universe Inflation
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
Late Universe
![](https://upload.wikimedia.org/wikipedia/commons/3/3c/Ilc_9yr_moll4096.png)
Energy and matter content
Evolution
Dark matter
Dark energy
Hubble Constant
Baryons
Neutrino masses
Non-Gaussianity
Tilt power spectrum
Hubble tension
Beyond the Standard Model
Multifield Inflation
Dark Matter Reconstruction
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
![](https://sjeffreson.github.io/images/galaxy-fig-insets-smaller.png)
Hybrid ML - Physics Simulators
Cosmological (field level) Inference for Galaxy Surveys
DESI
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
DESI: Dark Energy Spectroscopic Instrument
![](https://newscenter.lbl.gov/wp-content/uploads/2022/01/Final_DESIz65QSO_new-628x454.png)
~40 Million spectra!
(Image Credit: Jinyi Yang, Steward Observatory/University of Arizona)
"Towards testing the theory of gravity with DESI: summary statistics, model predictions and future simulation requirements" Alam et al (including Cuesta-Lazaro) JCAP
![](https://newscenter.lbl.gov/wp-content/uploads/2022/01/allframe-1000mpc-960x540-1.gif)
(Image Credit: D. Schlegel/Berkeley Lab using data from DESI)
High dimensional data
Unknown
Simple summary statistic
estimated with Perturbation Theory
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
![](https://galaxies-cosmology-2015.wdfiles.com/local--files/baryon-acoustic-oscillations/baryon_acoustic_peak.png)
Probability pair of galaxy
Pair separation
"Full-shape analysis with simulation-based priors: constraints on single field inflation from BOSS" Ivanov, Cuesta-Lazaro, et al Submitted PRD
"Towards a non-Gaussian model of redshift space distortions" Cuesta-Lazaro et al, MNRAS
Forward Model
Parameters
Observable
Likelihood
Simulator
+ MCMC hammer
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11130639/pasted-from-clipboard.png)
Dark matter
Dark energy
Inflation
Perturbation Theory
Pen and paper
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144507/pasted-from-clipboard.png)
![](https://galaxies-cosmology-2015.wdfiles.com/local--files/baryon-acoustic-oscillations/baryon_acoustic_peak.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
+ Density Estimation
+ Sampler
High dimensional data
Unknown
Write your favourite summary statistic here
Simulations + ML
"SUNBIRD: Neural-network-based models for galaxy clustering" Cuesta-Lazaro et al MNRAS
"LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and Cosmology" Ho, Barlett, Chartier, Cuesta-Lazaro et al
Density Split
Cluster
Void
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
Dark Matter
Tilt primordial Fluctuations
Clumpiness
Expansion rate
Neutrinos
"Baryons"
"Constraining νΛCDM with density-split clustering" Paillas, Cuesta-Lazaro et al
MNRAS
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/9186643/pasted-from-clipboard.png)
"Cosmological constraints from density-split clustering in the BOSS CMASS galaxy sample" Paillas, Cuesta-Lazaro et al MNRAS
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/6926201/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/6926201/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/6926201/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/6926201/pasted-from-clipboard.png)
Dark matter density field
DESI:
Alternative Clustering Methods
DESI
![](https://planck.ipac.caltech.edu/system/news_items/images/9/thumb/planck13-001_Tn.jpg?1363903743)
Initial conditions
Dark matter
Dark energy
Inflation
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
Image credit: Bullock & Boylan-Kolchin
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040018/mass_halos-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040020/halo_circles-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11147243/pasted-from-clipboard.png)
Dark matter halo mass
Number of objects
Dark matter halo mass
Dark matter halo mass
A forward model samples the likelihood
Parameters
Observable
Observed galaxy pointcloud
Initial conditions
![](https://planck.ipac.caltech.edu/system/news_items/images/9/thumb/planck13-001_Tn.jpg?1363903743)
DESI
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
Forward Model
Example: How far will Patrick Mahomes throw?
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139808/pasted-from-clipboard.png)
def simulate_trajectory(
velocity,
angle,
time_step=0.1,
g=9.8,
):
z_velocity = np.random.normal(scale=2.)
z_angle = np.random.normal(scale=2.)
velocity = velocity + z_velocity
angle = angle + z_angle
angle_rad = np.radians(angle)
v_x = velocity * np.cos(angle_rad)
v_y = velocity * np.sin(angle_rad)
total_time = 2 * v_y / g
times = np.arange(0, total_time, time_step)
x = v_x * times
y = v_y * times - 0.5 * g * times**2
x_american = x * 1.09361
return x_american, y, times
Sample Prior
Simulator
Latent variables z
(e.g. is Taylor Swift looking?)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139743/football_throws.gif)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139716/likelihood.png)
Maximize the likelihood of the training samples
Model
Training Samples
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139793/samples_1d.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139794/likelihood_cont.png)
Generate Novel Samples
Evaluate probabilities
Trained Model
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139794/likelihood_cont.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139743/football_throws.gif)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139813/pasted-from-clipboard.png)
![](https://images.openai.com/blob/b196df3a-6fea-4d86-87b2-f9bb50be64c7/leaf.png?trim=0,0,0,0&width=2600)
![](https://cdn-icons-png.flaticon.com/512/1793/1793332.png)
A 2D animation of a folk music band composed of anthropomorphic autumn leaves, each playing traditional bluegrass instruments, amidst a rustic forest setting dappled with the soft light of a harvest moon
Image credit: DALL·E 3
1024x1024
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040096/diffusion_fig-1.png)
"A point cloud approach to generative modeling for galaxy surveys at the field level"
Cuesta-Lazaro and Mishra-Sharma
ICML ML4Astro workshop (Spotlight talk)
Base Distribution
Target Distribution
- Sample
- Evaluate
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11139914/darkcosmology__1_.gif)
Fixed Initial Conditions
Varying Cosmology
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040102/parameter_variations__1_-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040102/parameter_variations__1_-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040102/parameter_variations__1_-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040104/velocity_parameter_variations__1_-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040104/velocity_parameter_variations__1_-1.png)
Mean pairwise
velocity
k Nearest neighbours
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040102/parameter_variations__1_-1.png)
Pair separation
Pair separation
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10597125/pasted-from-clipboard.png)
Trained on only 5000 positions!
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
1 to Many:
Galaxy distribution
Dark Matter
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
"Probabilistic Reconstruction of Dark Matter fields from galaxies"
Park, Ono, Mudur, Ni, Cuesta-Lazaro NeurIPS Machine Learning and the Physical Sciences
![](https://samuel.physics.harvard.edu/sites/scholar.harvard.edu/files/aravisamuel/files/corepark-cropped.jpeg?m=1582231368)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040269/pasted-from-clipboard.png)
Victoria Ono
Core Park
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040057/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
Truth
Sampled
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
Observed
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824460/fiducial_summaries-1.png)
Small
Large
Scale (k)
Power Spectrum
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824460/fiducial_summaries-1.png)
log Mass
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040020/halo_circles-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040020/halo_circles-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040018/mass_halos-1.png)
log Mass
Counts
![](https://www.universetoday.com/wp-content/uploads/2011/06/molecular_cloud.jpg)
![](https://smd-cms.nasa.gov/wp-content/uploads/2023/06/spiral-galaxy-jpg.webp)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040353/Comparison-of-the-three-TNG-simulations-TNG50-TNG100-and-TNG300-For-each-projected.png)
![](https://wwwmpa.mpa-garching.mpg.de/galform/virgo/millennium/seqB_037a.jpg)
~ Gpc
pc
kpc
Mpc
Gpc
Video credit: Francisco Villaescusa-Navarro
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040016/cross_matrix-1.png)
Small
Large
In-Distribution
In-Distribution
In-Distribution
Out-of-Distribution
Out-of-Distribution
Out-of-Distribution
Out-of-Distribution
Out-of-Distribution
Out-of-Distribution
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
CAMELS
DESI LRG ~ 20 (Gpc/h)^3
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
TNG-300
True DM
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
Sample DM
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
Can we run larger simulations? (DESI volumes)
At high resolution?
Faster?
All this works depends on simulations, but...
Thousands of them?
![](https://media1.tenor.com/m/VqoQZNqYdkQAAAAC/more-i-want-more.gif)
Gravitational evolution ODE
Particle-mesh
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10812245/densities-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10812245/densities-1.png)
Particle-mesh
Full Nbody
Hybrid Simulator - on the fly
Gravitational evolution ODE
Trained to match particle velocities and positions: DIFFERENTIABLE
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10812245/densities-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10812245/densities-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10812245/densities-1.png)
Particle-mesh
Full Nbody
Hybrid ML-Simulator
"Nbodyify: Adaptive mesh corrections for PM simulations" Cuesta-Lazaro, Modi in preps
Hybrid subgrid models to bridge scales
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040606/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040609/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040610/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040612/pasted-from-clipboard.png)
High Res Sim
Springel and Hernquist 03
"Learning a subgrid model for the ISM from high resolution simulations" Jeffreson, Cuesta-Lazaro in preps
DESI
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11132792/debias-TNG300.png)
Building digital twins
Selections
Survey systematics
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11042385/pasted-from-clipboard.png)
Robustness
Extract only information that is robust
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
across time
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10597125/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11144498/pasted-from-clipboard.png)
AI4Science
Equivariance & Symmetries
Anomaly detection
Out-of-Distribution
Interpretability
Quantifying Uncertainties
Partial Observations
PDEs
Multimodal
Simulation-based-Inference
Foundation Models
Weather & Climate
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11097490/pasted-from-clipboard.png)
Chemistry & Biology
Quantum Mechanics
Particle Physics
Astrophysics & Cosmology
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11097517/pasted-from-clipboard.png)
![](https://smd-cms.nasa.gov/wp-content/uploads/2023/06/spiral-galaxy-jpg.webp)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11097657/pasted-from-clipboard.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11097663/pasted-from-clipboard.png)
Neuroscience
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11098512/pasted-from-clipboard.png)
Hierarchical
Conclusions
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10597125/pasted-from-clipboard.png)
1. There is a lot of information in galaxy surveys that ML methods can access
2. We can tackle high dimensional inference problems so far unatainable
3. Our ability to simulate will limit the amount of information we can extract
Hybrid simulators, forward models, robustness
Dark matter, Initial Conditions, let's get creative!
Field level inference
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/10824461/fiducial_imshows-1.png)
![](https://s3.amazonaws.com/media-p.slid.es/uploads/993552/images/11040606/pasted-from-clipboard.png)
BU
By carol cuesta
BU
- 116