Generative Solutions for Cosmic Problems

Flatiron Institute

Institute for Advanced Study

Carol(ina) Cuesta-Lazaro

p(\mathrm{World}|\mathrm{Prompt})

["Genie 2: A large-scale foundation model" Parker-Holder et al (2024)]

p(\mathrm{Drug}|\mathrm{Properties})

["Generative AI for designing and validating easily synthesizable and structurally novel antibiotics" Swanson et al]

Probabilistic ML has made high dimensional inference tractable

1024x1024xTime

["Genie 3: A new frontier for world models" Parker-Holder et al (2025)]

Carolina Cuesta-Lazaro Flatiron/IAS - Astro Seminar

1-Dimensional

Machine Learning

Secondary anisotropies

Galaxy formation

Intrinsic alignments

DESI / SPHEREx / HETDEX

Euclid / LSST

SO / CMB-S4

LIGO / Einstein

The era of Big Data Cosmology

xAstrophysics

HERA / CHIME

SAGA / MaNGA

Galaxy formation

Emitters Census

Reionization

Cosmic Microwave Background

Galaxies / Dwarfs

21 cm

Galaxy Surveys

Gravitational Lensing

Gravitational Waves

AGN Feedback/Supernovae

Field-Level Inference and Emulators

Robust Simulation-based inference

Generating Fields

Generating Representations

Disentangling systematics from physics latent spaces

What is field-level inference?

A digital twin of our Universe

Observed Galaxy Distribution

Simulated Galaxy Distribution

Field Level Inference

Forward Model

(= no Cosmic Variance)

\Omega_m, \sigma_8, \ldots

p(\delta_{\mathrm{ICs}}, \theta|\delta_{\mathrm{Obs}})

Why field-level inference?

Optimal constraints

p(\,\cdot\,|\mathrm{Cosmology})

Counts-in-cell

Do we really need to infer 10^9 parameters to constrain ~10?

p(\,\cdot\,|\mathrm{Cosmology})

p(\,\cdot\,|\mathrm{Cosmology})

p(\,\cdot\,|\mathrm{Cosmology})

Compression

Marginal Likelihood

p(x|\theta) = \int p(x|z, \theta) p(z|\theta) \, dz
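The marginal likelihood integral can be approximated by averaging the conditional likelihood over latent draws. A minimal sketch, assuming a toy model that is not from the talk: z ~ N(theta, 1) and x | z ~ N(z, noise_std^2), chosen because the exact marginal N(theta, 1 + noise_std^2) is available for comparison. All function names are hypothetical.

```python
import math
import random


def normal_pdf(x, mean, std):
    """Density of a 1D Gaussian N(mean, std^2) evaluated at x."""
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2.0 * math.pi))


def marginal_likelihood_mc(x, theta, noise_std, n_samples=50_000, seed=0):
    """Monte Carlo estimate of p(x|theta) = E_{z ~ p(z|theta)}[p(x|z, theta)].

    Toy assumption: z ~ N(theta, 1) and x | z ~ N(z, noise_std^2).
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        z = rng.gauss(theta, 1.0)             # draw a latent z ~ p(z|theta)
        total += normal_pdf(x, z, noise_std)  # evaluate p(x|z, theta)
    return total / n_samples


def marginal_likelihood_exact(x, theta, noise_std):
    """Closed form for the toy model: x | theta ~ N(theta, 1 + noise_std^2)."""
    return normal_pdf(x, theta, math.sqrt(1.0 + noise_std ** 2))


estimate = marginal_likelihood_mc(x=0.3, theta=0.0, noise_std=0.5)
exact = marginal_likelihood_exact(x=0.3, theta=0.0, noise_std=0.5)
print(f"Monte Carlo: {estimate:.4f}, exact: {exact:.4f}")
```

With one latent dimension this brute-force average is cheap. With ~10^9 latent dimensions (the initial conditions) it is not, which is the motivation for learning the marginal likelihood or sampling the latents jointly.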

Explicit Likelihood

Implicit Likelihood

Initial Conditions

Generative Models 101

Maximize the likelihood of the training samples

\hat \phi = \operatorname*{arg\,max}_{\phi} \left[ \log p_\phi (x_\mathrm{train}) \right]
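As an illustration of maximum-likelihood training, the sketch below fits the simplest parametric model, a 1D Gaussian with phi = (mu, log sigma), by gradient ascent on the mean log-likelihood. The Gaussian model, learning rate, and function names are assumptions made for this example; a neural density model would replace the closed-form gradients with automatic differentiation.

```python
import math
import random


def mean_log_likelihood_grads(data, mu, log_sigma):
    """Gradients of the mean Gaussian log-likelihood w.r.t. mu and log(sigma)."""
    var = math.exp(2.0 * log_sigma)
    n = len(data)
    grad_mu = sum(x - mu for x in data) / (n * var)
    grad_log_sigma = sum((x - mu) ** 2 for x in data) / (n * var) - 1.0
    return grad_mu, grad_log_sigma


def fit_gaussian_mle(data, lr=0.1, n_steps=2000):
    """Gradient ascent on log p_phi(x_train) with phi = (mu, log sigma)."""
    mu, log_sigma = 0.0, 0.0
    for _ in range(n_steps):
        g_mu, g_log_sigma = mean_log_likelihood_grads(data, mu, log_sigma)
        mu += lr * g_mu
        log_sigma += lr * g_log_sigma
    return mu, math.exp(log_sigma)


rng = random.Random(0)
x_train = [rng.gauss(1.0, 2.0) for _ in range(1000)]
mu_hat, sigma_hat = fit_gaussian_mle(x_train)

# For a Gaussian the maximum-likelihood solution is known in closed form,
# which gives a direct check on the optimiser.
sample_mean = sum(x_train) / len(x_train)
sample_std = math.sqrt(sum((x - sample_mean) ** 2 for x in x_train) / len(x_train))
print(mu_hat, sample_mean, sigma_hat, sample_std)
```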

x_1

x_2

Parametric Model

p_\phi(x)

Training Samples

x_\mathrm{train}

x_1

x_2

Trained Model

p_\phi(x)

Evaluate probabilities

Low Probability

High Probability

Generate Novel Samples

Simulator

Generative Model

Fast emulators

Inference

Generative Model

Simulator

Generative Models: Simulate and Analyze

Bridging two distributions

x_1

x_0

Base

Data

"Creating noise from data is easy;

creating data from noise is generative modeling."

Yang Song

Neural Network

\frac{dx_t}{dt} = v^\phi_t(x_t)

\frac{\partial p(x_t)}{\partial t} = - \nabla \cdot \left( v^\phi_t(x_t) p(x_t) \right)
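Sampling from such a model amounts to integrating the ODE from the base to the data distribution. A minimal sketch, assuming t = 0 is the base N(0, 1) and t = 1 is a toy data distribution N(mean, std^2), and using an analytic velocity field in place of the trained network v^\phi_t (the function names and the target are hypothetical):

```python
import random


def velocity(x, t, mean=3.0, std=0.5):
    """Analytic velocity field carrying N(0, 1) at t=0 to N(mean, std^2) at t=1.

    It follows from straight-line paths x_t = (1 - t) * x_0 + t * (mean + std * x_0).
    In practice this function would be a trained neural network v_phi(x, t).
    """
    scale_t = 1.0 - t + t * std    # standard deviation of x_t along the path
    x0 = (x - t * mean) / scale_t  # base sample whose path passes through x at time t
    return mean + (std - 1.0) * x0


def sample(x0, n_steps=100):
    """Integrate dx/dt = v_t(x) from t=0 to t=1 with forward Euler."""
    x, dt = x0, 1.0 / n_steps
    for step in range(n_steps):
        x += dt * velocity(x, step * dt)
    return x


rng = random.Random(0)
base_samples = [rng.gauss(0.0, 1.0) for _ in range(5)]
data_samples = [sample(x0) for x0 in base_samples]
for x0, x1 in zip(base_samples, data_samples):
    print(f"base {x0:+.3f} -> data {x1:+.3f}")
```

Because the toy paths are straight lines, forward Euler reproduces them up to round-off. A learned velocity field generally has curved trajectories and needs more steps or a higher-order solver.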

6 seconds / sim vs 40 million CPU hours

Fast Emulation

p(\,\cdot\,|\mathrm{Cosmology})

25 \mathrm{Mpc/h}

100 \mathrm{kpc/h}

Density Fields

Marginal Likelihoods:

arXiv:2405.05255

Point Clouds

arXiv:2311.17141

p(\,\cdot\,|\mathrm{Cosmology})

Marginal Posteriors:

p(\,\cdot\,|\mathrm{Cosmology})

1) Sampling the Neural Likelihood (NLE) with HMC

2) Learning an optimal compression directly: Neural Posterior Estimation (NPE)

p(\theta|x) = \frac{p(x|\theta)p(\theta)}{p(x)}

Learned Likelihood
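Option 1 can be sketched with a small Hamiltonian Monte Carlo sampler. This is an illustration under stated assumptions, not the Diffusion-HMC implementation: the likelihood is a toy Gaussian, x_i | theta ~ N(theta, 1), standing in for the learned likelihood, the prior is N(0, 1), and the step size and trajectory length are hand-picked for this target.

```python
import math
import random


def log_posterior_and_grad(theta, data):
    """Unnormalised log p(theta|x) and its gradient for the toy model.

    Assumed model: prior theta ~ N(0, 1), likelihood x_i | theta ~ N(theta, 1).
    With NLE the likelihood term would come from the learned density instead.
    """
    log_post = -0.5 * theta ** 2 - 0.5 * sum((x - theta) ** 2 for x in data)
    grad = -theta + sum(x - theta for x in data)
    return log_post, grad


def hmc(data, n_samples=3000, step_size=0.05, n_leapfrog=10, seed=0):
    """Hamiltonian Monte Carlo with a leapfrog integrator and Metropolis correction."""
    rng = random.Random(seed)
    theta = 0.0
    log_post, grad = log_posterior_and_grad(theta, data)
    samples = []
    for _ in range(n_samples):
        momentum = rng.gauss(0.0, 1.0)
        h_old = -log_post + 0.5 * momentum ** 2
        new_theta, new_log_post, new_grad, p = theta, log_post, grad, momentum
        for _step in range(n_leapfrog):
            p += 0.5 * step_size * new_grad  # half step in momentum
            new_theta += step_size * p       # full step in position
            new_log_post, new_grad = log_posterior_and_grad(new_theta, data)
            p += 0.5 * step_size * new_grad  # half step in momentum
        h_new = -new_log_post + 0.5 * p ** 2
        if rng.random() < math.exp(min(0.0, h_old - h_new)):  # accept or reject
            theta, log_post, grad = new_theta, new_log_post, new_grad
        samples.append(theta)
    return samples


rng = random.Random(1)
observations = [rng.gauss(0.8, 1.0) for _ in range(10)]
chain = hmc(observations)[500:]  # drop burn-in
posterior_mean = sum(chain) / len(chain)
posterior_var = sum((c - posterior_mean) ** 2 for c in chain) / len(chain)

# Conjugate Gaussian result for the toy model, used only as a check.
exact_mean = sum(observations) / (len(observations) + 1)
exact_var = 1.0 / (len(observations) + 1)
print(posterior_mean, exact_mean, posterior_var, exact_var)
```

HMC only needs the log-density and its gradient with respect to theta, which a differentiable learned likelihood can provide.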

CNN

Diffusion

Increasing Noise

p(\sigma_8|\delta_m)

p(\sigma_8|\delta_m + 0.01 \epsilon)

p(\sigma_8|\delta_m + 0.02 \epsilon)

["Diffusion-HMC: Parameter Inference with Diffusion Model driven Hamiltonian Monte Carlo" 
Mudur, Cuesta-Lazaro and Finkbeiner
NeurIPS 2023 ML for the physical sciences, arXiv:2405.05255]

Nayantara Mudur

Posterior (NPE)

Likelihood (NLE)

Learning the marginal likelihood is more robust

p(\theta|x) = \frac{p(x|\theta)p(\theta)}{p(x)}

Learned Likelihood

Reconstructing ALL latent variables:

Dark Matter distribution

Entire formation history

Peculiar velocities

Predictive Cross Validation:

Cross-Correlation with other probes without Cosmic Variance

[Image Credit: Yuuki Omori]

Constraining Inflation:

Inferring primordial non-gaussianity

Why field-level inference?

Data-driven Subgrid models / Data-driven Systematics

"Joint cosmological parameter inference and initial condition reconstruction with Stochastic Interpolants"

Cuesta-Lazaro, Bayer, Albergo et al 
NeurIPS ML4PS 2024 Spotlight talk

Particle Mesh

Dark Matter Only

Gaussian Likelihood

Explicit Sampling vs SBI

1) Likelihood not necessarily Gaussian

2) Forward model need not be differentiable

3) Amortized

Generative Model: Marginalizing over ICs

Generative Model: Fixing ICs

HMC: Marginalizing over ICs

True

Reconstructed

\delta_\mathrm{Obs}

\delta_\mathrm{ICs}

p(\delta_\mathrm{ICs}, \theta|\delta_\mathrm{Obs})

Initial Conditions

Finals

L = 400 h^{-1} \mathrm{Mpc}, N = 32^3

L = 100 h^{-1} \mathrm{Mpc}, N = 64^3

HMC (ICs)

SBI (ICs)

SBI (Finals)

HMC (Finals)

SBI

HMC

Scaling up in volume

Implicit FLI for DESI

DESI Y1 LRG effective volumes are already larger than our sims!

Small Scale Galaxy Bias

How galaxies are selected

Fibre collisions

Forward Modelling the Survey Systematics

EFT

Galaxy Formation

Self-Consistent Predictions across observables

arXiv:1804.03097

X-Ray

Cluster gas mass fractions

Cluster gas density profiles

Sunyaev-Zeldovich

Galaxy Properties

Thermal: integrated electron pressure (hot electrons / big objects)

Star formation + histories

Stellar mass / halo mass relation

FRBs

Integrated electron density

Kinetic: integrated electron density x peculiar velocity
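For reference, the line-of-sight integrals named above can be written out as follows. These are the standard textbook forms, given here as a sketch: the kinetic term assumes the optically thin limit, and its overall sign depends on the convention chosen for the line-of-sight direction \hat{\mathbf{n}}.

```latex
% Thermal SZ: Compton-y, the integrated electron pressure P_e = n_e k_B T_e
y = \frac{\sigma_T}{m_e c^2} \int n_e k_B T_e \, dl

% Kinetic SZ: electron density weighted by the line-of-sight peculiar velocity
\frac{\Delta T_\mathrm{kSZ}}{T_\mathrm{CMB}} = - \sigma_T \int n_e \frac{\mathbf{v} \cdot \hat{\mathbf{n}}}{c} \, dl

% FRB dispersion measure: integrated electron density
\mathrm{DM} = \int \frac{n_e}{1 + z} \, dl
```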

Multi-wavelength Observables

["BaryonBridge: Interpolants models for fast hydrodynamical simulations" Horowitz, Cuesta-Lazaro, Yehia ML4Astro workshop 2025]

Particle Mesh for Gravity

CAMELS Volumes

25 h^{-1} \mathrm{Mpc}

1000 boxes with varying cosmology and feedback models

Gas Properties

Current model optimised for Lyman Alpha forest

7 GPU minutes for a 50 Mpc simulation

130 million CPU core hours for TNG50

Density

Temperature

Galaxy Distribution

+ \mathcal{C}, \mathcal{A}

p(\mathrm{Baryons}|\mathrm{DM}, \mathcal{C}, \mathcal{A})

Hydro At Scale

[Video credit: Francisco Villaescusa-Navarro]

Gas density

Gas temperature

Subgrid model 1

Subgrid model 2

Subgrid model 3

Subgrid model 4

Can we learn a general and continuous representation of Baryonic feedback?

Gas

Galaxies

p(\mathrm{Baryonic\ fields}|\mathrm{Dark\ Matter}, z_\mathrm{baryons})

Marginalize over a broader set of subgrid physics

Interpolate between simulators

Mingshau Liu

(Ming)

Constrain z via multi-wavelength observations

Trained on:

TNG, SIMBA, Astrid, EAGLE

z = f(x)

Encoder

z_\mathrm{baryons}

1) Encoder

Gas

Galaxies

p(\mathrm{Baryonic\ fields}|\mathrm{Dark\ Matter}, z_\mathrm{baryons})

2) Probabilistic Decoder

p(\mathrm{Baryonic\ fields}|\mathrm{Dark\ Matter}, z_\mathrm{baryons})

\mathcal{O}(10)

(Test suite)

Gas Density

Temperature

Astrid

EAGLE

\alpha = 0

\alpha = 0.25

\alpha = 0.5

\alpha = 0.75

\alpha = 1

Interpolating over Simulations
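One simple way to realise the \alpha sweep is a linear blend of two latent vectors, which is then passed to the decoder. The sketch below assumes linear interpolation and made-up 3-dimensional latents; the talk does not specify the interpolation scheme or the latent dimension, and the function name is hypothetical.

```python
def interpolate_latents(z_start, z_end, alpha):
    """Linear interpolation z(alpha) = (1 - alpha) * z_start + alpha * z_end."""
    if len(z_start) != len(z_end):
        raise ValueError("latent vectors must have the same dimension")
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(z_start, z_end)]


# Hypothetical latents standing in for two encoded simulations.
z_astrid = [0.2, -1.0, 0.5]
z_eagle = [1.0, 0.4, -0.3]

for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
    z_alpha = interpolate_latents(z_astrid, z_eagle, alpha)
    print(alpha, [round(v, 3) for v in z_alpha])
```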

Generalizing to unseen simulations: Magneticum

smsharma/awesome-neural-sbi

Simulation-based Inference is proliferating in Astrophysics

on Simulations

x^\mathcal{O}

x^\mathcal{S}

Simulated Data

Observed Data

z^\mathcal{O}_p

z^\mathcal{O}_s

z^\mathcal{S}_s

z^\mathcal{S}_p

Alignment Loss

\mathcal{L} = - \sum_{\mathcal{D} \in \{\mathcal{S}, \mathcal{O}\}} \log p(x^\mathcal{D}|z^\mathcal{D}_s, z^\mathcal{D}_p) + \lambda d(z^\mathcal{O}_s,z^\mathcal{S}_s)
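The two terms of this loss can be sketched in a few lines. The assumptions here are illustrative only: the reconstruction term is a Gaussian negative log-likelihood with fixed sigma, and the distance d is the 1D Wasserstein-1 (optimal transport) distance between batches of scalar shared latents. The talk lists OT or adversarial alignment as options without fixing one, and all function names are hypothetical.

```python
def wasserstein_1d(samples_a, samples_b):
    """1D optimal-transport (Wasserstein-1) distance between equal-size samples.

    In one dimension the optimal coupling pairs the sorted samples.
    """
    if len(samples_a) != len(samples_b):
        raise ValueError("this sketch assumes equal sample sizes")
    pairs = zip(sorted(samples_a), sorted(samples_b))
    return sum(abs(a - b) for a, b in pairs) / len(samples_a)


def gaussian_nll(x, x_hat, sigma=1.0):
    """Reconstruction term: -log p(x|z) for a Gaussian decoder, up to a constant."""
    return 0.5 * sum((a - b) ** 2 for a, b in zip(x, x_hat)) / sigma ** 2


def alignment_loss(x_obs, x_obs_hat, x_sim, x_sim_hat,
                   z_shared_obs, z_shared_sim, lam=1.0):
    """Reconstruction on both domains plus lam * d(shared latents)."""
    reconstruction = gaussian_nll(x_obs, x_obs_hat) + gaussian_nll(x_sim, x_sim_hat)
    alignment = wasserstein_1d(z_shared_obs, z_shared_sim)
    return reconstruction + lam * alignment


# Shared latents that differ by a constant shift of 1 are a distance 1 apart.
shift = wasserstein_1d([0.0, 1.0, 2.0], [3.0, 1.0, 2.0])
print(shift)
```

The alignment term compares the distributions of shared latents across a batch, not individual pairs, so observed and simulated samples do not need to be matched one to one.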

Reconstruction

Statistical Alignment

50\%

(OT / Adversarial)

Encoder

Obs

Encoder

Sims

Private Domain Information

Shared Information

\hat{x}^\mathcal{O}

\hat{x}^\mathcal{S}

Observed Reconstructed

Simulated Reconstructed

Shared Decoder

A Toy Model Example

Idealized Simulations

Observations

+ Scale Dependent Noise

+ Bump

x^\mathcal{O}

x^\mathcal{S}

Amplitude

Tilt

p(\theta|z^\mathcal{O}_s)

p(\theta|z^\mathcal{O}_p)

p(\theta|z^\mathcal{O}_p,z^\mathcal{O}_s)

p(\theta|z^\mathcal{O}_p)

Robust SBI from Shared

p(x^\mathcal{O}|z^\mathcal{O}_p,z^\mathcal{O}_s)

p(x^\mathcal{O}|z^\mathcal{O}_s)

Visualizing Information Split

Anomaly Detection in Astrophysics

arXiv:2503.15312

Can we separate Systematics from Physics?

Pablo Mercader

Daniel Muthukrishna

Jeroen Audenaert

Legacy Survey

HSC

DESI

SDSS

Same Object / Different Instrument

Different Object / Same Instrument

Object 1

Object 2

Object 1

z_\mathrm{instrument}

Back to the Playground!

Orientation + Scale

Number

p(x|z_\mathrm{instrument}, z_\mathrm{object})

Instrument 1

Instrument 2

Instrument Encoder

z_\mathrm{object}

Object Encoder

Instrument Pair

Object Pair

Instrument Pair

Object Pair

Ground Truth

Instrument Pair

Object Pair

Recon

Conclusions

1. Cosmological field-level inference can be made scalable with generative models

Can EFT help us scale in volume?

2. We can scale hydrodynamical simulations in volume for the analysis of LSS surveys

Can we leverage multi-wavelength observations?

3. Playing with the latent space will help us learn robustly

Can generally make simulators more controllable!

Is resolution too low?

Private-Shared Information Split

Disentangling systematics

Observation

Question

Hypothesis

Testable Predictions

Gather data

Alter, Expand, Reject Hypothesis

Develop General Theories

[Figure adapted from ArchonMagnus]

Simulators as theory models

The Scientific Method in 2025

High-dimensional data

["An LLM-driven framework for cosmological
model-building and exploration" Mudur, Cuesta-Lazaro, Toomey (in prep)]

Can LLMs turn these anomalies into new hypotheses?

Propose a model for Dark Energy

Implement it in a Cosmology simulation code: CLASS

Test fit to DESI Observations

Iterate to improve fit

Quintessence, DE/DM interactions....

Must pass a set of general tests for "reasonable" models

Ideally, compare evidence to LCDM.

For now, Bayesian Information Criterion (BIC)
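The BIC comparison is a short calculation: BIC = k ln(n) - 2 ln(L_max), where k is the number of free parameters and n the number of data points. A minimal sketch, assuming Gaussian errors so that -2 ln(L_max) equals the best-fit chi-squared up to a model-independent constant. The function names are hypothetical and the numbers are made up for illustration; they are not results from the talk.

```python
import math


def bic(chi2_min, n_params, n_data):
    """BIC = k ln(n) - 2 ln(L_max), using -2 ln(L_max) = chi2_min for Gaussian errors."""
    return n_params * math.log(n_data) + chi2_min


def delta_bic(chi2_model, k_model, chi2_ref, k_ref, n_data):
    """BIC(model) - BIC(reference); negative values favour the model."""
    return bic(chi2_model, k_model, n_data) - bic(chi2_ref, k_ref, n_data)


# Made-up numbers: a model with two extra parameters that lowers chi2 by 7.
difference = delta_bic(chi2_model=95.0, k_model=8, chi2_ref=102.0, k_ref=6, n_data=100)
print(f"Delta BIC = {difference:.2f}")
```

In this example the two extra parameters cost 2 ln(100), which is more than the chi-squared improvement of 7, so the reference model is still preferred.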

Nayantara Mudur (Harvard)

Can LLMs implement new physics models?

Thawing Quintessence

Axion-like Early Dark Energy

Ultra-light scalar field that temporarily acts as dark energy in the early universe

Implementation Challenge:

Dynamic dark energy model: scalar field transitions from "frozen" (cosmological constant-like) to evolving as the universe expands.

Oscillatory behaviour

Can take advantage of existing scalar field implementations in CLASS

+ 43,000 lines of C code

+ 10,000 lines of numerical files

CLASS Challenge:

1) Code compiles + obtains reasonable observables

2) Implementation agrees with target repository

3) Goodness of fit for DESI + Supernovae

4) H0 tension metrics

Curated

One-page description of the model to be implemented, CLASS tips + very explicit units

Paper

Directly from a full paper

If it fails, get feedback from another LLM

Propose a Dark Energy Model

Shortcut: field that produces this?

Propose a Dark Energy Model

Asked for physical motivation. It tried :(

Not true, preferred scale

Reinforcement Learning

How to iterate

Update the base model weights to optimize scalar reward(s)

DeepSeek R1
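DeepSeek R1 was trained with GRPO, in which the scalar rewards of a group of rollouts for the same prompt are turned into advantages by normalising within the group. A minimal sketch of that normalisation step only; the use of the population standard deviation and the eps value are assumptions, and the policy-gradient update itself is omitted.

```python
import statistics


def group_relative_advantages(rewards, eps=1e-8):
    """Normalise each reward by the mean and std of its group of rollouts."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


# Hypothetical rewards for four rollouts of one prompt (1 = test passed).
advantages = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
print(advantages)
```

Rollouts that score above their group's average get a positive advantage and are reinforced, so no separate value network is needed to provide a baseline.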

Base LLM

(being updated)

What rewards are more advantageous?

Base LLM

(frozen)

Develop basic skills: numerics, theoretical physics, UNIT CONVERSION

Community Effort!

Evolutionary algorithms

Learning in natural language, reflect on traces and results

Examples: EvoPrompt, FunSearch, AlphaEvolve

How to iterate

["GEPA: Reflective prompt evolution can outperform reinforcement learning" Agrawal et al]

GEPA: Evolutionary

GRPO: RL

+10% improvement over RL with 35x fewer rollouts

Scientific reasoning with LLMs still in its infancy!
