modeling galaxy spectra using
deep generative models

changhoon.hahn@princeton.edu

these slides are interactive!
hover over figures and text with your cursor to find extra details

upcoming MOS missions to study galaxy evolution will face a number of major challenges

1

current spectral energy distribution (SED) models limit our ability to measure galaxy properties from spectra

2

galaxy formation models don't reproduce observations so they can't be used for spectral simulations

3

current SED modeling cannot be scaled-up to meet the demands of future data from MOS missions

hover over the text for more details

3

with deep generative models we can address each of these major challenges

deep generative models (e.g. VAEs, GANs) provide powerful unsupervised methods for learning the data distribution

low-dimensional representation of spectra in latent variable-space

a neural network that maps latent variables to spectral-space

Variational Autoencoders, Generative Adversarial Networks

a neural network that maps spectra to compact latent variables

spectra from MOS missions

reconstructed spectra from the generative model

hover over diagram for more details

by embedding a galaxy SED model in the architecture, we can build physics- and data-driven SED models 

neural network emulator of a galaxy stellar population synthesis model (e.g. Alsing+2020)

latent variable space now includes physical galaxy properties (e.g. SFH, metallicity)

physical transformation (redshifting) included in the causal structure

decoder outputs high-resolution, rest-frame galaxy SED

hover over diagram for more details

deep generative SED models can more accurately model observed spectra and account for systematics

our model can accurately model emission lines, which means we can extract galaxy property information from them

our model can accurately model emission lines, which means we can extract galaxy property information from them

our model provides a higher resolution SED that matches the observed spectra more accurately 

hover over the figure for more details

generative models can be combined with simulation-based inference* to accelerate Bayesian SED modeling

*a.k.a. "likelihood-free" inference

obligatory normalizing flow gif

hover over text for more details

can be used to generate data-driven spectral simulations for upcoming MOS missions (see also Peter Melchior's poster)

 

generative + SBI based SED modeling: ~3 hrs to train, full posterior 0.2 seconds/galaxy

11-dimensional SED model with non-parametric SFH and ZH

 scalable Bayesian analysis for future MOS missions

generative model + SBI based SED modeling applied to 144k galaxies in DESI survey validation --- PROVABGS-SV

the PRObabilistic Value-Added Bright Galaxy Survey-SV will be public available later this year!

the PRObabilistic Value-Added Bright Galaxy Survey-SV will be public available later this year!

hover over the figure for more details

inferred stellar masses from full posteriors