Modern recipes for good photometric redshifts

Kenneth Duncan

Lorentz Workshop - June 2016

Leiden Observatory

e.g. CANDELS UDS: Galametz et al. (2013)

You have your nice new multi-wavelength catalog, now what...

Recipe 1:

Template fitting photo-z estimates

Step 1: The code

EAZY

Brammer et al. (2008)

LePhare

Arnouts et al. (1999)

Ilbert et al. (2006)

PhotoZ

Bender et al. (2001)

Hyper-Z

Bolzonella et al. (2000)

ZEBRA

Feldmann et al. (2006)

BPZ

Benitez (2000)

Step 1: The code

Total citations: ~2800

Then

Now

Step 2: The Templates

Step 3: zeropoint offsets and additional smoothing errors

Additional rest-frame errors

Corrections to the observed zeropoints

Brammer et al. (2008) 

Dust

AGB Stars?

PAH/Dust emission/AGN?

Step 4: priors (optional)

Brammer et al. (2008) 

Benitez (2000)

Magnitude

Spectral type

The results

e.g. for the UDS catalog at the beginning...

Recipe 2:

Training based photo-z estimates

(aka machine learning)

Aside: Motivations for ML-based Photo-z's

Euclid

LSST

Aside: Motivations for training (ML) based Photo-z's

1. Speed

Euclid: ~1.5 billion galaxies

 

LSST: ~10 billion galaxies

 

Estimated time to run EAZY on all sources (on a desktop machine):

~2+ years (Euclid)

~14+ years (LSST)

Motivations for training (ML) based Photo-z's

2. Improvements in accuracy

Sanchez et al. (2014)

Weak Lensing requirements:

Scatter

\sigma_{z}/(1+z) < 5\%
\langle z \rangle / (1+z) < 0.2\%

Bias

Step 1: Select your training sample

i.e. a representative subset of your sample with spectroscopic redshifts

Step 2: Pick your favourite regression/classification algorithm

Neural Networks

Self-organizing Maps (SOMs)

Deep learning

Support Vector Machines (SVM)

Naive Bayes

Gaussian Processes

Generalized Linear Models

Bayesian Network

k-Nearest Neighbour

Boosted Decision Trees

Randomised Forests

Relevance vector machines

Radial basis function networks

Normalised inner product nearest neighbour

Directional neighbourhood fitting

Voronoi tesselation density estimator

Non-conditional density estimation

Step 3: Train your regression/classification algorithm

Step 4: Apply to your science sample

magic happens somewhere here

Pros and Cons of ML Photo-z's

Pro:

  • Fast and scalable
     
  • Entirely empirical:
    no concern about template choice
     
  • Simple to include extra information:
    properties such as size and morphology can help break degeneracies

Con:

  • Entirely dependent on spectroscopic training sample
     
  • Struggle more with inhomogeneous datasets (e.g. missing filters)
     
  • Difficult to physically interpret solutions - e.g. rest-frame colours

Final step: (For all photo-z methods)

Fraction of spectroscopic redshifts within given confidence interval

Dahlen et al. (2012)

!

7/11 submitted photo-z estimates significantly overconfident for 1-sigma errors

Calibrating redshift pdfs

Calibrating redshift pdfs

Wittman et al. (2016)

See also Bordoloi et al. (2010)

Improving photo-z estimates even more...

the wisdom of crowds

Combine multiple photo-z estimates

Dahlen et al. (2012)

= Median of all photo-z estimates

= Median of best 5 photo-z estimates

See also Carraso Kind & Brunner (2014)

Also works for diff. templates with the same code

Limits of existing spectroscopic samples

Masters et al. (2015)

Limits of existing spectroscopic samples

Masters et al. (2015)

Short term solution:

 Targeted spectroscopic follow-up of unexplored colour space (Underway):            spectra

Ongoing spectroscopic surveys: 

(e.g. VANDELS/VUDS/LEGA-C/OzDES/HETDEX)                spectra

 

Long term solution (2018+):

New massively multiplexed spectroscopic surveys -

Subaru/PFS and WHT/WEAVE in the North:            spectra

VLT/MOONS and VISTA/4MOST in the South:           spectra

\sim 10^4
\sim 10^{4-6}
> 10^6
> 10^6

What is the future of photometric redshifts?

Hybrid template + ML methods?

e.g. 'fuzzy archetypes' method of Speagle & Eisenstein (2015)

What is the future of photometric redshifts?

Next generation Template fitting code(s)?

If speed is the requirement:

 

Scope for new template codes which make use of...

- Efficient parallelisation

- GPU acceleration

If physical interpretation of fitted SEDs is the requirement:

Fit using SED modelling codes, e.g.:

- Prospector

- BEAGLE

- MAGPHYS/Cigale

New Euclid team code?

In addition to the proliferation of machine learning redshifts... 

Photo-z's for AGN

A whole other problem...

Redshifts for LOFAR sources

Suggested strategy...

* Carrasco Kind and Brunner (2014)

Bayesian model combination/averaging (BMC/BMA)*

Test field

Spectroscopic sample (Bootes)

N x Photo-z estimates

e.g.:

QSO Templates

Stellar Templates

ML estimates

Photo-z's Optimised for all source types

Photometric redshifts

By Kenneth Duncan

Photometric redshifts

Review of photometric redshifts past, present and future. For the Lorentz Workshop Jun 20th-24th

  • 333