ILLUSTRIS TNG
Hydro simulations
Empirical
models
&
?
Full Physics
Dark Matter Only
1) Find 50 most bound DM particles
Halo #1
Halo #23
Halo #1
2) Find DMO halo with at least 50% of these particles
97% of halos matched
Bijective!
2) Find DMO halo with at least 50% of these particles
Full Physics
Dark Matter Only
Shape
Dynamics
Temporal Evolution
Environment
Mass
Radius
Concentration
...
Velocity dispersion
Vmax
Velocity anisotropy
Spin
....
Formation time
Nmergers
...
Mass in torus
around halo
Full Physics
Dark Matter Only
Halo #1
Halo #23
Halo #1
Learn from Illustris
Decision Trees
YES
NO
How good is a given split?
Loss function
Mean Squared Error
Regularisation
Maximum depth
Boosting
Tree depth
Error
Target
Data
Underfitting
Overfitting
Optimal
Bagging
Tree Depth
Bootstrap 1
Bootstrap 2
Bootstrap 3
Decision Tree 1
Decision Tree 2
Decision Tree 3
Average
Extras!
Out of bag errors (no need for validation set)
Can do the same with features (improve over greediness)
Focus on difficult samples: Gradient descent in function space!
Error
Previous prediction
New prediction
Bagging + Boosting =
LightGradientBoostedMachine
(LGBM)
With
Without
i) Decision Trees default: Sum of impurity gains (MSE reduction at a given split) per feature.
Dynamical range, number of splits
Correlations
ii) Difference in MSE after retraining a model without the feature of interest.
Should I ask a question after this talk?
Is it almost lunch time?
Do you want your colleagues to hate you?
Ask!
Do I care?
Don't ask
Is it about magnetic fields?
Ask!
Ask!
Don't ask
YES
NO
NO
NO
YES
YES
NO
NO
YES
Don't ask