Dr. Ben Mather
EarthByte Group
University of Sydney
Essentially, all models are wrong,
but some are useful.
- Box & Draper
Empirical Model-Building and Response Surfaces (1987)
@BenRMather
A Newspoll conducted shortly before the federal election predicted a Labor victory 53% to the Coalition's 47% on a two-party preferred preference
@BenRMather
When it comes to the opinion polling, something’s obviously gone really crook with the sampling
both internally and externally.
- ABC political editor Andrew Probyn
@BenRMather
@BenRMather
@BenRMather
@BenRMather
@BenRMather
There are a lot of words here and most of them mean the same things.
Machine Learning = Inference
@BenRMather
Generate 50%, 95%, 99% confidence intervals using randomly drawn models
There may be many solutions that fit the same set of observations.
posterior
likelihood
prior
model
data
example of an ill-posed problem
@BenRMather
example of a well-posed problem
Input parameters
Model being solved
Compare data & priors
FORWARD MODEL
Prior
Likelihood
Posterior
Inverse Model
We can estimate the value of pi with monte carlo sampling.
from random import random
n = int(input("Enter number of darts"))
c = 0
for i in range(n):
x = 2*random()-1
y = 2*random()-1
if x*x + y*y <= 1:
c += 1
print("Pi is {}".format(4.0*c/n))
Python code to run simulation
Global
minimum
Local
maximum
Local
minimum
Monte Carlo sampling
Global
minimum
Local
maximum
Local
minimum
Markov-Chain Monte Carlo sampling (MCMC)
Global
minimum
Local
maximum
Local
minimum
MCMC with gradient
Global
minimum
Local
maximum
Local
minimum
MCMC with gradient (caveat emptor!)
trapped!
There are known knowns;
there are things we know we know.
We also know there are known unknowns; that is to say we know there are some things we do not know.
But there are also unknown unknowns - the ones we don't know we don't know.
- Donald Rumsfeld
Former US Secretary of Defense
@BenRMather
@BenRMather
Dr. Ben Mather
Madsen Building, School of Geosciences,
The University of Sydney, NSW 2006