* This is joint work with Dr. Miyoshi (Team Principal)
Nagoya University, Japan
* The author was supported by RIKEN Junior Research Associate Program and JST SPRING JPMJSP2110.
Kota Takeda
Kota Takeda
Assistant Prof.
at Nagoya Univ., Japan
Research topics:
Uncertainty Quantification
Fluid mechanics
Data assimilation
Nagoya
Note: As we all know, weather prediction is difficult due to chaos. We use data assimilation to estimate the current state.
Note: As we all know, weather prediction is difficult due to chaos. We use data assimilation to estimate the current state.
Note: As we all know, weather prediction is difficult due to chaos. We use data assimilation to estimate the current state.
Numerical Weather Prediction
State estimation of High dimensional chaotic system
Numerical Weather Prediction
State estimation of High dimensional chaotic system
Unpredictable in long-term
e.g., Typhoon forecast circles
3D grid × variable
Numerical Weather Prediction
State estimation of High dimensional chaotic system
3D grid × variable
→ Estimating uncertainty in prediction by ensemble
Orange: Estimate
Large ensemble
Small ensemble
Success
Failed
(Filter divergence)
Green: True state
(Insert Image: Past Slide p.26 - Log-Log Schematic)
Fig: Schematic of Noise-scaled Accuracy
Note: Instead of looking at a fixed noise level, we look at the scaling behavior as noise goes to zero. This gives us a sharp, qualitative distinction between success and failure.
Note: We propose that $N_+$ is the key number.
Continuous-time
Evolution map
Discrete-time
from
Filtering problem
Estimate
known:
Obs. up to now
True
Obs.
Estim.
...
...
...
Induced from differential equations:
: Model dynamics
State space model
: Observation matrix
Assume
: Gaussian noise
- Estimates by ensemble
- Update:
- Correct mean and covariance
using observations
Repeat (I) & (II)...
(II)Analysis
(I) Forecast
Just evolve each sample
Correct samples based on the least squares
Estimate
ensemble
ensemble
(Evensen2009)
Ensemble Kalman filter (EnKF)
(I) Forecast
Remark: the rank is restricted as
Estimate forecast uncertainty using eigenvalues and vectors of forecast covariance:
(II) Analysis
Stronger correction in higher uncertainty along the direction.
→ rank-deficient
→ No correction in the degenerated direction.
Multiplicative inflation
"accutual"
Compensate for underestimated variability of forecasts with a limited ensemble.
(*An efficient implementation is used in practice.)
* The rank of covariance is not improved
ensemble size
Question
How many samples are required
for 'accurate state estimation' using EnKF?
Disadvantage
- varies depending on (out of our scope).
- Hard to distinguish accuracy similar with the observation noise level.
Question
What is the minimum ensemble size required
for 'accurate state estimation' using EnKF?
The standard metric for accuracy
obs. noise leve.
for large
(r-asymptotic) filter accuracy
: state estimation error.
: variance of obs. noise,
where
squared error
log-log
→ Commonly used in mathematical studies.
Question
What is the minimum ensemble size required
for 'accurate state estimation' using EnKF?
ensemble
Large
Accurate
Small
ensemble
Inaccurate
← Find minimum
achieving accuracy!
Question
What is the minimum ensemble size required
for 'accurate state estimation' using EnKF?
Advantage:
Qualitative distinction
- Mechanism of EnKF <-> dynamical instability
- Question: minimum ensemble size m*
- (Our) Reformulate accuracy
infinitesimal perturbation
expanded
contracted
Jacobian matrix
→ High forecast uncertainty along this direction.
Tangent linear model
linearize
Meaning
: Asymptotic exponential growth/decay rate
of the i-th principal axis.
positive → growth
negative → decay
Lyapunov exponents:
:
:
-singular value of
Exponents
Unstable dimension
ex.) 40-dim. Lorenz 96 model (chaotic toy model)
unstable
stable
Primitive equations
(core of atmospheric model)
Other dissipative systems too
Lyapunov exponents
Ansatz (low-dimensional structure):
Most geophysical flows satisfy
(de Wiljes+2018, T.+2024)
(Sanz-Alonso+2025)
(González-Tokman+2013)
for
using
'Stability' of
in unobserved sp.
'Lipschitz'
using
too many
additional factor
Accurate initial ensemble aligning with the unstable subspace.
Mathematical analyses have revealed sufficient conditions for .
(de Wiljes+2018, T.+2024)
(Sanz-Alonso+2025)
(González-Tokman+2013)
for
using
'Stability' of
in unobserved sp.
'Lipschitz'
using
too many
additional factor
Accurate initial ensemble aligning with the unstable subspace.
unrealistic (should be relaxed)
Mathematical analyses have revealed sufficient conditions for .
Focus on
Conjecture the minimum ensemble size for filter accuracy with the EnKF is
(※ with any initial ensemble)
González-Tokman+2013 assume this.
→ How to obtain?
Tracking only the unstable directions
unstable direction
: critical small ensemble
Efficient & Accurate Weather Prediction
Conjecture the minimum ensemble size for filter accuracy with the EnKF is
(※ with any initial ensemble)
Ansatz
squared error
log-log
EnKF with
: 小
: 大
パラメータ
:spin up期間
:最初のアンサンブル数
:削減後のアンサンブル数
ねらい:最初は多数のアンサンブルを使うことで真値に近く,不安定方向を向いた「良い」アンサンブルを構成.
Ensemble reduction
でアンサンブル数を削減.
SVD/PCAに基づき共分散行列を低ランク近似.
At step $n = N_{spinup}$:
Result: Efficient ensemble aligned with instability.
Note: This ensures that when we test $m=14$, for example, those 14 members are the "best possible" 14 members, capturing the most variance.
Require:Note: Why do we use spin-up? Because the dynamics naturally rotate the ensemble into the unstable subspace. This gives us an "optimal" initialization for testing the limit.
- Characterize instability via LEs
- key: ensemble alignment connects LEs and m*
- (Our) Proposing method
Supporting the conjecture
To support the conjecture, we perform numerical experiments estimating synthetic data generated by the Lorenz 96 model.
Setup (T.+2025)
obs.:
model:Lorenz96 ( )
noise:
EnKF:
(others are chosen appropriately)
numerical integration: RungeKutta
obs. interval:
For each , we compute the dependency of the worst error
on .
To support the conjecture, we perform numerical experiments estimating synthetic data generated by the Lorenz 96 model.
To support the conjecture, we perform numerical experiments estimating synthetic data generated by the Lorenz 96 model.
(Lorenz1996, Lorenz+1998)
Mimics chaotic variation of
physical quantities at equal latitudes.
non-linear conserving
linear dissipating
forcing
Lorenz 96 model
To support the conjecture, we perform numerical experiments estimating synthetic data generated by the Lorenz 96 model.
Spatio-temporal plot
Note: For F=8, we have 13 positive exponents. Note that N_x is 40, so the instability is low-dimensional.
Note: Here is the main result. Look at the slope. m=12, 13 are flat (divergent). m=14 and above follow the O(r^2) line. The theory holds.
(Insert Image: Fig.3 in manuscript)
$m=14$ (border size)
(Insert Image: Fig.4 in manuscript)
$m=14$ (border size), mean-accurate init.
blue
red
gray
For each , we compute the dependency of the worst error
on → log-log plot.
blue
red
gray
For each , we compute the dependency of the worst error
on → log-log plot.
blue
red
gray
accurate
→ This supports the conjecture.
Inaccurate
For each , we compute the dependency of the worst error
on → log-log plot.
(filter accuracy)
Note: We repeated this for F=16. The threshold moved exactly to 16, matching the number of positive exponents plus one.
New framework:
$r$-asymptotic filter accuracy
$$ \limsup_{n\rightarrow \infty}\mathbb{E}[\mathrm{SE}_n] = O(r^2)$$
distinguishes convergence/divergence qualitatively.
Main Finding:
The minimum ensemble size is given by
$$m^* = N_+ + 1$$
(numerically verified for Lorenz 96 with).
New supporting method:
The ensemble downsizing for faster alignment of the ensemble with the unstable space.
ensemble
Future work
Numerical studies: Further experiments validating for high-dimensional & complex systems with multi. zero LEs.
Theoretical analysis: Mechanism of ensemble alignment in the spin-up period.
Extension: Extending this theory to localized methods.
Paper: K.T. and T. Miyoshi (preprint), Quantifying the minimum ensemble size for asymptotic accuracy of the ensemble Kalman filter using the degrees of instability.
Thank you
Code: github.com/KotaTakeda/enkf_ensemble_downsizing
(T.+2024)
K. T. and T. Sakajo, SIAM/ASA Journal on Uncertainty Quantification, 12(4), 1315–1335,
(T.+2025)
K. T. and T. Miyoshi, EGUsphere preprint, https://egusphere.copernicus.org/preprints/2025/egusphere-2025-5144/.
(de Wiljes+2018) J. de Wiljes, S. Reich, and W. Stannat (2018), Long-Time Stability and Accuracy of the Ensemble Kalman-Bucy Filter for Fully Observed Processes and Small Measurement Noise, Siam J. Appl. Dyn. Syst., 17, pp. 1152–1181.
(Evensen2009)Evensen, G. (2009), Data Assimilation: The Ensemble Kalman Filter. Springer, Berlin, Heidelberg.
(Burgers+1998) G. Burgers, P. J. van Leeuwen, and G. Evensen (1998), Analysis Scheme in the Ensemble Kalman Filter, Mon. Weather Rev., 126, 1719–1724.
(Bishop+2001) C. H. Bishop, B. J. Etherton, and S. J. Majumdar (2001), Adaptive Sampling with the Ensemble Transform Kalman Filter. Part I: Theoretical Aspects, Mon. Weather Rev., 129, 420–436.
(Anderson 2001) J. L. Anderson (2001), An Ensemble Adjustment Kalman Filter for Data Assimilation, Mon. Weather Rev., 129, 2884–2903.
(Reich+2015) S. Reich and C. Cotter (2015), Probabilistic Forecasting and Bayesian Data Assimilation, Cambridge University Press, Cambridge.
(Law+2015) K. J. H. Law, A. M. Stuart, and K. C. Zygalakis (2015), Data Assimilation: A Mathematical Introduction, Springer.
(Sanz-Alonso+2025), D. Sanz-Alonso and N. Waniorek (2025), Long-Time Accuracy of Ensemble Kalman Filters for Chaotic Dynamical Systems and Machine-Learned Dynamical Systems, SIAM J. Appl. Dyn. Syst., pp. 2246–2286.
(Biswas+2024), A. Biswas and M. Branicki (2024), A unified framework for the analysis of accuracy and stability of a class of approximate Gaussian filters for the Navier-Stokes Equations, arXiv preprint, https://arxiv.org/abs/2402.14078.
(T.2025) K. T. (2025), Error analysis of the projected PO method with additive inflation for the partially observed Lorenz 96 model, arXiv preprint, https://doi.org/10.48550/arXiv.2507.23199.
(González-Tokman+2013) C. González-Tokman and B. R. Hunt (2013) Ensemble data assimilation for hyperbolic systems, Physica D: Nonlinear Phenomena, 243(1), pp. 128–142.
(T.+2025) K. T. and T. Miyoshi, Quantifying the minimum ensemble size for asymptotic accuracy of the ensemble Kalman filter using the degrees of instability, EGUsphere preprint, https://egusphere.copernicus.org/preprints/2025/egusphere-2025-5144/.
(Evensen2009)
Require:ETKF algorithm