Flexible uncertainty quantification in medical imaging
12th CVPR Workshop on Medical Computer Vision
2026



Jeremias Sulam
50 years ago ...

first CT scan


ELECTRIC & MUSICAL INDUSTRIES
50 years ago ...

imaging
diagnostics
complete hardware & software description
human expert diagnosis and recommendations

imaging was "simple"
... 50 years forward

Data

Compute & Hardware



Sensors & Connectivity







Research & Engineering
... 50 years forward


data-driven imaging
automatic analysis and rec.
societal implications
Data

Compute & Hardware



Sensors & Connectivity







Research & Engineering




data-driven imagingautomatic analysis and rec.societal implicationsProblems in trustworthy biomedical imaging
inverse problems
uncertainty quantification
robustness
generalization
demographic fairness
hardware & protocol optimization
model-agnostic interpretability
policy & regulation
monitoring & auditing



data-driven imagingautomatic analysis and rec.societal implicationsProblems in trustworthy biomedical imaging
inverse problems
uncertainty quantification
robustness
generalization
demographic fairness
hardware & protocol optimization
model-agnostic interpretability
policy & regulation
monitoring & auditing

in a box
Denoiser
Measurements

Reconstruction
Uncertainty Quantification in Inverse Problems
x^=fθ(y)

pixelj
x^j
(point predictors)
What is the uncertainty in the guess x^j?
y=Ax+ϵ, ϵ∼N(0,σ2I)
How do we report uncertainty rigorously?
Measurements
y=Ax+ϵ, ϵ∼N(0,σ2I)

Uncertainty Quantification in Inverse Problems
X^=F(y)∼Py






Sampling

in a box
Denoiser
pixelj
x^j
(predictive distribution)
What is the uncertainty in the guess x^j?
How do we report uncertainty rigorously?
Mathematical tractability vs Complexity

in a box
simpler models
more assumptions
any model
no assumptions
Denoiser
Linear models
Linear networks
Shallow
ReLU Networks
Just ask GPT
Conformal guarantees

Bayesian
MC Dropout
0
1
l(y)j
u(y)j
Uncertainty through Prediction Sets
How do we construct them?
- pixel-wise mean ± standard deviation
- Quantile regression
- MC-dropout (Gal & Ghahramani, 2016)
- any other heuristics...
C:y↦C(y)⊆[0,1]d
C(y)j=[l(y)j,u(y)j]
Conformal Risk Control (CRC)
ℓ(y,x)=d1j∈[d]∑1{xj∈/C(y)j}
0
1
l(y)j
u(y)j
Uncertainty through Prediction Sets
C:y↦C(y)⊆[0,1]d
ground truth!
C(y) controls risk at level ϵ if
"On average, no more than ϵ pixels are outside the sets"
E[ℓ(C(Y),X)]≤ϵ
\(x_j\)
Conformal Risk Control (CRC)
0
1
l(y)j
C(y)j
u(y)j
λ
λ
C(y)j=[lj(y),uj(y)]⟶Cλ(y)j=[lj(y)−λ,uj(y)+λ]
Given cal. set Scal={Xi,Yi}i=1n, let ℓcal(λ)=n1∑ℓ(Cλ(Yi,Xi))
λ^= smallest λ so that n+1nℓcal(λ)+1+n1≤ϵ
Lemma
[Angelopoulos et al, 2024]
Then, E[ℓ(Cλ(Y),X)]≤ϵ.
Conformal Risk Control (CRC)
0
1
l(y)j
C(y)j
u(y)j
λ
λ
Cλ(y)j=[lj(y)−λ,uj(y)+λ]
Given cal. set Scal={Xi,Yi}i=1n, let ℓcal(λ)=n1∑ℓ(Cλ(Yi,Xi))
λ^= smallest λ so that n+1nℓcal(λ)+1+n1≤ϵ
Then, E[ℓ(Cλ(Y),X)]≤ϵ.
Lemma

[Angelopoulos et al, 2024]
High Dimensional Risk Control
Cλ(y)j=[lj(y)−λ,uj(y)+λ]
Observation 1: Single λ for all d dimensions... suboptimal

High dimensional alternative λ∈Rd:
λ=(λ1,λ2,…,λd)
Cλ(y)j=[lj(y)−λj,uj(y)+λj]
Goal: minimize the mean interval length
λ∈Rdminj∈[d]∑λjs.t.E[ℓ(Cλ(X),Y)]≤ϵ
[Teneggi et al, 2023]
Semantic Risk Control
Cλ(y)j=[lj(y)−λj,uj(y)+λj]
Observation 2: High-dim data is heterogenous


Let λ vary according to content/semantics (e.g. per organ via a segmentation model)
Cλ(y)j=[lj(y)−λs(y)j,uj(y)+λs(y)j]
Segmentation model s(y):Y→[K]d
Semantic uncertainty λsem=(λ1,…,λK)∈RK
Semantic Risk Control


1. Find an anchor λ~sem:
λ~sem=λsem∈RKargmink∈[K]∑skλks.t.ℓ^optγ(λsem)≤ϵ
ℓ^γ: convex upper bound to ℓ(λ)
2. Calibrate
λ^sem=λ~sem+ω⋆1K, ω⋆=inf{ω≥0:Rcal+(λ~sem+ω1K)≤ϵ}.



\(s_k:\) expected size of organ \(k\)
Semantic Risk Control
2. Calibrate
λ^sem=λ~sem+ω⋆1K, ω⋆=inf{ω≥0:Rcal+(λ~sem+ω1K)≤ϵ}.
1. Find an anchor λ~sem:
λ~sem=λsem∈RKargmink∈[K]∑skλks.t.ℓ^optγ(λsem)≤ϵ
Guarantee
For any segmentation model \(s(Y)\in[K]\), any \(\epsilon > 0\) and exchangeable and independent calibration samples,
\[ \mathbb{E}\!\left[ \ell\!\left(\mathcal C_{\hat{\lambda}_{\mathrm{sem}}}(Y),X\right) \right] \le \epsilon . \]
Experiments
- CT reconstruction on TotalSegmentor (Wasserthal et al. 2023)
- Quantile Regression (Unet) for heuristic l(y) and u(y)

Experiments
- CT reconstruction on TotalSegmentor (Wasserthal et al. 2023)
- Quantile Regression (Unet) for heuristic l(y) and u(y)


- Segmentation via SuPrem (Li, Yuille, and Zhou 2024)
spleen, kidneys, gallbladder, liver, stomach, aorta, inferior vena cava (IVC), pancreasSemantic risk control



Semantic risk control


risk controlled uniformly for every organ
Recap
-
Conformal prediction allows for flexible UQ, with minimal assumptions
-
In high-dimensional settings, K-CRC allows for optimizing mean interval lengths
-
When samples are heterogeneous, semantic CRC allows for input-specific semantic calibration
Acknowledgements

Jacopo Teneggi
sem-CRC https://github.com/Sulam-Group/semantic_uq
K-CRC https://github.com/Sulam-Group/k-rcps
Funding: NSF CAREER Award CCF 2239787 and NIH R01CA287422


Teneggi, J., Tivnan, M., Stayman, W., & Sulam, J. How to trust your diffusion model: A convex optimization approach to conformal risk control. ICML 2023
Teneggi, J., Stayman, J. W., & Sulam, J. Conformal risk control for semantic uncertainty quantification in computed tomography. MICCAI 2025
policy & regulation
robustness
generalization
uncertainty quantification







CVPR Medical Imaging 2026
By Jeremias Sulam
CVPR Medical Imaging 2026
- 77