75% risk!

Black box

model

Domain expert

But why?

Data

Machine learning

75% risk!

Black box

model

Domain expert

Explanation

Aha!

But why?

Data

Explainer

Machine learning

Global

Local

Fraud Dashboard

Overview

ExplainExplore

Global

Local

Fraud Dashboard

Overview

ExplainExplore

Contribution-Value Plots

Global

Local

Fraud Dashboard

Overview

ExplainExplore

CV Plots

Global

Local

Fraud Dashboard

StrategyAtlas

Overview

Fraud detection explanations
for sick-leave insurances

FRAUD DETECTION EXPLANATIONS

Data

Missing/incorrect values

Model

100 Random Forest
500 trees each
~25 decisions per tree
1.312.471 decisions total!

×

OOB error: 27.7%

Real world scenario

FRAUD DETECTION EXPLANATIONS

My solution

FRAUD DETECTION EXPLANATIONS

[1] Palczewska, Anna et. al. Interpreting random forest classification models using a feature contribution method. In Integration of reusable systems, pp. 193–218. Springer, 2014.

0 1 2 3 x

y

2

1

7 : 7

6 : 2

...

\(Y_{mean}\) = 0.5

\(Y_{mean}\) = 0.75

\(LI_{X}\) = 0.25

Contribution per Decision Tree:

\(FC_{i,t}^f = \sum_{N \in R_{i,t}} LI_f^N\)

Contribution per Random Forest:

\(FC_i^f = \frac{1}{T}\sum_{t=1}^T FC_{i,t}^f\)

X < 2.5

Feature contribution

FRAUD DETECTION EXPLANATIONS

[2] Friedman, Jerome H. Greedy function approximation: A gradient boosting machine. Annals of Statistics, 29(5): pp. 1189–1232, 2001.

300

250

200

150

100

50

1

0%

100%

200

100

0

Duration illness

Fraud?

Fraud (55%)

Non-fraud (35%)

Company	ABC Inc
Employees	5
Duration illness	days
...	...

Fraud (65%)

Fraud (90%)

Non-fraud (45%)

Non-fraud (40%)

Non-fraud (25%)

Partial dependence

FRAUD DETECTION EXPLANATIONS

[3] Ribeiro, Marco Tulio et. al. Why should i trust you?: Explaining the predictions of any classifier. In

Proceedings of the 22nd ACM SIGKDD, pp. 1135–1144. ACM, 2016.

[4] Deng, Houtao. Interpreting tree ensembles with inTrees. arXiv preprint arXiv:1408.5456 , pp. 1–18, 2014.

0 1 2 3 x

y

2

1

Local rule extraction

FRAUD DETECTION EXPLANATIONS

Fraud team happy! 🎉

FRAUD DETECTION EXPLANATIONS

Paper presented at:
Workshop on Human Interpretability in Machine Learning

Stockholm, Sweden

FRAUD DETECTION EXPLANATIONS

Questions?

Visual exploration of
machine learning explanations

EXPLAINEXPLORE

Surrogate learning

0 1 2 3 x

y

2

1

Feature 1

Feature 2

Feature 3

Feature 1

Feature 2

Feature 3

Feature 1

Feature 2

Feature 3

Problem

EXPLAINEXPLORE

Help data scientists to create and tune explanatory surrogate models.

EXPLAINEXPLORE

← Any tabular data set
← Any Python classifier, or PMML
← Different surrogate models

Configuration view

EXPLAINEXPLORE

← Surrogate fidelity: R2
← Prediction
← Feature contribution

Local columns

Global columns

Shows values or contribution →
Line color = predicted class →
Compare selected instance with data →
Clusters indicate ‘strategies’ →

Feature view

EXPLAINEXPLORE

Context view

EXPLAINEXPLORE

Context view

EXPLAINEXPLORE

Context view

EXPLAINEXPLORE

More info at

explaining.ml

Use case

EXPLAINEXPLORE

Paper accepted at:
IEEE Pacific Visualization 2020

@

Tianjin, China

😢

EXPLAINEXPLORE

Questions?

More info at

explaining.ml

Can be used for any Python model...

Can be used for any model...

EXPLAINER TECHNIQUES

Applications

sklearn-pmml-model

bit.ly/
sklearn-pmml

or

Paper and software submitted to:
Journal of Machine Learning Research

Machine Learning Open Source Software (MLOSS)

EXPLAINER TECHNIQUES

Machine Learning Interpretability through Contribution-Value Plots

CONTRIBUTION-VALUE PLOTS

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

x

ŷ

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

[1] Friedman, J. H. "Greedy function approximation: a gradient boosting machine.", 2001.

Prediction (ŷ)

Local PDP [1]

Sensitivity

analysis

x

ŷ

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

Sensitivity

analysis

x

ŷ

[2] Goldstein, A., et al. "Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation.", 2015.

Local PDP [1]

ICE plot [2]

Repeat

x

ŷ

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

[4] Lundberg, S. M., et. al.. "A unified approach to interpreting model predictions.". 2017.

Feature 1

Feature 2

Feature 3

Feature 4

Feature 5

[3] Ribeiro, M. T., et. al. ""Why should i trust you?" Explaining the predictions of any classifier.", 2016.

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

Sensitivity

analysis

x

ŷ

Local PDP [1]

ICE plot [2]

Repeat

β

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

Local PDP

ICE plot

Contribution (β)

Sensitivity

analysis

Repeat

x

ŷ

β

x

β

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

Local PDP

ICE plot

Contribution (β)

LCV plot

Sensitivity

analysis

Sensitivity

analysis

Repeat

x

ŷ

β

x

β

CONTRIBUTION-VALUE PLOTS

How?

Pre-existing techniques as elementary building blocks.

Prediction (ŷ)

Local PDP

ICE plot

Contribution (β)

LCV plot

GCV plot

Sensitivity

analysis

Sensitivity

analysis

Repeat

x

ŷ

β

x

β

CONTRIBUTION-VALUE PLOTS

Wine acidity (pH)

x

β

CONTRIBUTION-VALUE PLOTS

An exemplary ~~data~~ model exploration

Data

998 red wines
11 features

Model

Random Forest (100 trees)
Predict quality: 👍 / 👎

CONTRIBUTION-VALUE PLOTS

An exemplary ~~data~~ model exploration

x

β

x

β

CONTRIBUTION-VALUE PLOTS

Wine acidity (pH)

x

β

x

ŷ

CONTRIBUTION-VALUE PLOTS

An exemplary ~~data~~ model exploration

??

CONTRIBUTION-VALUE PLOTS

Line fading . .

x

β

CONTRIBUTION-VALUE PLOTS

Questions?

More info at

explaining.ml

Strategy analysis for
machine learning interpretability

STRATEGYATLAS

How?

Strategy A

Strategy B

The basic principle

STRATEGYATLAS

My solution

STRATEGYATLAS

Paper accepted and in press at:
IEEE Transactions on Visualization and Computer Graphics

STRATEGYATLAS

More info at

explaining.ml

STRATEGYATLAS

Visualization for Explainable AI

Machine learning

Machine learning

Overview

Overview

Overview

Overview

Fraud detection explanations for sick-leave insurances

Data

Model

Real world scenario

My solution

Feature contribution

Partial dependence

Local rule extraction

Fraud team happy! 🎉

Questions?

Visual exploration ofmachine learning explanations

Problem

Configuration view

Feature view

Context view

Context view

Context view

Use case

Questions?

Applications

Machine Learning Interpretability through Contribution-Value Plots

How?

How?

How?

How?

How?

How?

How?

How?

Questions?

Strategy analysis formachine learning interpretability

How?

My solution

Questions?

Conclusion

VISxAI NEC Labs Europe

More from iamdecode

Fraud detection explanations
for sick-leave insurances

Visual exploration of
machine learning explanations

Strategy analysis for
machine learning interpretability