Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro | Nicholas Bishop | Joel Dyer |
Yorgos Felekis | Anisoara Calinescu | Michael Wooldridge |
Theodoros Damoulas |
Introduction
Causal Abstraction
Causal Abstraction
Subset of relevant variables in
Surjective map between abstract and relevant variables
Collection of maps, one for each abstract variable
Domain of mapped variables
Domain of abstract variable
Causal Abstraction
Causal Abstraction
Causal Abstraction
Causal Abstraction
Causal Abstraction
Causal Abstraction
Interventional Consistency
Interventional Consistency
Interventional Consistency
Interventional Consistency
Interventional Consistency
Interventional Consistency
Interventional Consistency
Interventional Consistency
IC Error
IC Error
Intervene then abstract
IC Error
Intervene then abstract
Abstract then intervene
IC Error
Intervene then abstract
Abstract then intervene
Wasserstein
distance
CMABs
Regret
Final randomised action
Average reward of best action
Average reward of chosen action
Regret
CAMABS
Can we transfer information across CMABS?
Transfer of Optimal Action
Transfer of Optimal Action
Flipping Interventions
Merging Rewards
Reward Discrepancy
Before abstraction
After abstraction
A Triangle Inequality
Transfer of Actions
The Imitation Algorithm
Performance
Performance
Intuition
You need to pull more arms in the base model than in the abstract model
Fixed cost for sampling more arms
Intuition
Scaling cost associated with suboptimality gaps of each arm
Representatives of abstract interventions may have very large suboptimality gaps
You need to pull more arms in the base model than in the abstract model
Fixed cost for sampling more arms
Intuition
Scaling cost associated with suboptimality gaps of each arm
Representatives of abstract interventions may have very large suboptimality gaps
You need to pull more arms in the base model than in the abstract model
Fixed cost for sampling more arms
Optimal arm preservation is required!
Transfer of Expectations
The Transfer Algorithm
The Transfer Algorithm
The Transfer Algorithm
What arms can we eliminate?
Abstracting Expectations
Abstract Expectations
Approximation Quality
Approximation Quality
Confidence in our approximation
of
Number of times was played
Approximation Quality
The average error of the best linear interpolator for intervention
Eliminating arms
Cases
Summary