Intrinsic social motivation

via causal influence

in multi-agent RL

 

Natasha Jaques, Angeliki Lazaridou,

Edward Hughes, Çaglar Gulçehre et al.

Presented by

Breandan Considine

2005

Intrinsically motivated Reinforcement Learning

2005

2008

Intrinsic motivation

Novelty/Surprise

1. Barto, A. et al. 2013. Novelty or Surprise? Frontiers in Psychology.

2. Houthooft, R. et al. 2016. Information Maximizing Exploration. Advances in Neural Information Processing Systems.

3. Itti, L. and  Baldi, P. Bayesian surprise attracts human attention. Vision Research, 49(10):1295–1306, 2009.

4. Conti, et al. Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents. CoRR, abs/1712.06560.

Empowerment

1. Klyubin, A.S., et al. 2005. All else being equal be empowered. In European Conference on Artificial Life (pp. 744–753).

2. Mnih, V., et al., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540), p.529.

3. Klyubin, A.S., et al. 2008. Keep your options open: An information-based driving principle for sensorimotor systems.

4. Jung, T., et al., 2011. Empowerment for continuous agent — environment systems. Adaptive Behavior, 19(1), pp.16–39.

2016

Unifying count-based exploration and intrinsic motivation

2017

Surprise-based intrinsic motivation

for deep reinforcement learning

2017

2017

2017

Sequential social dilemmas

Notation

 <State, Transition, Action, reward>

 

The actions of all N agents are combined to form a joint action

 

 

 

Discounted future rewards

 

Extrinsic and Intrinsic rewards

 

Trajectories

 

 

 

 

How would B’s action change if

I had acted differently in this situation?

Averaging over multiple counterfactuals

Averaging over multiple counterfactuals

Mutual information and empowerment

Sequential social dilemmas

Model of other agents

References

Intrinsic social motivation via causal influence in MARL

By Breandan Considine

Intrinsic social motivation via causal influence in MARL

  • 184