Martin Biehl and Nathaniel Virgo
Terminology:
Original framework used two insights:
Practical side of original framework:
Bayesian network
goal
policies
Bayesian network
goal
Bayesian network
goal
policies
Bayesian network
goal
policies
Bayesian network
goal
policies
Bayesian network
goal
policies
Multiple, possibly competing goals
Coordination and communication from an information theoretic perspective
Dynamic scalability of multi-agent systems
Dynamically changing goals that depend on knowledge acquired through observations
Example multi agent setups:
Two agents interacting with same environment
Two agents with same goal
Two agents with different goals
Example non-cooperative game: matching pennies
Example non-cooperative game: matching pennies
joint pdists \(p(a_1,a_2)\)
disjoint goal manifolds
agent manifold
\(p(a_1,a_2)=p(a_1)p(a_2)\)
EM
EM
Text
Text
2. Planning to learn / uncertain MDP, bandit example.
if x=1
if x=1
But for adding and removing agents probably needed
Thank you for your attention!