• Top priority for AI: combinatorial generalisation.

"Humans’ capacity for combinatorial generalization depends critically on our cognitive mechanisms for representing structure and reasoning about relations"

Nature vs Nurture

  • Previous eras: the AI community focused on "nature": structured, hand-engineered approaches.
  • Modern deep learning: minimal "a priori" representation, enabled by cheap data and cheap compute.

Challenges of deep learning

  • Transfer learning, learning from small amounts of experience, and generalising beyond the training distribution remain difficult.

A hybrid approach

  • Need to return to structured architectures.
  • Capacity to perform computations over discrete entities and their relations.

Relational reasoning

  • Entity: an element with attributes.
  • Relation: a property between entities.
  • Rule: a function that maps entities and relations to other entities and relations:

F: (entities, relations) → (entities, relations)
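As a toy illustration (the names here are mine, not from the paper), a rule is just a function applied over entities and their relations:

```python
# Entities: elements with attributes. Relations: properties between them.
# The "rule" heavier_than is applied over all ordered pairs of entities.
entities = {"A": {"mass": 3.0}, "B": {"mass": 1.0}}

def heavier_than(e1, e2):
    # A relation: a property between two entities.
    return e1["mass"] > e2["mass"]

relations = {(x, y): heavier_than(entities[x], entities[y])
             for x in entities for y in entities if x != y}
# {("A", "B"): True, ("B", "A"): False}
```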

Inductive biases

An inductive bias allows a learning algorithm to prioritize one solution over another, independent of the observed data.

Examples:

  • Prior distribution
  • Regularisation term (e.g. an L2 penalty; see the sketch after this list)
  • Least-squares fitting
  • etc.
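For instance, the L2 penalty in ridge regression is an inductive bias: among all weight vectors consistent with the data, it prefers small ones. A minimal sketch with made-up data:

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))                  # 50 samples, 5 features
y = X @ np.array([1.0, 0, 0, 0, 0]) + 0.1 * rng.normal(size=50)

# lam controls the strength of the bias: larger values push the weights
# towards zero regardless of what the data says.
lam = 1.0
w = np.linalg.solve(X.T @ X + lam * np.eye(5), X.T @ y)
```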

Inductive biases in neural networks

For each architecture: what are the entities, what are the relations, what are the rules for composing them, and what are the implications?

How can we impose arbitrary biases in our learning algorithm?

(Example: computing the centre of mass of an N-body system; see the sketch below.)
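The centre of mass is a function over a set of entities (bodies with masses and positions) whose value does not depend on the order in which the bodies are listed — exactly the kind of relational structure a model can exploit:

```python
import numpy as np

masses = np.array([1.0, 2.0, 3.0])
positions = np.array([[0.0, 0.0],
                      [1.0, 0.0],
                      [0.0, 1.0]])

# Centre of mass: a permutation-invariant aggregation over entities.
com = (masses[:, None] * positions).sum(axis=0) / masses.sum()

# Reordering the bodies leaves the result unchanged.
perm = np.array([2, 0, 1])
com_perm = (masses[perm][:, None] * positions[perm]).sum(axis=0) / masses.sum()
assert np.allclose(com, com_perm)
```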

Graphs

Graph networks

Graph neural networks have been used extensively and successfully.

The "graph network (GN)" generalises and extends them, allowing to build complex architecture from simple blocks.

Example:

Predicting the movement of rubber balls connected by springs in a gravitational field:

  • u (global attribute): the gravitational field
  • V (node attributes): each ball's position, velocity, mass, ...
  • E (edge attributes): the springs connecting the balls
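A minimal NumPy sketch of one GN block, following the paper's update order (per-edge update, then per-node update, then global update), assuming sum aggregation; the three update functions are left as arguments and would be small neural networks in practice:

```python
import numpy as np

def gn_block(u, V, E, senders, receivers, phi_e, phi_v, phi_u):
    """u: global attributes; V: (n_nodes, d) node attributes;
    E: (n_edges, d) edge attributes; senders/receivers: NumPy index
    arrays giving each edge's endpoint nodes."""
    # 1. Per-edge update: each edge sees its attributes, both endpoint
    #    nodes, and the global attribute.
    E_new = np.stack([phi_e(E[k], V[senders[k]], V[receivers[k]], u)
                      for k in range(len(E))])

    # 2. Per-node update: sum incoming edges (permutation invariant),
    #    then update each node.
    V_new = []
    for i in range(len(V)):
        incoming = E_new[receivers == i]
        e_bar = incoming.sum(axis=0) if len(incoming) else np.zeros(E_new.shape[1])
        V_new.append(phi_v(e_bar, V[i], u))
    V_new = np.stack(V_new)

    # 3. Global update: aggregate all edges and nodes, then update u.
    u_new = phi_u(E_new.sum(axis=0), V_new.sum(axis=0), u)
    return u_new, V_new, E_new
```

In the balls-and-springs example, u would hold the gravitational field, V each ball's state, and E the spring attributes.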

Relational inductive biases in graph networks

  • We can express arbitrary relationships between entities.
  • Invariant under permutations of nodes and edges (see the check after this list).
  • A GN's per-edge and per-node functions are reused across the whole graph.
  • A single GN can operate on graphs of different sizes and shapes.
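The permutation-invariance claim can be checked directly: sum aggregation, as used in the sketch above, is indifferent to the order of the incoming edges:

```python
import numpy as np

# Incoming edge "messages" to a node; their ordering is arbitrary.
messages = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
perm = np.array([2, 0, 1])

# The aggregated result is identical under any permutation, so a GN
# built on sum aggregation inherits this invariance.
assert np.allclose(messages.sum(axis=0), messages[perm].sum(axis=0))
```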

Design principles for graph network architectures

  • The global, node, and edge attributes of a GN block can use arbitrary representational formats.
  • Data may or may not come naturally represented as a graph.
  • Multiblock architectures: GN blocks can be composed and stacked, letting information propagate further across the graph with each step (see the sketch after this list).
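A sketch of a multiblock composition, reusing gn_block from the earlier sketch with toy update functions (all attributes share one dimension purely to keep the example small):

```python
import numpy as np

def phi_e(e, v_s, v_r, u):   # toy edge update: mix edge and endpoint info
    return e + 0.1 * (v_s + v_r)

def phi_v(e_bar, v, u):      # toy node update: fold aggregated messages in
    return v + 0.1 * e_bar

def phi_u(e_bar, v_bar, u):  # toy global update
    return u + 0.1 * v_bar

u = np.zeros(2)
V = np.array([[0.0, 1.0], [1.0, 0.0], [0.5, 0.5]])   # 3 nodes
E = np.zeros((2, 2))                                  # 2 edges
senders, receivers = np.array([0, 1]), np.array([1, 2])

# Stacking three blocks: information propagates one hop further per block.
for _ in range(3):
    u, V, E = gn_block(u, V, E, senders, receivers, phi_e, phi_v, phi_u)
```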

Conclusion

  • A vast gap between human and machine intelligence remains, especially with respect to efficient, generalizable training.
  • We explored flexible learning-based approaches which implement strong relational inductive biases.
  • Graph networks naturally support combinatorial generalisation and improve sample efficiency compared with standard ML approaches.
