Meta-learning Spiking Neural Networks with Surrogate Gradient Descent

Research salon

Kenneth Stewart, Emre Neftci

https://arxiv.org/abs/2201.10777

What is meta-learning?

  • Deep learning works well when we have lots of data available

  • What to do when we have little data? -> use data from related tasks to learn how to learn, then learn new tasks more efficiently (episode structure sketched after this list)

  • what is learned can be an optimizer, a model (e.g. its initialization, as in MAML), or a learning rule

  • in practice, closely related to multi-task learning
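
A minimal sketch of how such task-based training data is typically organized: each training episode is a small N-way K-shot classification task with a support set (for adaptation) and a query set (for evaluation). Function and variable names here are illustrative, not from the paper:

```python
import random

def sample_episode(dataset_by_class, n_way=5, k_shot=1, k_query=5):
    """Draw one N-way K-shot task from a dict mapping class -> examples."""
    classes = random.sample(list(dataset_by_class), n_way)
    support, query = [], []
    for new_label, cls in enumerate(classes):
        examples = random.sample(dataset_by_class[cls], k_shot + k_query)
        support += [(x, new_label) for x in examples[:k_shot]]
        query += [(x, new_label) for x in examples[k_shot:]]
    return support, query  # adapt on support, evaluate on query
```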

Why might meta-learning be a good idea?

  • we may have only a little data for a given task (e.g. medical image recognition), but much more for related tasks (generic image classification)

  • then only a small amount of labeled data for the specific task is needed (fine-tuning)

  • higher model prediction accuracy by optimizing the learning hyperparameters

  • more general models: meta-learning does not focus on training one model on one specific dataset, but on learning across many tasks

Model-Agnostic Meta-Learning (MAML)

  • aim: train models that can achieve rapid adaptation to new tasks

  • idea: maximize the sensitivity of the loss functions of new tasks with respect to the parameters: when the sensitivity is high, small local changes to the parameters can lead to large improvements in the task loss (see the update rules below)
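
For reference, these are the two coupled MAML updates (from Finn et al., 2017, which introduced MAML); \(\alpha\) and \(\beta\) are the inner- and outer-loop learning rates, and \(\mathcal{T}_i\) is a task drawn from the task distribution \(p(\mathcal{T})\):

```latex
% Inner loop: adapt \theta to task \mathcal{T}_i with one gradient step
\theta_i' = \theta - \alpha \, \nabla_\theta \, \mathcal{L}_{\mathcal{T}_i}(f_\theta)

% Outer loop: update \theta through the adapted parameters \theta_i'
\theta \leftarrow \theta - \beta \, \nabla_\theta \sum_{\mathcal{T}_i \sim p(\mathcal{T})} \mathcal{L}_{\mathcal{T}_i}(f_{\theta_i'})
```

The outer gradient is taken through the inner update, which is where the second-order derivative mentioned in the summary comes from.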

Summary

  • + easy to apply to any architecture or loss function
  • - hard to optimize, sensitive to hyperparameters
  • - requires second-order derivatives (see the sketch below)
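
To make the second-order point concrete, here is a minimal, hypothetical MAML meta-step in PyTorch on a toy regression task family (an illustrative sketch, not the paper's code; the paper applies the same scheme to spiking networks via surrogate gradients):

```python
import torch

alpha, beta = 0.01, 0.001          # inner/outer learning rates (illustrative)

model = torch.nn.Linear(1, 1)      # toy model standing in for any network
params = list(model.parameters())  # [weight, bias]
meta_opt = torch.optim.SGD(params, lr=beta)

def task_loss(weights, x, y):
    # Functional forward pass so the loss can be evaluated at adapted weights.
    w, b = weights
    return torch.nn.functional.mse_loss(x @ w.t() + b, y)

def make_episode():
    # Hypothetical task family: y = a * x with a random slope a per task.
    a = torch.randn(1)
    x_s, x_q = torch.randn(5, 1), torch.randn(5, 1)
    return x_s, a * x_s, x_q, a * x_q

meta_opt.zero_grad()
for x_s, y_s, x_q, y_q in [make_episode() for _ in range(4)]:
    # Inner loop: one adaptation step on the support set.
    support_loss = task_loss(params, x_s, y_s)
    # create_graph=True keeps this gradient differentiable; backpropagating
    # through it below is exactly the second-order derivative MAML needs.
    grads = torch.autograd.grad(support_loss, params, create_graph=True)
    adapted = [p - alpha * g for p, g in zip(params, grads)]
    # Outer loop: score the adapted parameters on the query set.
    task_loss(adapted, x_q, y_q).backward()
meta_opt.step()
```

Dropping create_graph=True yields the common first-order approximation, which avoids the second-order cost at some loss of fidelity.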

 
