Follow The Regularized Leader
The Algorithm with a Thousand Faces
Victor Sanches Portella
PhD Student in Computer Science @ UBC
October, 2019
Online Convex Optimization
Online Convex Optimization (OCO)
At each round
Player chooses a point
Enemy chooses a function
Player suffers a loss
SIMULTANEOUSLY
Player
Enemy
!
!
CONVEX
Player and Enemy see
Enemy may be
Adversarial
Formalizing Online Convex Optimization
An Online Convex Optimization Problem
convex set
set of convex functions
Player
Enemy
Rounds
Expert's Problem
Player
Enemy
Experts
0.5
0.1
0.3
0.1
1
0
-1
1
Probabilities
Costs
Online Regression
Online Linear Regression
Player
Enemy
Regression Function
Query & Answer
Loss
We want to predict the answer based on the query
Regret
Cost of always choosing
Goal: sublinear Regret
Player's Loss
Player Strategies
Sublinear regret under mild conditions
Focus of this talk: algorithms for the Player
Hupefully efficiently implementable
Unified view of the algorithms from FTRL
Motivation
"Why should I care?"
OCO in Practice
Optimization for Big Data
Stochastic Gradient Descent
Adaptive Gradient Descent (AdaGrad)
Web Ad Placement
(Bandit - Limited Feedback)
Deep Nets Training
[Large Scale Distributed Deep Networks, Dean et. al. 12']
Applications of OCO in Other Areas
Computational Complexity
Approximately Maximum Flow
Robust Optimization
Competitive Analysis
Linear Spectral Sparsification
SDP Solver
QIP = PSPACE
k-server problem
"Boosting"
[QIP = PSPACE, Jain et. al. '09]
[k-server via multiscale entropic regularization, Bubeck et. al. '17]
[Spectral Sparsification and Regret Minimization Beyond Matrix Multiplicative Updates, Allen-Zhu, Liao, and Orecchia '16]
[A Combinatorial, Primal-Dual approach to Semidefinite Programs, Arora, Kale, Street '07]
[Electrical Flows, Laplacian Systems, and Faster Approximation of Maximum Flow in Undirected Graphs, Christiano et. al. '11]
[A Combinatorial, Primal-Dual approach to Semidefinite Programs, Arora, Kale, Street '07]
Adaptive
FTRL
Cummulative Loss
Experts
Follow the Leader
Enemy
Player
UNSTABLE!
Adding Regularization
Enemy
Player
FTRL
Fixed Regularizer
Regret Upper-Bounds for Experts
Experts
Can we do better?
Adding Adaptive Regularization
At round use regularizer
?
Regularizer Increment
Convex Function
AdaFTRL
Efficiently computable?
Not clear in general
Online Mirror Descent
Online (sub)Gradient Descent
Round
projection
Another Perspective
Representation of derivative
What is
?
direction
Online Gradient Descent Update
point
functional
(Riesz Repr. Theorem)
functional
functional
Directional derivative of at
dual
primal
dual
dual
Avoiding Inner-Product
What if we make other choices for ?
How to make projections w.r.t. ?
Avoiding Inner-Product
Dual
Primal
?
Bregman Divergence
Bregman Divergence
Bregman Projection
1st-order Taylor
Online Mirror Descent
Bregman
Projection
Dual
Primal
Adaptive?
Adaptive!
Lazy Online Mirror Descent
Bregman
Projection
Classic Online Mirror Descent
First round
First round
For
For
Eager
Lazy
Let us use FTRL to unify them
Only proof sketch of the talk
LOMD as FTRL
...
inside
outside
FTRL
EOMD as FTRL
inside
outside
Normal Cone
Subgradients
EOMD as FTRL
EOMD as FTRL
inside
outside
FTRL
EOMD vs LOMD
Eager = Lazy
A Genealogy of
Algorithms
A Bird's-eye View
Connection Among the Main Algorithms
Connection Among the Main Algorithms
AdaReg
AdaReg
Minimize size of (sub)gradients
Minimize "complexity" of
VS
FTRL
Follow The Regularized Leader
The Algorithm with a Thousand Faces
Victor Sanches Portella
PhD Student in Computer Science @ UBC
October, 2019
Algorithms We Shall See
Generalizations and Special Cases
Limited Feedback: Bandit, two-point Bandit feedback
Special Cases: Combinatorial, other specific settings
Player
Drop or Add Hypotheses: Convexity, adversarial enemies,
Hypercube
L2-Ball
Change Metric: Policy Regret, Raw Loss
side information
Mirror Maps
What if we make other choices for ?
strictly convex and differentiable on
For every
there is
such that
Bregman Projections onto attained by
Bregman Projector
Adaptive Online Mirror Descent
First round
Round
for
Mirror Map Increments
FTRL - The Algorithm with a Thousand Faces
By Victor Sanches Portella
FTRL - The Algorithm with a Thousand Faces
- 312