The Algorithm with a Thousand Faces
Victor Sanches Portella
PhD Student in Computer Science @ UBC
October, 2019
At each round
Player chooses a point
Enemy chooses a function
Player suffers a loss
SIMULTANEOUSLY
Player
Enemy
!
!
CONVEX
Player and Enemy see
Enemy may be
Adversarial
An Online Convex Optimization Problem
convex set
set of convex functions
Player
Enemy
Player
Enemy
Experts
0.5
0.1
0.3
0.1
1
0
-1
1
Probabilities
Costs
Online Linear Regression
Player
Enemy
Regression Function
Query & Answer
Loss
Cost of always choosing
Player's Loss
Sublinear regret under mild conditions
Focus of this talk: algorithms for the Player
Hupefully efficiently implementable
Unified view of the algorithms from FTRL
Optimization for Big Data
Stochastic Gradient Descent
Adaptive Gradient Descent (AdaGrad)
Web Ad Placement
(Bandit - Limited Feedback)
Deep Nets Training
[Large Scale Distributed Deep Networks, Dean et. al. 12']
Computational Complexity
Approximately Maximum Flow
Robust Optimization
Competitive Analysis
Linear Spectral Sparsification
SDP Solver
QIP = PSPACE
k-server problem
"Boosting"
[QIP = PSPACE, Jain et. al. '09]
[k-server via multiscale entropic regularization, Bubeck et. al. '17]
[Spectral Sparsification and Regret Minimization Beyond Matrix Multiplicative Updates, Allen-Zhu, Liao, and Orecchia '16]
[A Combinatorial, Primal-Dual approach to Semidefinite Programs, Arora, Kale, Street '07]
[Electrical Flows, Laplacian Systems, and Faster Approximation of Maximum Flow in Undirected Graphs, Christiano et. al. '11]
[A Combinatorial, Primal-Dual approach to Semidefinite Programs, Arora, Kale, Street '07]
Enemy
Player
Enemy
Player
Fixed Regularizer
At round use regularizer
Regularizer Increment
Convex Function
Not clear in general
projection
Representation of derivative
What is
?
direction
Online Gradient Descent Update
point
functional
(Riesz Repr. Theorem)
functional
functional
Directional derivative of at
dual
primal
dual
dual
Bregman Divergence
Bregman Projection
1st-order Taylor
Bregman
Projection
Bregman
Projection
First round
First round
For
For
inside
outside
inside
outside
Normal Cone
Subgradients
inside
outside
The Algorithm with a Thousand Faces
Victor Sanches Portella
PhD Student in Computer Science @ UBC
October, 2019
Limited Feedback: Bandit, two-point Bandit feedback
Special Cases: Combinatorial, other specific settings
Player
Drop or Add Hypotheses: Convexity, adversarial enemies,
Hypercube
L2-Ball
Change Metric: Policy Regret, Raw Loss
side information
strictly convex and differentiable on
For every
there is
such that
Bregman Projections onto attained by
Bregman Projector
First round
Round
for
Mirror Map Increments