Sarah Dean PRO
asst prof in CS at Cornell
Prof. Sarah Dean
MW 2:45-4pm
110 Hollister Hall
0. Announcements & Recap
1. Motivation: Context
2. Setting: Contextual Bandits
3. Naive Approach
4. Function Approximation
Prelim corrections due 4/12
5789 Paper Review Assignment (weekly pace suggested)
A simplified setting for studying exploration
Explore-then-Commit
Upper Confidence Bound
For \(t=1,...,T\):
Explore-then-Commit
Upper Confidence Bound
For \(t=1,...,T\):
Set exploration \(N \approx T^{2/3}\),
\(R(T) \lesssim T^{2/3}\)
\(R(T) \lesssim \sqrt{T}\)
Example: online advertising
Journalism
Programming
"Arms" are different job ads:
But consider different users:
CS Major
English Major
Example: online shopping
"Arms" are various products
But what about search queries, browsing history, items in cart?
Example: social media feeds
"Arms" are various posts: images, videos
Personalized to each user based on demographics, behavioral data, etc
By Sarah Dean