Research Guild Meeting
160Mb for 80,000 words
(d=500)
1. Find optimal codebooks
2. Find optimal codes
Goals:
$$d_w^i$$
is one-hot encoded vector
Predict
codes
Predict
codebooks
Original
embedding
Reconstructed
embedding
How to back-propagate through
discrete one-hot?
Gumbel-softmax trick!