Symanto Research
Reading Group
Friday 08, February 2019
presented by Angelo Basile
What this presentation is not about
A disclaimer on Meta
What this presentation IS about
source: https://en.wikipedia.org/wiki/Technical_debt
ML? Really?
Problems and possible solutions
Complex Models Erode Boundaries
traditional SE practice
ML
Data Dependencies Cost More than Code Dependencies
import pandas vs. import numpy
PROBLEM
SOLUTION
create a versioned copy of your data signals
PROBLEM
SOLUTION
do a feature ablation test
ML-System Anti-Patterns
PROBLEM
SOLUTION
package black-box packages into common API's.
Exactly what I did with BERT for EmoContext
PROBLEM
SOLUTION
think holistically about data, work closely with engineering team
PROBLEM
SOLUTION
see what you need and prune the unused branches
PROBLEM
SOLUTION
-
Configuration Debt
PROBLEM
SOLUTION
Dealing with Changes in the External World
Other
Conclusions