New AI Culture
| For customers | For real |
|---|---|
| Deep Learning | Logistic Regression |
| Machine Learning | Logistic Regression |
| NLP | Regular expressions |
| Domain adaptation | Handcrafted hacks |
| Magic | Matrix multiplication |
| AI | Any random sh*t |
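# Train the "deep learning" model: plain logistic regression on Iris.
import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

X, y = load_iris(return_X_y=True)
clf = LogisticRegression()
clf.fit(X, y)
print(clf.score(X, y))  # accuracy on the training data
# Persist the fitted model so a separate script can load it.
joblib.dump(clf, "my_deeplearning_model")

$ python3 awesome_predict.py \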
--model "my_deeplearning_model" \
--input "1.csv" \
--output "1.predictions.csv"
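
The slides do not show `awesome_predict.py` itself. A minimal sketch of what such a script might look like follows, assuming the input CSV has no header and contains only numeric feature columns, and that one predicted label per row is written to the output file. The helper name `predict_file` is hypothetical; the three flags are the ones used in the command above.

```python
import argparse
import csv


def predict_file(model, input_path, output_path):
    """Read numeric rows from a CSV, predict one label per row, write them out."""
    with open(input_path, newline="") as f:
        # Assumption: no header row, every cell is a numeric feature.
        rows = [[float(cell) for cell in row] for row in csv.reader(f) if row]
    # Skip the model call for an empty file; scikit-learn rejects empty input.
    predictions = model.predict(rows) if rows else []
    with open(output_path, "w", newline="") as f:
        writer = csv.writer(f)
        for label in predictions:
            writer.writerow([label])
    return len(rows)


def main(argv=None):
    parser = argparse.ArgumentParser(description="Batch prediction from a CSV file")
    parser.add_argument("--model", required=True, help="path written by joblib.dump")
    parser.add_argument("--input", required=True, help="CSV with feature rows")
    parser.add_argument("--output", required=True, help="CSV to write predictions to")
    args = parser.parse_args(argv)

    # Imported here so predict_file stays usable without joblib installed.
    import joblib

    model = joblib.load(args.model)
    count = predict_file(model, args.input, args.output)
    print(f"wrote {count} predictions to {args.output}")
```

To use it as a script, call `main()` under the usual `if __name__ == "__main__":` guard at the bottom of the file.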
GET /info
POST /predict
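
As a rough illustration of the REST option, the sketch below serves an info endpoint and a prediction endpoint using only the Python standard library. The JSON shapes (`{"rows": [...]}` in, `{"predictions": [...]}` out), the `info` dictionary, and the names `make_handler` and `make_server` are assumptions made for this example; the slides only name the endpoints.

```python
import json
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def make_handler(model, info):
    """Build a request handler class bound to one loaded model."""

    class Handler(BaseHTTPRequestHandler):
        def _send_json(self, status, payload):
            body = json.dumps(payload).encode("utf-8")
            self.send_response(status)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def do_GET(self):
            if self.path == "/info":
                self._send_json(200, info)
            else:
                self._send_json(404, {"error": "not found"})

        def do_POST(self):
            if self.path != "/predict":
                self._send_json(404, {"error": "not found"})
                return
            length = int(self.headers.get("Content-Length", 0))
            try:
                # Assumed request body: {"rows": [[f1, f2, ...], ...]}
                rows = json.loads(self.rfile.read(length))["rows"]
                # Convert numpy scalars to plain Python values for JSON.
                predictions = [
                    p.item() if hasattr(p, "item") else p
                    for p in model.predict(rows)
                ]
            except (ValueError, KeyError, TypeError) as exc:
                self._send_json(400, {"error": str(exc)})
                return
            self._send_json(200, {"predictions": predictions})

        def log_message(self, format, *args):
            pass  # keep the sketch quiet

    return Handler


def make_server(model, info, host="127.0.0.1", port=8000):
    """Create (but do not start) an HTTP server around the model."""
    return ThreadingHTTPServer((host, port), make_handler(model, info))
```

Calling `make_server(model, {"model": "my_deeplearning_model"}).serve_forever()` would start it; a production service would more likely sit behind a proper web framework and add input validation.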

Queue
[Diagram: storage and machines. Input files /a/1.csv and /a/2.csv sit in storage; output files /a/1.out.csv and /a/2.out.csv appear alongside them.]
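
One way to picture the queue setup is the in-process sketch below, where threads stand in for machines and the local filesystem stands in for shared storage. The function names, the `process` callback, and the use of `queue.Queue` are assumptions for illustration; a real deployment would use a message broker and separate hosts. Only the `/a/1.csv` to `/a/1.out.csv` naming comes from the slides.

```python
import queue
import threading
from pathlib import Path


def output_path_for(input_path):
    """Map /a/1.csv to /a/1.out.csv, the naming used in the diagram."""
    p = Path(input_path)
    return p.with_name(p.stem + ".out" + p.suffix)


def worker(jobs, process):
    """One 'machine': take paths off the queue until a None sentinel arrives."""
    while True:
        path = jobs.get()
        if path is None:
            return
        # Read the input from storage, process it, write the result next to it.
        result = process(Path(path).read_text())
        output_path_for(path).write_text(result)


def run_batch(paths, process, n_workers=2):
    """Enqueue every input path and wait for the workers to drain the queue."""
    jobs = queue.Queue()
    threads = [
        threading.Thread(target=worker, args=(jobs, process))
        for _ in range(n_workers)
    ]
    for t in threads:
        t.start()
    for path in paths:
        jobs.put(path)
    for _ in threads:
        jobs.put(None)  # one sentinel per worker
    for t in threads:
        t.join()
    return [output_path_for(p) for p in paths]
```

Here `process` would wrap the model, for example a function that parses the CSV text, calls `predict`, and returns the predictions as CSV text. Error handling and retries are left out of the sketch.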
| Data size | Model size | Batch/Online | Pick |
|---|---|---|---|
| Small | Small | Batch | * (even CLI) |
| Small | Small | Online | REST |
| Small | Large | Batch | REST |
| Small | Large | Online | REST |
| Large | Small | Batch | |
| Large | Small | Online | REST / SageMaker / |
| Large | Large | Batch | Reinvented wheel / REST / SageMaker |
| Large | Large | Online | REST / SageMaker |
Data size: 0 < Small < 0.5 TB, Large >= 0.5 TB
Model size: 0 < Small < 500 MB, Large >= 500 MB
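
The thresholds can be written down as a small helper. This sketch assumes the terabyte cutoff applies to data size and the megabyte cutoff to model size, and that the units are decimal (the slides do not say); all names are hypothetical.

```python
TB = 10**12  # assumption: decimal terabytes
MB = 10**6   # assumption: decimal megabytes

DATA_LARGE_FROM = TB // 2      # 0.5 TB
MODEL_LARGE_FROM = 500 * MB    # 500 MB


def size_class(n_bytes, large_from):
    """Return "Large" at or above the cutoff, "Small" below it."""
    if n_bytes <= 0:
        raise ValueError("size must be positive")
    return "Large" if n_bytes >= large_from else "Small"


def classify(data_bytes, model_bytes):
    """Return the (data size, model size) pair used as the table's first two columns."""
    return (
        size_class(data_bytes, DATA_LARGE_FROM),
        size_class(model_bytes, MODEL_LARGE_FROM),
    )
```

For example, `classify(10**9, 10**6)` describes 1 GB of data and a 1 MB model, which lands in the Small/Small rows of the table.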
REST is the most universal choice
Reinventing the wheel isn't always a bad idea
A DataSatanist should be able to solve such problems (or at least know about them)