New AI Culture
For customers | For real |
---|---|
Deep Learning | Logistic Regression |
Machine Learning | Logistic Regression |
NLP | Regular expressions |
Domain adaptation | Handcrafted hacks |
Magic | Matrix multiplication |
AI | Any random sh*t |
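import joblib
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

# Load the iris dataset as a feature matrix X and a label vector y
X, y = load_iris(return_X_y=True)
# Fit a plain logistic regression classifier
clf = LogisticRegression()
clf.fit(X, y)
# Mean accuracy on the training data itself, not a held-out estimate
print(clf.score(X, y))
# Persist the fitted model to disk
joblib.dump(clf, "my_deeplearning_model")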
$ python3 awesome_predict.py \
--model "my_deeplearning_model" \
--input "1.csv" \
--output "1.predictions.csv"
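The command above implies a small script that loads the dumped model, reads a CSV, and writes predictions. The slides do not show `awesome_predict.py` itself, so what follows is only a sketch under assumptions: the input CSV is taken to be headerless numeric feature rows, the output gets one prediction per line, and the function names (`build_parser`, `load_model`, `predict_file`, `main`) are hypothetical.

```python
import argparse
import csv


def build_parser():
    """Parser for the three flags shown in the command above."""
    parser = argparse.ArgumentParser(description="Batch prediction over a CSV file")
    parser.add_argument("--model", required=True, help="path to a joblib-dumped model")
    parser.add_argument("--input", required=True, help="CSV of numeric feature rows")
    parser.add_argument("--output", required=True, help="CSV to write predictions to")
    return parser


def load_model(path):
    """Load the model; joblib is imported lazily so --help works without it."""
    import joblib
    return joblib.load(path)


def predict_file(model, input_path, output_path):
    """Read feature rows, call model.predict once, write one prediction per row."""
    # Assumption: no header row, every cell parses as a float.
    with open(input_path, newline="") as f:
        rows = [[float(v) for v in row] for row in csv.reader(f) if row]
    predictions = model.predict(rows) if rows else []
    with open(output_path, "w", newline="") as f:
        writer = csv.writer(f)
        for p in predictions:
            writer.writerow([p])
    return len(rows)


def main(argv=None):
    """Entry point: parse flags, load the model, run the batch."""
    args = build_parser().parse_args(argv)
    model = load_model(args.model)
    n = predict_file(model, args.input, args.output)
    print(f"wrote {n} predictions to {args.output}")
```

To use it as a script, call `main()` under an `if __name__ == "__main__":` guard. Keeping `predict_file` separate from argument parsing means the same function can be reused behind a REST endpoint or a queue worker.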
GET /info
POST /predict
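The two endpoints above could be served with nothing but the standard library. The sketch below is one possible shape, not the service from the talk: the JSON request body `{"rows": [[...]]}`, the response fields, and the names `SumModel`, `make_handler`, and `make_server` are all assumptions, and `SumModel` is a stand-in so the example runs without scikit-learn.

```python
import json
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class SumModel:
    """Stand-in model (hypothetical); any object with predict(rows) would do."""

    def predict(self, rows):
        return [sum(row) for row in rows]


def make_handler(model):
    """Create a request handler class bound to one loaded model."""

    class Handler(BaseHTTPRequestHandler):
        def _send_json(self, status, payload):
            body = json.dumps(payload).encode("utf-8")
            self.send_response(status)
            self.send_header("Content-Type", "application/json")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def do_GET(self):
            if self.path == "/info":
                self._send_json(200, {"model": type(model).__name__})
            else:
                self._send_json(404, {"error": "not found"})

        def do_POST(self):
            if self.path != "/predict":
                self._send_json(404, {"error": "not found"})
                return
            length = int(self.headers.get("Content-Length", 0))
            try:
                rows = json.loads(self.rfile.read(length))["rows"]
                preds = model.predict(rows)
            except (ValueError, KeyError, TypeError):
                self._send_json(400, {"error": 'expected JSON like {"rows": [[...]]}'})
                return
            # NumPy arrays are not JSON-serializable, so convert when possible.
            preds = preds.tolist() if hasattr(preds, "tolist") else list(preds)
            self._send_json(200, {"predictions": preds})

        def log_message(self, format, *args):
            pass  # keep the sketch quiet; a real service should log requests

    return Handler


def make_server(model, host="127.0.0.1", port=8000):
    """Build the server; pass port=0 to let the OS pick a free port."""
    return ThreadingHTTPServer((host, port), make_handler(model))
```

Calling `make_server(SumModel()).serve_forever()` starts the service; swapping `SumModel()` for a model loaded with `joblib.load` should work as long as it accepts a list of feature rows.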
Diagram: Queue, Storage, Machines. Storage holds the inputs /a/1.csv and /a/2.csv; later frames add the outputs /a/1.out.csv and /a/2.out.csv.
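The queue, storage, and machines setup can be imitated in a single process to show the flow. This is a toy sketch under assumptions: a `dict` stands in for shared storage, threads stand in for machines, and `output_key`, `worker`, and `run_batch` are hypothetical names. Error handling and retries are left out.

```python
import queue
import threading


def output_key(input_key):
    """Map '/a/1.csv' to '/a/1.out.csv', the naming used above."""
    stem, dot, ext = input_key.rpartition(".")
    return f"{stem}.out.{ext}" if dot else f"{input_key}.out"


def worker(tasks, storage, lock, predict):
    """One 'machine': take keys off the queue until a None sentinel arrives."""
    while True:
        key = tasks.get()
        if key is None:
            tasks.task_done()
            return
        result = predict(storage[key])
        with lock:
            storage[output_key(key)] = result
        tasks.task_done()


def run_batch(storage, keys, predict, n_machines=2):
    """Queue every input key, start the workers, and wait until all are done."""
    tasks = queue.Queue()
    lock = threading.Lock()
    threads = [
        threading.Thread(target=worker, args=(tasks, storage, lock, predict))
        for _ in range(n_machines)
    ]
    for t in threads:
        t.start()
    for key in keys:
        tasks.put(key)
    for _ in threads:
        tasks.put(None)  # one sentinel per worker so each one exits
    tasks.join()
    for t in threads:
        t.join()
    return storage
```

In a real deployment the queue would be a message broker, the storage an object store or shared file system, and each worker a separate machine, but the contract stays the same: a task names an input, and the worker writes the matching output next to it.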
Data size | Model size | Batch/Online | Pick |
---|---|---|---|
Small | Small | Batch | * (even CLI) |
Small | Small | Online | REST |
Small | Large | Batch | REST |
Small | Large | Online | REST |
Large | Small | Batch | |
Large | Small | Online | REST / SageMaker / |
Large | Large | Batch | Homegrown solution / REST / SageMaker |
Large | Large | Online | REST / SageMaker |
Data size:
0 < Small < 0.5 TB
Large >= 0.5 TB
Model size:
0 < Small < 500 MB
Large >= 500 MB
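The thresholds can be written down as a small helper. This sketch assumes the 0.5 TB cut-off applies to data size and the 500 MB cut-off to model size (matching the column order of the table), and that both use decimal units; the names `size_class` and `classify` are hypothetical.

```python
DATA_LARGE_BYTES = 500 * 10**9   # 0.5 TB, assuming decimal units
MODEL_LARGE_BYTES = 500 * 10**6  # 500 MB, assuming decimal units


def size_class(n_bytes, large_threshold):
    """Return 'Small' for 0 < n_bytes < threshold, 'Large' otherwise."""
    if n_bytes <= 0:
        raise ValueError("size must be positive")
    return "Large" if n_bytes >= large_threshold else "Small"


def classify(data_bytes, model_bytes):
    """Return the (data size, model size) pair used to pick a table row."""
    return (
        size_class(data_bytes, DATA_LARGE_BYTES),
        size_class(model_bytes, MODEL_LARGE_BYTES),
    )
```

For example, `classify(10**9, 10**6)` describes a 1 GB dataset with a 1 MB model, which falls in the Small / Small rows.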
REST is the most universal choice
A homegrown solution (reinventing the wheel) isn't always a bad idea
A DataSatanist should be able to solve such problems (or at least know about them)