Methods, Tools, and Use Cases of Deploying Machine Learning Models to Production

Nawfal Tachfine

Data Scientist

Company

hybrid business model
- we buy and sell cars online
- 28 dealerships nationwide
a car sold every 5 minutes
400+ employees

About

Data Team

1 chief data officer
1 manager
3 data engineers
6 data scientists
1 architect

Data Science @AramisAuto

Mission - leveraging data to create business value
Approach
- measurement based
- business driven
- iterative (business problem → production < 3 months)
Perspectives
- Products
  - Which cars to buy?
  - At which price to sell?
- Customers
  - Which leads to call?
  - How to distribute acquisition resources?

Infrastructure

METHODOLOGY

Typical Data Science Workflow

formulate

problem

get data

build and clean dataset

study dataset

train model

+ feature selection

+ algorithm selection

+ hyperparameter optimization

Then what?

Enter Industrialization...

Goal

automatically serve predictions to any given information system

Constraints

Data - source refresh rate, enrichment
Model - stability, maintainability
Operations - scalability, resilience, availability
Resources - no dedicated developers

Philosophy

start from final use case and work your way back to data preparation

Conception

On demand - client/server architecture
- Complete features → REST API
- Partial features → enrichment necessary
  - Lookup sufficient → REST API + key-value data store
  - Complex enrichment from database → API usable but slow
Trigger-based (time/event) → batch mode
- Computationally efficient
- Can serve predictions directly to destination

When will predictions be computed?

Design Patterns

Think about your production use-case as early as possible.
Your data pre-processing pipelines must be deterministic and reproduce the same conditions as in training.
- make your transformations robust against abnormal values
- variable order matters (cf. scikit-learn)
Log as much as you can.
Make sure your development and production environments are identical (esp. framework versions!)
Write clean code: your models will evolve (re-training, new features, ... etc)
Keep things simple - stability trumps complexity.

TOOLS

Dataiku

DSS Scenarios
- customizable (cf. global variables)
- simple to integrate
- API, triggers, and reporters → easy scheduling and monitoring
API Scoring Node
- easy to deploy and update models
- lightweight
- compatible with DSS visual models
Automation Node
- data-change triggers
- built-in orchestration
- advanced monitoring

Toolbox

Flask - a python web application microframework

simple,
lightweight,
powerful

Serialization - persisting models on disk

pickle / joblib dump
POJO objects for JVM-based environments
PMML

@app.route('/api/v1.0/aballone', methods=['POST'])

def index():
    query = request.get_json()['inputs']
    data = prepare(query)
    output = model.predict(data)
    return jsonify({'outputs': output})

if __name__ == '__main__':
    app.run(host='0.0.0.0')

JSON - universal I/O format
Docker - soft virtualization:
- no more dependency problems
- runs (almost) anywhere
AWS EC2 - flexible cloud computing resources
- spot instances for cost optimization
AWS Elastic Beanstalk - web app deployment and scaling
- (very) easy to use
- (relatively) cheap
- provides auto-scaling and load-balancing

Redis - powerful in-memory key-value data store
Elasticsearch - NoSQL text search engine for JSON documents

Toolbox (cont.)

USE CASES

Use case 1 - Pricing

Goal - providing price estimates for clients' used cars online
Use - web service callable by website
Features - 14 car descriptors (brand, model, age, energy, trim, mileage, transmission, engine)
Model/Performance - XGBoost regression, test MAPE <5%
Deployment Specs - response <100ms, JSON I/O
Solution - dockerized API Scoring Node on AWS Elastic Beanstalk, manual model deployment

Dataiku

API Node

query

prediction

Use case 2 - Inventory Turnover

Goal - predicting how long a car would take to be sold
Use - web service callable by internal quote generation tool
Features - car characteristics, competition descriptors (25 total)
Model/Performance - XGBoost classification, test accuracy 76%
Deployment Specs - response <5s, JSON I/O
Solution - dockerized Flask enrichment API via AWS Redshift, dockerized API Scoring Node on AWS Elastic Beanstalk

API Node

query

predictions

Backoffice

Redshift

Master API

partial features

full features

features

predictions

Use case 3 - Lead Scoring

Goal - detecting leads most likely to transform into customers
Use - sorting lead portfolios for sales representatives in Salesforce
Features - 11 offline (localization, email, phone calls, ...) + 47 online (visits, searches, viewed pages, quotes, vehicle preferences, ...)
Model/Performance - scikit-learn RandomForest classification (unbalanced), 75% of test set targets in top 10% scores
Deployment Specs - history-based enrichment, fresh data every 15min
Solution - Flask app, batch mode, loop run, dataprep in Redshift, result sync to Salesforce, Rundeck deployment on EC2

Redshift

Scoring App

leads

features

scores

Use case 4 - Visitor Scoring

Goal - detecting visitors most likely to transform into leads
Use - selecting visitors to target in SEM/Retargeting campaigns
Features - online behavior descriptors + acquisition channels interactions
Model/Performance - scikit-learn RandomForest classification (unbalanced), 79% of test set targets in top 3% scores
Deployment Specs - history-based enrichment, fresh data every 15min
Solution - [TEST] DSS daily Scenario, Redshift dataprep, sync to Redis

Redshift

DSS Scenario

lookup

score

features

scores

Infrastructure (again)

@NawfalTachfine

Machine Learning in Production at Aramisauto.com - DUG 06/2017

By Nawfal Tachfine

Machine Learning in Production at Aramisauto.com - DUG 06/2017

4,355

Nawfal Tachfine

Data Scientist

Methods, Tools, and Use Cases of Deploying Machine Learning Models to Production

Nawfal Tachfine

Company

About

Data Team

Data Science @AramisAuto

Infrastructure

METHODOLOGY

Typical Data Science Workflow

Then what?

Enter Industrialization...

Goal

Constraints

Philosophy

Conception

When will predictions be computed?

Design Patterns

TOOLS

Dataiku

DSS Scenarios

API Scoring Node

Automation Node

Toolbox

Flask - a python web application microframework

Serialization - persisting models on disk

Toolbox (cont.)

USE CASES

Use case 1 - Pricing

Dataiku

API Node

Use case 2 - Inventory Turnover

API Node

Backoffice

Redshift

Master API

Use case 3 - Lead Scoring

Redshift

Scoring App

Use case 4 - Visitor Scoring

Redshift

DSS Scenario

Infrastructure (again)

Machine Learning in Production at Aramisauto.com - DUG 06/2017

More from Nawfal Tachfine