Patrick Merlot
Data Science, Computational Science, Electronic Systems, IoT, Mobile Communication, Dialectical materialism
Patrick Merlot (Feb. 2016)
Data Scientist intern @ iKnow Solutions Norge AS
source: TheTransportPolitic
The Goal
Facilitate the maintenance of a bike-sharing system
Key Metrics — What to improve?
Bikes/Docks availability @ Market at Sansome
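One way to make the availability metric concrete is to measure, for a single station, the share of readings in which no bike (or no dock) was free. The sketch below assumes each reading is a (free_bikes, free_docks) pair; the function name and the sample values are illustrative and do not come from the project data.

```python
from typing import Iterable, Tuple


def availability_metrics(samples: Iterable[Tuple[int, int]]) -> dict:
    """Return the share of readings where the station was empty or full.

    Each sample is a hypothetical (free_bikes, free_docks) pair,
    for example one reading per minute for a single station.
    """
    samples = list(samples)
    if not samples:
        raise ValueError("need at least one sample")
    n = len(samples)
    empty = sum(1 for bikes, _ in samples if bikes == 0)  # no bike to rent
    full = sum(1 for _, docks in samples if docks == 0)   # no dock to return to
    return {"empty_share": empty / n, "full_share": full / n}


# Illustrative readings (made up), not real data
readings = [(5, 10), (0, 15), (0, 15), (15, 0)]
metrics = availability_metrics(readings)
```

A maintenance team could track both shares per station over time; which threshold counts as a problem is a business decision the slides do not specify.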
Business levers — How to improve?
Visual & Intuitive Dashboard
Workflow: DATA COLLECTION -> DATA PREPARATION -> EXPLORATORY ANALYSIS -> MODELING -> RESULTS/VISUALIZATION
DATA COLLECTION
Historical data:
year 1: Aug. 2013 - Aug. 2014
year 2: Sep. 2014 - Aug. 2015
station data (5 KB): name, id, coordinates, #docks
weather data (155 KB): daily temperature, precipitation, ...
trip data (42 MB): tripID, start/stop date/station, userID, ...
historical status data (1.1 GB): #freeDocks, #freeBikes per minute
Real-time data (every minute): status data (JSON format)
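As an illustration of handling the per-minute JSON status feed, the sketch below parses one message into a typed record. The field names (station_id, free_bikes, free_docks, timestamp) and the example message are assumptions for illustration; the actual feed schema is not shown in the slides.

```python
import json
from dataclasses import dataclass


@dataclass
class StationStatus:
    """One status reading for one station (hypothetical schema)."""
    station_id: int
    free_bikes: int
    free_docks: int
    timestamp: str


def parse_status(message: str) -> StationStatus:
    """Parse a JSON status message; raises KeyError if a field is missing."""
    raw = json.loads(message)
    return StationStatus(
        station_id=int(raw["station_id"]),
        free_bikes=int(raw["free_bikes"]),
        free_docks=int(raw["free_docks"]),
        timestamp=str(raw["timestamp"]),
    )


# Made-up message, not taken from the real feed
example = (
    '{"station_id": 1, "free_bikes": 4, "free_docks": 23, '
    '"timestamp": "2015-08-31T23:59:00"}'
)
status = parse_status(example)
```

Converting each message to a small record early keeps later steps (filtering, aggregation) independent of the raw JSON layout.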
DATA PREPARATION
EXPLORATORY ANALYSIS
Visualization libraries
Data Manipulation
Statistics
Scientific computing
Machine Learning
MODELING
RESULTS/VISUALIZATION
Architecture:
Producer: bike arriving, bike leaving, status
Kafka cluster
Consumer: Filter, Aggregate
Machine Learning: Build model, Train, Predict
webApp: REST api, Dashboard (Map/Path, Trips, Station, Weather)
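The Filter, Aggregate and Predict steps of the architecture can be sketched without a Kafka client, using plain dictionaries in place of consumed messages. Everything here is an assumption for illustration: the event fields, the function names, and above all the "model", which is only an hourly-average baseline standing in for whatever model the project actually trained.

```python
from collections import defaultdict


def filter_station(events, station_id):
    """Keep only the events that belong to one station."""
    return [e for e in events if e["station_id"] == station_id]


def aggregate_by_hour(events):
    """Average the number of free bikes per hour of day."""
    buckets = defaultdict(list)
    for e in events:
        buckets[e["hour"]].append(e["free_bikes"])
    return {hour: sum(v) / len(v) for hour, v in buckets.items()}


def predict(hourly_means, hour):
    """Baseline prediction: the historical mean for that hour, or None."""
    return hourly_means.get(hour)


# Made-up events standing in for messages read from a status topic
events = [
    {"station_id": 1, "hour": 8, "free_bikes": 4},
    {"station_id": 1, "hour": 8, "free_bikes": 6},
    {"station_id": 2, "hour": 8, "free_bikes": 0},
    {"station_id": 1, "hour": 9, "free_bikes": 2},
]
hourly = aggregate_by_hour(filter_station(events, station_id=1))
forecast = predict(hourly, hour=8)
```

In a real deployment the list of events would be replaced by a consumer loop, and the baseline by a trained model exposed through the REST api.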