Machine Learning

Frank Qiu

Chris Pang

Haider Shah

Stephanie Zhang

Intro

Algorithms

Business Value

Best Practices

The science of getting computers to act without being explicitly programmed.

-Stanford University CS 229 (Machine Learning) 

What is Machine Learning?

Ng, A, 2015

The Restaurant Case

John

Jack used to come a lot. Why hasn't he shown up for a while?

Why is Bill's bar more popular?

Is Roy a slow server?

Vancouver's best-selling beer

Average cost to run a restaurant

Transaction records related to Jack

......

Popular sauces

Popular restaurants

Query From Computer

Busiest hour of the week

But is that good enough?

A Machine Can Do It Again and Again

MachineLearning.py

import numpy as np
import matplotlib.pyplot as plt

print("I'm MachineLearning")

def ML(models, x, y):
    """Plot the data points and each fitted model (e.g. np.poly1d) over them."""
    plt.scatter(x, y, s=10, label="data")
    if models:
        # Evaluate every model on a dense grid spanning the observed x range.
        mx = np.linspace(min(x), max(x), 1000)
        for model in models:
            # print("Model:", model)
            # print("Coeffs:", model.coeffs)
            plt.plot(mx, model(mx), label="d=%i" % model.order)
    plt.legend(loc="upper left")

[Figure: top restaurants and other restaurants plotted by Factor 1 and Factor 2]

Let the Machine Decide 


Factor 1

Factor 4

Factor 567

Factor 3294923

Factor 1412331232412

Factor 314235312431215

Factor 214321043214321431321

Factor 41890-234890231408923

Machine Is Better

KPI

View Span

John's View

Machine's View

Artificial Intelligence

Machine Learning

Data Mining

Machine Learning VS Data Mining

Machine learning is a science concerned with developing self-learning algorithms. These algorithms are generic, so they can be applied to problems in many different domains.

Data mining is the practice of applying algorithms (mostly machine learning algorithms) to the data available in a domain in order to solve problems in that domain.

 

               

 

 

 

Machine Learning Algorithms

Supervised Learning

Unsupervised Learning

Reinforcement Learning

Supervised Learning

Algorithms are trained using labelled examples

 

Historical data predicts likely future events

 

Example: Credit Card Transactions
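To make "trained using labelled examples" concrete, here is a minimal sketch of a 1-nearest-neighbour classifier. The transaction amounts, hours, labels, and the scaling constants are all made up for illustration; they are assumptions, not data from the presentation, and a real fraud model would be far more involved.

```python
import math

# Hypothetical labelled history: (amount in dollars, hour of day) -> label.
# These numbers are invented for illustration only.
labelled_transactions = [
    ((12.50, 13), "legit"),
    ((40.00, 18), "legit"),
    ((25.75, 12), "legit"),
    ((980.00, 3), "fraud"),
    ((1500.00, 2), "fraud"),
    ((760.00, 4), "fraud"),
]

def scale(features):
    """Put both features on a roughly comparable 0-1 scale (assumed ranges)."""
    amount, hour = features
    return (amount / 1500.0, hour / 23.0)

def predict(features, training_data):
    """Return the label of the closest labelled example (1-nearest neighbour)."""
    query = scale(features)
    nearest = min(training_data, key=lambda ex: math.dist(query, scale(ex[0])))
    return nearest[1]

# Historical labels are used to predict the label of new, unseen transactions.
print(predict((30.00, 14), labelled_transactions))
print(predict((1100.00, 3), labelled_transactions))
```

The model here is nothing more than the stored labelled examples, which is why the quality and coverage of the historical labels matter so much.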

 

 

Unsupervised Learning

No Historical Labels

 

Goal is to explore data and find some structure within

 

Example: Customer Segmentation
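As a rough illustration of finding structure without labels, the sketch below runs plain k-means on a handful of hypothetical customers. The customer figures, the choice of two segments, and the starting centroids are assumptions made for this example only.

```python
import math

# Hypothetical customers: (visits per month, average spend in dollars).
customers = [(1, 15), (2, 18), (1, 22), (9, 60), (10, 55), (8, 65)]

def kmeans(points, centroids, iterations=10):
    """Plain k-means: assign each point to its nearest centroid, then recompute."""
    for _ in range(iterations):
        clusters = [[] for _ in centroids]
        for p in points:
            idx = min(range(len(centroids)), key=lambda i: math.dist(p, centroids[i]))
            clusters[idx].append(p)
        # Move each centroid to the mean of its cluster (keep it if the cluster is empty).
        centroids = [
            tuple(sum(c) / len(cluster) for c in zip(*cluster)) if cluster else centroids[i]
            for i, cluster in enumerate(clusters)
        ]
    return centroids, clusters

# Start from two of the data points; no labels are given to the algorithm.
centroids, clusters = kmeans(customers, centroids=[customers[0], customers[3]])
print(centroids)
print(clusters)
```

The algorithm is never told which customers are "occasional" or "regular"; the two groups emerge from the distances between points alone.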

 

 

Reinforcement Learning

Learn by trial and error

 

Agent, Environment, Action

 

Example: Robotics, Gaming, Navigation
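The agent, environment, and action loop can be sketched with tabular Q-learning on a toy problem. Everything below is an assumption chosen for illustration: a five-cell corridor as the environment, a reward of 1 for reaching the last cell, and the learning rate, discount, and exploration values.

```python
import random

random.seed(0)  # fixed seed so the run is repeatable

N_STATES = 5          # positions 0..4 in a corridor; position 4 is the goal
ACTIONS = [-1, +1]    # step left or step right
ALPHA, GAMMA, EPSILON = 0.5, 0.9, 0.2

# Q-table: the agent's estimate of future reward for each (state, action).
q = [[0.0, 0.0] for _ in range(N_STATES)]

def step(state, action_index):
    """Environment: move the agent and return (next_state, reward, done)."""
    next_state = min(max(state + ACTIONS[action_index], 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done

for episode in range(200):
    state, done = 0, False
    while not done:
        # Trial and error: mostly exploit the best known action, sometimes explore.
        if random.random() < EPSILON or q[state][0] == q[state][1]:
            a = random.randrange(2)
        else:
            a = 0 if q[state][0] > q[state][1] else 1
        next_state, reward, done = step(state, a)
        # Q-learning update toward reward plus discounted best future value.
        target = reward + (0.0 if done else GAMMA * max(q[next_state]))
        q[state][a] += ALPHA * (target - q[state][a])
        state = next_state

# Greedy policy read off the learned Q-table for the non-goal states.
policy = ["left" if q[s][0] > q[s][1] else "right" for s in range(N_STATES - 1)]
print(policy)
```

Nobody tells the agent that the goal is to the right; after enough episodes the learned values should favour stepping right from every cell, because that is what maximizes the discounted reward.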

 

 

Summary

 

Supervised Learning

Mapping Inputs to Outputs

Unsupervised Learning

Clustering Data

Reinforcement Learning

Maximize Rewards

 

Machine Learning

is

everywhere

Ranking

webpages

Customized

Recommendation

Face

Recognition

Handwriting

Recognition

Machine

Learning

Big Data

Can't

Program

by Hand

Beyond

Human

Capability

$48 billion/year

Online Advertising

$11.5 billion/year

Fraud detection systems

Gene prediction for cancer

Natural disaster prediction

Self-driving cars

...

ML Workflow: Best Practices

Historical Data

Model Building

Model Optimization

Model Evaluation

Answers

New Data

  1. Match the algorithm to the problem type

  2. Employ data discipline

  3. Consider scalability and production issues early on

  4. Actively monitor the model's performance

Evans, J, 2014
SAS, Machine Learning, 2015
Wood, D, 2013
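The workflow stages above (historical data, model building, model optimization, model evaluation, then answers for new data) can be walked through in a few lines. This is only a sketch under stated assumptions: the two-factor data is synthetic, the "top" and "other" labels and cluster centres are invented, and k-nearest neighbours stands in for whatever algorithm suits the real problem.

```python
import math
import random
from collections import Counter

random.seed(42)  # fixed seed so the synthetic data is repeatable

# 1. Historical data: hypothetical (factor_1, factor_2) points with known labels.
def make_point(label):
    centre = (2.0, 2.0) if label == "top" else (6.0, 6.0)
    return (random.gauss(centre[0], 1.0), random.gauss(centre[1], 1.0)), label

data = [make_point("top") for _ in range(60)] + [make_point("other") for _ in range(60)]
random.shuffle(data)
train, validation, test = data[:60], data[60:90], data[90:]

# 2. Model building: k-nearest neighbours votes among the k closest training points.
def predict(x, k, training=train):
    nearest = sorted(training, key=lambda ex: math.dist(x, ex[0]))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def accuracy(k, dataset):
    return sum(predict(x, k) == y for x, y in dataset) / len(dataset)

# 3. Model optimization: choose k on the validation set, never on the test set.
best_k = max([1, 3, 5, 7], key=lambda k: accuracy(k, validation))

# 4. Model evaluation: measure accuracy on data the model has not seen.
test_accuracy = accuracy(best_k, test)

# 5. New data in, answer out.
answer = predict((1.5, 2.5), best_k)
print(best_k, test_accuracy, answer)
```

Keeping the validation and test sets separate is one small instance of the "data discipline" practice: the number reported in step 4 stays honest only if the test set played no part in choosing k.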

Available Software

Muenchen, R, 2015.

Kaggle Competitions

Kaggle, All Competitions, 2015.

Q & A

Frank Qiu

Chris Pang

Haider Shah

Stephanie Zhang

Machine Learning - BAIT 527 Presentation

By Christopher Pang
