Karl Ho
School of Economic, Political and Policy Sciences
University of Texas at Dallas
Source: Wei, Chih-Ping, Selwyn Piramuthu, and Michael J. Shaw. "Knowledge discovery and data mining." In Handbook on Knowledge Management, pp. 157-189. Springer, Berlin, Heidelberg, 2003.
Source: Wei, Chih-Ping, Selwyn Piramuthu, and Michael J. Shaw. "Knowledge discovery and data mining." In Handbook on Knowledge Management, pp. 157-189. Springer, Berlin, Heidelberg, 2003.
(roughly) if you look in more places for interesting patterns than your amount of data will support, you are bound to find crap.
- Rajaraman A., Leskovec J. and Ullman J. Mining of Massive Datasets
Today we live in a data rich, information driven, knowledge strained, and wisdom scant world.
- Graham Williams 2021
- Rajaraman A., Leskovec J. and Ullman J. Mining of Massive Datasets
Data mining overlaps with:
Different cultures:
Data Mining and Machine Learning
Data Mining methods
Field of study that gives computers the ability to learn without being explicitly programmed.
- Arthur Samuel 1959
A computer can be programmed so that it will learn to play a better game of checkers than can be played by the person who wrote the program.
Programming computers to learn from experience should eventually eliminate the need for much of this detailed programming effort.
The ultimate goal of data modeling is to explain and predict the variable of interest using data. Machine learning is to achieve this goal using computer algorithms in particular to make the prediction and solve the problem.
Source: Tom Mitchell website
According to Carnegie Mellon Computer Science professor Tom M. Mitchell,
"Machine learning is the study of computer algorithms that allow computer programs to automatically improve through experience."
“A computer program is said to learn from experience E with respect to some class of tasks T and performance measure P if its performance at tasks in T, as measured by P, improves with experience E.”
Source: Tom Mitchell website
Study the past if you would define the future.
– Confucius
Tom Mitchell. 1997. Machine Learning, McGraw Hill.
1928 – 2005
One assumes that the data are generated by a given stochastic data model. |
---|
The other uses algorithmic models and treats the data mechanism as unknown. |
---|
Data Model |
---|
Algorithmic Model |
---|
Small data |
---|
Complex, big data |
---|
Data are generated in many fashions. Picture this: independent variable x goes in one side of the box-- we call it nature for now-- and dependent variable y come out from the other side.
The analysis in this culture starts with assuming a stochastic data model for the inside of the black box. For example, a common data model is that data are generated by independent draws from response variables.
Response Variable= f(Predictor variables, random noise, parameters)
Reading the response variable is a function of a series of predictor/independent variables, plus random noise (normally distributed errors) and other parameters.
The values of the parameters are estimated from the data and the model then used for information and/or prediction.
The analysis in this approach considers the inside of the box complex and unknown. Their approach is to find a function f(x)-an algorithm that operates on x to predict the responses y.
The goal is to find algorithm that accurately predicts y.
Source: https://www.mathworks.com
Source: Attewell, Paul A. & Monaghan, David B. 2015. Data Mining for the Social Sciences: an Introduction, Table 2.1, p. 27
Regression | Classification | Clustering | Q-Learning |
Linear regression | Logistic regression | - K-Means Clustering | State Action Reward State Action (SARSA) |
Polynomial regression | K-Nearest Neighbors | - Hierarchical Clustering | Deep Q-Network |
Support vector regression | Support Vector Machines | Dimensionality Reduction | Markov Decision Processes |
Ridge Regression | Kernal Support Vector Machines | Principal Component Analysis | Deep Deterministic Policy Gradient (DDPG) |
Lasso | Naïve Bayes | Linear Discriminant Analysis | |
ElasticNet | Decision Tree | Kernal PCA | |
Decision tree | Random forest | ||
Random forest |
Supervised Learning | Unsupervised Learning | Reinforcement Learning |
---|