Introducción a machine learning

MT3006 - Robótica 2

Dos caras de la misma moneda

Versus

¿Realidad?

Un poco de ambos

¿Dónde lo encontramos?

Algunas aplicaciones

sistemas de recomendación

procesamiento natural de lenguaje

medicina y bioquímica computacional

reconocimiento de imágenes y visión por computadora

Generación de contenido sintético

¿Qué es entonces machine learning?

Partamos de un ejemplo simple

¿Cómo clasificar perros y gatos?

Ingeniería tradicional es top-down

\mathbf{x}

\mathbf{f}

principios fundamentales

matemática

física

química

electrónica

mecánica

etc.

Machine learning es bottom-up

\mathbf{x}

\mathbf{f}

el sistema consume data y aprende el modelo

data

Aprender se refiere a emplear un conjunto de ejemplos para inferir algo acerca del proceso subyacente

Tipos de machine learning

Data como punto de inicio

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

datos, observaciones o ejemplos

Data como punto de inicio

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(i)}=\begin{bmatrix} x_1 \\ x_2 \\ \vdots \\ x_d \end{bmatrix}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

feature vector

vector de características

Ejemplo: clasificación

Ejemplo: clasificación

\mathbf{x}^{(1)}

\mathbf{x}^{(2)}

\mathbf{x}^{(3)}

\mathbf{x}^{(4)}

\mathbf{x}^{(15)}

\mathbf{x}^{(14)}

\cdots

Ejemplo: clasificación

\mathbf{x}^{(1)}

\mathbf{x}^{(2)}

\mathbf{x}^{(3)}

\mathbf{x}^{(4)}

\mathbf{x}^{(15)}

\mathbf{x}^{(14)}

\cdots

perro

gato

perro

gato

Ejemplo: regresión

precio ($k)

área

superficial $(m^3)$

100

150

200

250

150

300

Ejemplo: regresión

precio ($k)

área

superficial $(m^3)$

100

150

200

250

150

300

Ejemplo: regresión

precio ($k)

área

superficial $(m^3)$

100

150

200

250

150

300

225

Ejemplo: regresión

precio ($k)

área

superficial $(m^3)$

100

150

200

250

150

300

225

Estos son ejemplos de aprendizaje supervisado (supervised learning)

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

Supervised learning

datos junto con...

Supervised learning

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

labels |

y^{(1)}

y^{(2)}

y^{(3)}

y^{(4)}

y^{(9)}

y^{(7)}

etiquetas

y^{(5)}

y^{(8)}

y^{(10)}

y^{(11)}

y^{(12)}

Supervised learning

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

y^{(1)}

y^{(2)}

y^{(3)}

y^{(4)}

y^{(9)}

y^{(7)}

training data

training set

y^{(5)}

y^{(8)}

y^{(10)}

y^{(11)}

y^{(12)}

Supervised learning

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

y^{(1)}

y^{(2)}

y^{(3)}

y^{(4)}

y^{(9)}

y^{(7)}

objetivo:

explicación

predicción

y^{(5)}

y^{(8)}

y^{(10)}

y^{(11)}

y^{(12)}

\mathbf{x}^{(13)}

¿y^{(13)}?

Supervised learning

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

y^{(1)}

y^{(2)}

y^{(3)}

y^{(4)}

y^{(9)}

y^{(7)}

objetivo:

explicación

predicción

y^{(5)}

y^{(8)}

y^{(10)}

y^{(11)}

y^{(12)}

\mathbf{x}^{(13)}

¿y^{(13)}?

Clasificación: $\quad y\in\{1,2,...,m\}$
Regresión: $\quad y \in \mathbb{R}$

Ejemplo: clustering

Ejemplo: diffusion and denoising

Ejemplo: diffusion and denoising

Estos son ejemplos de aprendizaje no supervisado (unsupervised learning) y/o autosupervisado

Unsupervised [self-supervised] learning

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

\mathbf{x}^{(1)}

\mathbf{x}^{(3)}

\mathbf{x}^{(11)}

\mathbf{x}^{(5)}

\mathbf{x}^{(4)}

\mathbf{x}^{(9)}

\mathbf{x}^{(2)}

\mathbf{x}^{(7)}

\mathbf{x}^{(10)}

\mathbf{x}^{(8)}

\mathbf{x}^{(12)}

encontrar y/o comprender estructura o distribución

Unsupervised [self-supervised] learning

Ejemplo: robots jugando fútbol

Reinforcement learning

environment

policy

actor

action

state

reward

a_t

R_t

s_t

LeCun cake analogy

unsupervised

(self-supervised) learning

supervised learning

reinforcement learning

Más sobre supervised learning partiendo de un ejemplo conocido

¿Modelo que prediga $y_i$ para un $x_i$ desconocido?

y=3.0403x+0

¿Modelo que prediga $y_i$ para un $x_i$ desconocido?

y=3.0403x+0

trivial, pero con esto ya entrenamos un modelo de machine learning

El modelo se encontró con data.
El modelo no sólo explica los ejemplos, sino que permite predicciones.
- Machine learning $\approx$ inferencia estadística.

¿Por qué?

El modelo se encontró con data.
El modelo no sólo explica los ejemplos, sino que permite predicciones.
- Machine learning $\approx$ inferencia estadística.

¿Por qué?

Esto, sin embargo, esconde sutilezas fundamentales.

Formalizando conceptos

Emplear regresión lineal implica resolver