Intro to Machine Learning


Lecture 2: Linear regression and regularization
Shen Shen
Feb 9, 2024
(many slides adapted from Tamara Broderick)


Logistical issues? Personal concerns?
We’d love to help out at
6.390-personal@mit.edu

Logistics
- The 11am Sections 3 and 4 are completely full, and we have many requests to switch in; the physical space is packed.
- If at all possible, please help us by signing up for, or switching to, other slots.
- OHs start this Sunday; please also join our Piazza.
- Thanks for all the assignment feedback. We are adapting on the go, and it certainly benefits future semesters.
- Assignments start coming due now (first up, Exercises 2; keep an eye on the "due" dates).
Optimization + first-principles physics
Outline
- Recap of last (content) week.
- Ordinary least-squares regression
  - Analytical solution (when it exists)
  - Cases when analytical solutions don't exist
    - Practically, visually, mathematically
- Regularization
  - Hyperparameter, cross-validation
Outline
- Recap of last (content) week.
- Ordinary least-squares regression
  - Analytical solution (when it exists)
  - Cases when analytical solutions don't exist
    - Practically, visually, mathematically
- Regularization
  - Hyperparameter, cross-validation
\theta^*=\left(\tilde{X}^{\top} \tilde{X}\right)^{-1} \tilde{X}^{\top} \tilde{Y}
- When θ∗ exists, it is guaranteed to be the unique minimizer of the ordinary least-squares (mean squared error) objective.
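As a concrete illustration, here is a minimal numpy sketch of computing this closed-form solution; the data and variable names are made up for the example, and we solve the normal equations rather than forming the inverse explicitly.

import numpy as np

# Illustrative setup: X_tilde is the n-by-(d+1) data matrix with a column of
# ones appended for the offset; Y is the n-by-1 target vector.
rng = np.random.default_rng(0)
n, d = 100, 3
X = rng.normal(size=(n, d))
X_tilde = np.hstack([X, np.ones((n, 1))])
theta_true = np.array([[2.0], [-1.0], [0.5], [3.0]])
Y = X_tilde @ theta_true + 0.1 * rng.normal(size=(n, 1))

# theta* = (X~^T X~)^{-1} X~^T Y, computed by solving the normal equations
# X~^T X~ theta = X~^T Y (numerically preferable to an explicit inverse).
theta_star = np.linalg.solve(X_tilde.T @ X_tilde, X_tilde.T @ Y)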
\theta^*=\left(\tilde{X}^{\top} \tilde{X}\right)^{-1} \tilde{X}^{\top} \tilde{Y}
Now, the catch: θ∗ may not be well-defined.
- θ∗ is not well-defined if (X~⊤X~) is not invertible.
- Indeed, it is possible that (X~⊤X~) is not invertible.
- In particular, (X~⊤X~) is not invertible if and only if X~ is not full column rank.

X~ fails to be full column rank (and hence θ∗ is not well-defined)
- if n<d (i.e., not enough data)
- if columns (features) in X~ have linear dependency (i.e., so-called collinearity)

Recall the example from earlier: there, X~ is indeed not full column rank.

Recap:
- Both cases do happen in practice.
- In both cases, the loss function is a "half-pipe" (flat along some directions).
- In both cases, there are infinitely many optimal hypotheses.
- Side note: sometimes noise can resolve the invertibility issue, but that is undesirable.
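To make the collinearity case concrete, here is a small numpy sketch (the data is made up): duplicating a feature makes X~ rank-deficient, so X~⊤X~ is singular and the closed form above breaks down, while a least-squares solver still returns one of the infinitely many minimizers.

import numpy as np

# Illustrative example of collinearity: the second column is an exact
# multiple of the first, so X~ is not full column rank.
rng = np.random.default_rng(1)
n = 50
x1 = rng.normal(size=(n, 1))
X_tilde = np.hstack([x1, 2 * x1, np.ones((n, 1))])
Y = 3 * x1 + 1 + 0.1 * rng.normal(size=(n, 1))

gram = X_tilde.T @ X_tilde
print(np.linalg.matrix_rank(gram))   # prints 2 (not 3): the Gram matrix is singular
# np.linalg.inv(gram) is not usable here; np.linalg.lstsq still returns a
# minimizer (the minimum-norm one among the infinitely many).
theta, *_ = np.linalg.lstsq(X_tilde, Y, rcond=None)

Rather than relying on a solver's minimum-norm behavior, the next section's regularization makes the problem well-posed by expressing a preference among the solutions.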
Outline
- Recap of last (content) week.
- Ordinary least-squares regression
  - Analytical solution (when it exists)
  - Cases when analytical solutions don't exist
    - Practically, visually, mathematically
- Regularization
  - Hyperparameter, cross-validation
Regularization


Ridge Regression Regularization

λ is a hyperparameter
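For reference, one common way to write the ridge objective and its closed-form solution is sketched below. The exact constants depend on convention (whether the squared loss carries a 1/n factor), and many treatments leave the offset term out of the penalty; this sketch folds everything into θ for brevity.

J_{\text{ridge}}(\theta)=\frac{1}{n}\left\|\tilde{X} \theta-\tilde{Y}\right\|^{2}+\lambda\|\theta\|^{2}
\theta_{\text{ridge}}^{*}=\left(\tilde{X}^{\top} \tilde{X}+n \lambda I\right)^{-1} \tilde{X}^{\top} \tilde{Y}

Note that for λ > 0 the matrix X~⊤X~ + nλI is always invertible, so the ridge solution is well-defined even when X~ is not full column rank.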
Outline
- Recap of last (content) week.
- Ordinary least-squares regression
  - Analytical solution (when it exists)
  - Cases when analytical solutions don't exist
    - Practically, visually, mathematically
- Regularization
  - Hyperparameter, cross-validation
Cross-validation
Comments about cross-validation
- Good idea to shuffle the data first.
- A way to "reuse" data.
- Not evaluating a hypothesis, but rather evaluating a learning algorithm (e.g. hypothesis class, hyperparameter).
- Could e.g. have an outer loop for picking a good hyperparameter/class (see the sketch below).
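As a sketch of the mechanics (the names, fold count, and use of the ridge closed form above are illustrative, not the course's reference implementation):

import numpy as np

def ridge_fit(X_tilde, Y, lam):
    # Closed-form ridge solution, using the same convention as above
    # (offset folded into X_tilde, penalty scaled by n).
    n, d = X_tilde.shape
    return np.linalg.solve(X_tilde.T @ X_tilde + n * lam * np.eye(d), X_tilde.T @ Y)

def mse(X_tilde, Y, theta):
    return float(np.mean((X_tilde @ theta - Y) ** 2))

def cross_validate(X_tilde, Y, lam, k=5, seed=0):
    # k-fold cross-validation: shuffle, split into k chunks, and average the
    # validation error over the k train/validate splits.
    n = X_tilde.shape[0]
    idx = np.random.default_rng(seed).permutation(n)
    folds = np.array_split(idx, k)
    scores = []
    for i in range(k):
        val = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        theta = ridge_fit(X_tilde[train], Y[train], lam)
        scores.append(mse(X_tilde[val], Y[val], theta))
    return float(np.mean(scores))

# Outer loop: pick the lambda with the lowest cross-validation error, e.g.
# lambdas = [0.01, 0.1, 1.0, 10.0]
# best_lam = min(lambdas, key=lambda lam: cross_validate(X_tilde, Y, lam))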
Summary
- One strategy for finding ML algorithms is to reduce the ML problem to an optimization problem.
- For ordinary least squares (OLS), we can find the optimizer analytically, using basic calculus: take the gradient and set it to zero. (In general, a zero gradient is only a necessary condition; for OLS the objective is a convex quadratic, so it suffices.)
- Two ways to approach the calculus problem: write everything out in terms of explicit sums, or keep it in vector-matrix form. The vector-matrix form is easier to manage as things get complicated (and they will!); see the worked version below and the good discussions in the lecture notes.
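For reference, here is that calculation in vector-matrix form, assuming the mean-squared-error objective (the 1/n factor does not affect the final answer):

J(\theta)=\frac{1}{n}\left(\tilde{X} \theta-\tilde{Y}\right)^{\top}\left(\tilde{X} \theta-\tilde{Y}\right)
\nabla_{\theta} J(\theta)=\frac{2}{n} \tilde{X}^{\top}\left(\tilde{X} \theta-\tilde{Y}\right)=0
\quad\Longrightarrow\quad \tilde{X}^{\top} \tilde{X}\, \theta=\tilde{X}^{\top} \tilde{Y}
\quad\Longrightarrow\quad \theta^{*}=\left(\tilde{X}^{\top} \tilde{X}\right)^{-1} \tilde{X}^{\top} \tilde{Y}

The last step assumes X~⊤X~ is invertible, i.e., X~ has full column rank, as discussed above.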
Summary
- What it means for the problem to be well posed.
- When there are many possible solutions, we need to indicate our preference somehow.
- Regularization is a way to construct a new optimization problem that encodes that preference.
- Least-squares regularization leads to the ridge-regression formulation. Good news: we can still solve it analytically!
- Hyperparameters, and how to pick them via cross-validation.
Thanks!
We'd love for you to share some lecture feedback.