Introduction to Deep Learning

Contents

  • Neural network architecture overview

  • Activation functions

  • Backpropagation

Neural network architecture overview

Architecture overview

The most basic component of an artificial neural network is the activation unit.

 

It is made of an input, or a set of n inputs (which may include a constant bias term), an 'activation' function, and an output.

Activation node

[Figure: inputs X_1, X_2, \dots, X_n and a bias input X_0, weighted by \theta_1, \theta_2, \dots, \theta_n and \theta_0, are summed (\sum) and passed through the activation function (\sim) to produce the output O_i.]
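In symbols, the unit computes a weighted sum of its inputs and passes it through the activation function f (the \sim in the figure); this is a direct restatement of the diagram above:

O_i = f\left( \sum_{j=0}^{n} \theta_j X_j \right), \qquad X_0 = 1 \ \text{(bias)}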

Multilayer network

When we stack these units together into layers, we get a multilayer artificial neural network.

[Figure: multilayer network. Inputs X_1, X_2, X_3 feed into layers of activation units (each a \sum followed by a \sim), producing outputs O_1, O_2, O_3.]
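A minimal NumPy sketch of the forward pass through such a stack. The layer sizes and random weights here are illustrative assumptions, not values from the slides:

    import numpy as np

    def heaviside(z):
        # step activation (the \sim node): 1 where z >= 0, else 0
        return (z >= 0).astype(float)

    def forward(x, layers):
        # each layer computes heaviside(W @ a + b) on the previous layer's output
        a = x
        for W, b in layers:
            a = heaviside(W @ a + b)
        return a

    rng = np.random.default_rng(0)
    layers = [
        (rng.normal(size=(4, 3)), rng.normal(size=4)),  # hidden layer: 3 inputs -> 4 units
        (rng.normal(size=(3, 4)), rng.normal(size=3)),  # output layer: 4 -> 3 outputs O_1..O_3
    ]
    print(forward(np.array([1.0, 0.0, 1.0]), layers))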

Learning rules

Classification example:

 

XOR function

 

Let us suppose that we want to create a two-layer neural network able to classify these observations (XOR is not linearly separable, so a single unit cannot do it):

(0,1) \rightarrow 1
(1,1) \rightarrow 0
(1,0) \rightarrow 1
(0,0) \rightarrow 0


[Figure: the four XOR points with the desired classification region shown in yellow.]

Or equivalently, we want a neural network able to create a classification region such as the yellow one, separating the points labelled 1 from the points labelled 0.


Proposed solution

[Figure: proposed two-layer network. Hidden unit 1 takes X_1 and X_2 with weights +1 and +1 and bias -1.5; hidden unit 2 takes X_1 and X_2 with weights +1 and +1 and bias -0.5; the output unit takes the two hidden outputs with weights -1 and +1 and bias -0.5. Each unit sums its weighted inputs (\sum) and applies a heaviside activation (\sim).]
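Reading the weights off the figure, the network computes:

h_1 = \mathrm{heaviside}(X_1 + X_2 - 1.5) \quad \text{(an AND-like unit)}
h_2 = \mathrm{heaviside}(X_1 + X_2 - 0.5) \quad \text{(an OR-like unit)}
O = \mathrm{heaviside}(-h_1 + h_2 - 0.5) \quad \text{(fires when } h_2 = 1 \text{ and } h_1 = 0\text{)}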

Evaluating the network on input (X_1, X_2) = (0, 1):

heaviside(0*1 + 1*1 + (-1.5*1)) = heaviside(-0.5) = 0   (hidden unit 1)
heaviside(0*1 + 1*1 + (-0.5*1)) = heaviside(0.5) = 1    (hidden unit 2)
heaviside(-1*0 + 1*1 + (-0.5*1)) = heaviside(0.5) = 1   (output)

So (0,1) → 1, as required.

Learning rules

\sim
\sum
-0.5
1
\sim
\sum
\sim
\sum
-1
+1
1
X_1
X_2
+1
+1
+1
+1
-0.5
-1.5
0
0
heaviside(0*1 + 1*0 + (-1.5*1))=heaviside(-1.5)=0
heaviside(0*1 + 1*0 + (-0.5*1))=heaviside(-0.5)=0
heaviside(-1*0 + 1*0 + (-0.5*1))=heaviside(-0.5)=0
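A quick check of the proposed weights over all four XOR inputs, as a minimal sketch (heaviside here returns 1 for z >= 0, matching the convention in the worked examples; with these weights the argument never hits 0 exactly):

    def heaviside(z):
        # step activation: 1 if z >= 0 else 0
        return 1 if z >= 0 else 0

    def xor_net(x1, x2):
        h1 = heaviside(1*x1 + 1*x2 - 1.5)     # hidden unit 1 (AND-like)
        h2 = heaviside(1*x1 + 1*x2 - 0.5)     # hidden unit 2 (OR-like)
        return heaviside(-1*h1 + 1*h2 - 0.5)  # output unit

    for x1, x2 in [(0, 1), (1, 1), (1, 0), (0, 0)]:
        print((x1, x2), '->', xor_net(x1, x2))
    # prints: (0, 1) -> 1, (1, 1) -> 0, (1, 0) -> 1, (0, 0) -> 0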

Activation functions

[Figure: plot of a basic activation function.]

More complex activation functions

[Figure: plots of more complex activation functions.]
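A minimal Python sketch of standard activation functions, assuming the plots covered the usual choices (the step function used in the XOR example, plus smoother alternatives that work with gradient-based training):

    import numpy as np

    def step(z):
        # heaviside step, as in the XOR example; not differentiable at 0
        return (z >= 0).astype(float)

    def sigmoid(z):
        # smooth squashing to (0, 1); differentiable everywhere
        return 1.0 / (1.0 + np.exp(-z))

    def tanh(z):
        # smooth squashing to (-1, 1)
        return np.tanh(z)

    def relu(z):
        # rectified linear unit: 0 for negative inputs, identity otherwise
        return np.maximum(0.0, z)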

Backpropagation

[Figure: multilayer network with outputs O_1, O_2, O_3.]

Now our objective is to train the network with a gradient-based method, which requires somehow propagating the output errors back to the previous layers.


Of course, with more complex architectures, computing these gradients becomes an issue; backpropagation makes it tractable by applying the chain rule layer by layer.
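A textbook sketch of that recursion (the notation is an assumption, not taken from the slides): with pre-activations z^{(l)} = W^{(l)} a^{(l-1)} + b^{(l)}, activations a^{(l)} = f(z^{(l)}), and loss \mathcal{L}, the error signals \delta propagate backward as

\delta^{(L)} = \nabla_{a^{(L)}} \mathcal{L} \odot f'(z^{(L)})
\delta^{(l)} = \left( W^{(l+1)\top} \delta^{(l+1)} \right) \odot f'(z^{(l)})
\frac{\partial \mathcal{L}}{\partial W^{(l)}} = \delta^{(l)} \, a^{(l-1)\top}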


Introduction to Deep Learning

By Luis Roman
