Backward propagation
Training samples :
Optimize the weights
Softmax function
probability of the sample
to belong to class c
How to compute
}