Andrés Esteban Páez Torres
MindLab
Universidad Nacional de Colombia
The challenge is to devise effective and efficient mechanisms to perform matrix factorization in high dimensional feature spaces implicitly defined by kernels.
Let's consider the following factorization
And its reconstruction error
Which we can express as
Using the kernel function
But the terms have a dimension
which leads to the impossibility to apply the factorization to a large number of samples
To solve that problem let's consider the following factorization
With the factorization stated we pose this optimization problem using stochastic gradient descent
In order to use SGD we must calculate the gradient
With the gradients we get the update rules