Socher et al, 2011
Semi-supervised learning uses both labeled data (S) and unlabeled data (U) as inputs. Its goal can either be to learn the labels of U or to learn the correct mapping from X to Y.
Autoencoders are a form of unsupervised learning using neural networks. They map a set of inputs X to itself by learning an approximation of the identity function.
A recursive autoencoder (RAE) takes a set of inputs and recursively merges pairs until a single element remains, which captures the information of all inputs.
Sentiment analysis is the process of identifying and classifying the opinion or emotion expressed in a piece of text. A sentiment distribution is the distribution of sentiment labels over a body of texts.
Learn how to identify the sentiment of a piece of text.
A semi-supervised recursive RAE can learn this mapping without the help of traditional resources.
A continuous word vector is a mapping of a word to a vector in a feature space where each dimension captures some syntactic or semantic meaning.
Structure prediction uses an RAE to find the vector representation of a sentence that minimizes the total reconstruction error over all levels of the recursion tree.
A semi-supervised RAE applies an additional softmax layer at each level of the recursion tree to predict the sentiment distribution of the parent feature vector, p.
Given the current set of parameters
greedily construct the optimal tree for each sentence.
Compute the gradient for the objective function
and update the parameter values using L-BFGS.
Weight decay
Length Normalization
Example of multinomial sentiment distribution.
Website for anonymous sharing of personal stories with other users who can respond by "voting" for one of five categories:
Most traditional sentiment analysis methods only perform binary polarity classification. In order to compare semi-supervised RAEs to these methods, the model was evaluated on two common datasets: