Speech Project

Week 17 Report

b02901085   徐瑞陽

b02901054   方為

Task 5:

Aspect Based Sentiment Analysis (ABSA)

We focus on slots 1 and 3 of subtask 1 first

Subtask 1: Sentence-level ABSA

Given a review text about a target entity (laptop, restaurant, etc.),

identify the following information:

  • Slot 1: Aspect Category
    • ex. "It is extremely portable and easily connects to WIFI at the library and elsewhere"
      -----> {LAPTOP#PORTABILITY}, {LAPTOP#CONNECTIVITY}
  • Slot 2: Opinion Target Expression (OTE)
    • an expression used in the given text to refer to the reviewed entity#aspect (E#A) pair
    • ex. "The fajitas were delicious, but expensive"
      -----> {FOOD#QUALITY, "fajitas"}, {FOOD#PRICES, "fajitas"}
  • Slot 3: Sentiment Polarity
    • label: positive, negative, or neutral

 

Subtask 1: Sentence-level

Slot 1: Aspect Detection

Model                       Restaurant (12-class F-measure)   Laptop (81-class F-measure)
Linear SVM (BOW)            62.58                             52.16
Linear SVM (GloVe vec.)     61.14                             46.37
Linear SVM (BOW + GloVe)    64.17                             51.05
RBF SVM (BOW)               60.71                             50.42
RBF SVM (GloVe vec.)        61.03                             42.16
RBF SVM (BOW + GloVe)       64.32                             40.77
2015's best                 62.68                             50.86
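
The SVM baselines above are standard multi-label text classifiers. Below is a minimal scikit-learn sketch of the BOW + GloVe variant, assuming pre-trained GloVe vectors are loaded into a dict `glove` and training data comes as `(texts, label_sets)`; the class names and hyperparameters here are illustrative, not our exact setup.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.multiclass import OneVsRestClassifier
from sklearn.pipeline import FeatureUnion, make_pipeline
from sklearn.preprocessing import MultiLabelBinarizer
from sklearn.svm import LinearSVC

class MeanGloVe(BaseEstimator, TransformerMixin):
    """Average pre-trained GloVe vectors over the tokens of each sentence."""
    def __init__(self, glove, dim=300):
        self.glove, self.dim = glove, dim
    def fit(self, X, y=None):
        return self
    def transform(self, X):
        out = np.zeros((len(X), self.dim))
        for i, text in enumerate(X):
            vecs = [self.glove[w] for w in text.lower().split() if w in self.glove]
            if vecs:
                out[i] = np.mean(vecs, axis=0)
        return out

def train_aspect_detector(texts, label_sets, glove):
    # Multi-label targets, e.g. {"LAPTOP#PORTABILITY", "LAPTOP#CONNECTIVITY"}
    mlb = MultiLabelBinarizer()
    Y = mlb.fit_transform(label_sets)
    features = FeatureUnion([
        ("bow", CountVectorizer(binary=True)),   # bag-of-words features
        ("glove", MeanGloVe(glove)),             # averaged GloVe sentence vector
    ])
    clf = make_pipeline(features, OneVsRestClassifier(LinearSVC(C=1.0)))
    clf.fit(texts, Y)
    # Predict with: mlb.inverse_transform(clf.predict(["It is extremely portable ..."]))
    return clf, mlb
```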

Subtask 1: Sentence-level

Slot 3: Polarity Detection

Model                          Restaurant (3-class accuracy)   Laptop (3-class accuracy)
TreeLSTM                       71.23%                          74.93%
Sentinue (SemEval 2015 best)   78.69%                          79.34%
  • Small experiment: predict by training a TreeLSTM model
    • Conflicting labels for different aspects in a sentence were removed (see the filtering sketch below)
    • Accuracy is measured on a dev set split from the training data with conflicting labels removed, so it cannot completely reflect the real accuracy
  • Seems like we're on the right track...
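
For clarity, a minimal sketch of the filtering step described above, assuming each training instance is a `(sentence, opinions)` pair where `opinions` is a list of `(aspect, polarity)` tuples (these names are illustrative, not our actual data format):

```python
def drop_conflicting(instances):
    """Keep only sentences whose aspects all carry the same polarity,
    so the whole sentence can get a single label for TreeLSTM training."""
    kept = []
    for sentence, opinions in instances:
        polarities = {polarity for _aspect, polarity in opinions}
        if len(polarities) == 1:          # no conflict within this sentence
            kept.append((sentence, polarities.pop()))
    return kept
```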

Slot 3: Polarity Detection

Proposed Model

  • Predict by training the proposed model
  • Stage 1: TreeLSTM
    • Extracts sentence features for the next stage; the features are taken from the last layer of the TreeLSTM (a minimal sketch of the TreeLSTM cell follows this list)
    • Conflicting labels for different aspects in a sentence are removed when training the TreeLSTM
    • Details:
      • Word embedding: GloVe vectors
      • Target: 5 classes
      • Structure: dependency tree
      • Learning rate: 0.005; embedding learning rate: 0.1
      • Dropout: 0.5; activation: tanh
      • Restaurant: 2 layers, 300 hidden units
      • Laptop: 3 layers, 300 hidden units
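
For reference, a minimal PyTorch sketch of a Child-Sum TreeLSTM cell over a dependency tree (Tai et al., 2015). This is a generic sketch of the architecture rather than our exact implementation: the node class (with `.children` and `.idx`), the `embeddings` tensor, and the omitted 5-class output head are all assumptions.

```python
import torch
import torch.nn as nn

class ChildSumTreeLSTM(nn.Module):
    """Child-Sum TreeLSTM cell: each node combines its word embedding
    with the summed hidden states of its children in the dependency tree."""
    def __init__(self, in_dim, hid_dim):
        super().__init__()
        self.iou = nn.Linear(in_dim + hid_dim, 3 * hid_dim)  # input, output, update gates
        self.f_x = nn.Linear(in_dim, hid_dim)                 # forget gate, input part
        self.f_h = nn.Linear(hid_dim, hid_dim)                # forget gate, per-child part
        self.hid_dim = hid_dim

    def node_forward(self, x, child_h, child_c):
        h_sum = child_h.sum(dim=0)                            # sum of children's hidden states
        i, o, u = self.iou(torch.cat([x, h_sum])).chunk(3)
        i, o, u = torch.sigmoid(i), torch.sigmoid(o), torch.tanh(u)
        f = torch.sigmoid(self.f_x(x) + self.f_h(child_h))    # one forget gate per child
        c = i * u + (f * child_c).sum(dim=0)
        h = o * torch.tanh(c)
        return h, c

    def forward(self, node, embeddings):
        """Process the tree bottom-up; the root's h is the sentence feature."""
        if node.children:
            states = [self.forward(child, embeddings) for child in node.children]
            child_h = torch.stack([h for h, _ in states])
            child_c = torch.stack([c for _, c in states])
        else:
            child_h = torch.zeros(1, self.hid_dim)
            child_c = torch.zeros(1, self.hid_dim)
        return self.node_forward(embeddings[node.idx], child_h, child_c)
```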

Slot 3: Polarity Detection

  • Stage 2: Classifier
    • Currently a NN; will try an SVM in the future (a minimal sketch follows this list)
    • Sentences with conflicting labels can be used for training at this stage
    • Details:
      • Target: 3 classes
      • Learning rate: 0.0001
      • Dropout: 0.3
      • 3 layers, 256 hidden units
      • Activation: ReLU
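
A minimal PyTorch sketch of the stage-2 NN described above, interpreting "3 layers, 256 hidden" as three fully connected layers with ReLU and dropout 0.3. How the input feature is built (TreeLSTM sentence feature, possibly concatenated with an aspect representation) is an assumption.

```python
import torch.nn as nn

class PolarityClassifier(nn.Module):
    """Stage-2 classifier: TreeLSTM sentence feature (+ aspect representation) -> polarity."""
    def __init__(self, feat_dim, hid_dim=256, n_classes=3, dropout=0.3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim, hid_dim), nn.ReLU(), nn.Dropout(dropout),
            nn.Linear(hid_dim, hid_dim), nn.ReLU(), nn.Dropout(dropout),
            nn.Linear(hid_dim, n_classes),   # logits for positive / negative / neutral
        )

    def forward(self, features):
        return self.net(features)

# Trained with cross-entropy, e.g. Adam with learning rate 1e-4 as in the details above.
```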

Slot 3: Polarity Detection

Model                          Restaurant (3-class accuracy)   Laptop (3-class accuracy)
Our model                      83%                             84.5%
Sentinue (SemEval 2015 best)   78.69%                          79.34%

Note: accuracy obtained through cross-validation

Results

Exciting results!!! Note that the 2016 training set is about 50% larger than the 2015 one, and about 1~3% of the 2015 training instances do not appear in the 2016 dataset.

Future Work

Before submission deadline (1/31)

Subtask 1: Sentence-level

  • Slot 2: OTE detection
    • Structured learning
  • Slot 3: Polarity Detection
    1. SVM in stage 2
    2. Label internal nodes when training the stage-1 TreeLSTM
    3. Use other resources
    4. Write our own TreeLSTM so our model can be trained end-to-end (if time permits...)
    5. QA memory network (if time permits...)

Subtask 1: Sentence-level (combining the models)

For each sentence:

sentence → Subtask1-slot1 SVM → see if the sentence contains each aspect → if yes, predict that aspect's polarity with the Subtask1-slot3 model

Finally, combine all aspect and polarity pairs
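
A minimal sketch of this per-sentence combination step, assuming `aspect_svm` and `mlb` come from the slot-1 sketch above and `predict_polarity(sentence, aspect)` wraps the slot-3 model (all three names are placeholders):

```python
def analyze(sentences, aspect_svm, mlb, predict_polarity):
    """Detect aspects with the slot-1 SVM, then assign a polarity to each
    detected aspect with the slot-3 model; return (sentence, aspect, polarity) triples."""
    results = []
    for sentence in sentences:
        detected = mlb.inverse_transform(aspect_svm.predict([sentence]))[0]
        for aspect in detected:                     # sentences with no aspect are skipped
            polarity = predict_polarity(sentence, aspect)
            results.append((sentence, aspect, polarity))
    return results
```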

Subtask 2: Text-level

Subtask 3: Out-of-domain
