Week 17 Report
b02901085 徐瑞陽
b02901054 方為
We focus on slots 1 and 3 of subtask 1 first
Given a review text about a target entity (laptop, restaurant, etc.),
identify the following information:
bag of word : 62 %
glove vector : 61 %
bag of word + glove vector : 64 %
Restaurant
bag of word : 48 %
glove vector : 42%
bag of word + glove vector : 47 %
Laptop
Linear SVM
bag of word : %
glove vector : 61 %
bag of word + glove vector : 64 %
Restaurant
bag of word : %
glove vector : %
bag of word + glove vector : %
Laptop
RBF SVM
Without LSI :
restaurant : 61% (12 category)
laptop : 52% (81 category)
restaurant : 60% (12 category)
laptop : 50.05% (81 category)
With LSI : (reduce dimension to 1000)
restaurant : 60% (12 category)
laptop : 52% (81 category)
Need more time...
Seems no improvement ,
but the 1000 dimension seems preserve the essential info
Bigram
9000/? (origin size) |
3000 (unigram size) |
|
---|---|---|
restaurant | 61% | 61% |
laptop | 51% | 51% |
seems high input dimension is not a problem :p
15k/17k (origin size) |
3000 (unigram size) |
|
---|---|---|
restaurant | 61% | |
laptop | 51% |
Trigram
Nothing special :p
Two kinds of classifier
something like dimensionality reduction
use sample to classify directly
use sample to guess hidden model's param <learn a model>
use these models to predict unseen sample <classify>
Restaurant | Laptop | |
---|---|---|
Model | 3-class accuracy | 3-class accuracy |
TreeLSTM | 71.23% | 74.93% |
Sentinue (SemEval 2015 best) | 78.69% | 79.34% |
Seems like we're on the right track...
removed conflicting labels for different aspects in a sentence
accuracy tested on dev set of training data where conflicting labels are removed, so it cannot completely reflect real acc.
Restaurant | Laptop | |
---|---|---|
Model | 3-class accuracy | 3-class accuracy |
Our model | 83% | 84.5% |
Sentinue (SemEval 2015 best) | 78.69% | 79.34% |
Note: accuracy obtained through cross-validation
Results
Exciting results!!! Though we have yet to check whether the data for this year and last year are the same
sentence
Subtask1-slot1 SVM
See if sentence contains aspect
if yes
predict polarity by Subtask1-slot3 model
For each sentence:
Finally, combine all aspect and polarity pairs