Dimitrina Zlatkova (AI)
Daniel Kopev (AI)
Atanas Atanasov (AI)
SemEval-2018
Original:
So this happened :) with my girls @user and erinblonshine. ️ #29 #sacredhearttattoo @ Sacred…
Pattern replace:
So this happened smile with my girls @user and erinblonshine. ️ #29 #sacredhearttattoo @ Sacred…
Tokenize:
['so', 'this', 'happened', 'smile', 'with', 'my', 'girls', 'and', 'erinblonshine', 'sacred', 'heart', 'tattoo', 'sacred']
Char filter:
So this happened smile with my girls and erinblonshine#sacredhearttattoo Sacred
Stop words:
['happened', 'smile', 'girls', 'erinblonshine', 'sacred', 'heart', 'tattoo', 'sacred']
Lemmatize:
['happen', 'smile', 'girl', 'erinblonshine', 'sacred', 'heart', 'tattoo', 'sacred']
POS Tagger:
[('so','RB'),('this','DT'),('happened','VBD'),('smile','NN'), ('with','IN'),('my','PRP$'), ('girls','NNS'),('and','CC'), ('erinblonshine','NN'),('sacred','JJ'),('heart','NN'),('tattoo','NN'),('sacred','VBD')]
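The first stages of the example above (pattern replace, char filter, tokenize, stop words) can be sketched in plain Python. This is a minimal sketch, not the original pipeline: the `EMOTICONS` and `STOP_WORDS` tables and all function names are hypothetical, and hashtag segmentation, lemmatization and POS tagging are left out, so `sacredhearttattoo` stays a single token here.

```python
import re

# Hypothetical lookup tables; the real ones are not given in the slides.
EMOTICONS = {":)": "smile", ":(": "sad"}
STOP_WORDS = {"so", "this", "with", "my", "and"}


def pattern_replace(text):
    """Replace each known emoticon with a word describing it."""
    for emoticon, word in EMOTICONS.items():
        text = text.replace(emoticon, f" {word} ")
    return text


def char_filter(text):
    """Drop @mentions, numeric hashtags and characters that are not letters, '#' or whitespace."""
    text = re.sub(r"@\w*", " ", text)          # "@user" and a bare "@"
    text = re.sub(r"#\d+\b", " ", text)        # numeric hashtags such as "#29"
    text = re.sub(r"[^A-Za-z#\s]", " ", text)  # punctuation, ellipsis, emoji residue
    return re.sub(r"\s+", " ", text).strip()


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens):
    """Keep only the tokens that are not in the stop-word list."""
    return [t for t in tokens if t not in STOP_WORDS]


def preprocess(text):
    """Run the four stages in sequence."""
    return remove_stop_words(tokenize(char_filter(pattern_replace(text))))


tweet = "So this happened :) with my girls @user and erinblonshine. #29 #sacredhearttattoo @ Sacred…"
print(preprocess(tweet))
```

Splitting hashtags into words, as in the Tokenize output above, would need a vocabulary-based segmenter on top of this.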
- fun, sun
- fun, sun
- sun
- sun
- sun
- sun
- park
Positive:
Negative:
"pos_0", "pos_.15", "pos_.20", "pos_.27", "pos_.4", "pos_above"
"neg_0", "neg_.15", "neg_.25", "neg_.35", "neg_.6", "neg_above"
^010011000 | got qot gott g0t gotz qott gottt gawt ghot gotcho goht ggot |
^111010100010 | lmao lmfao lmaoo lmaooo lool rofl loool lmfaoo lmfaooo lmaoooo |
^111010100011 | haha hahaha hehe hahahaha hahah aha hehehe ahaha hah hahahah hahaa ahah |
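Word clusters like these are usually turned into features by looking up a token's bit-string path and emitting prefixes of it, so that nearby clusters (here `lmao` and `haha`) share the coarser features. A minimal sketch under that assumption; the prefix lengths, the feature naming and the abbreviated `CLUSTERS` table are illustrative only.

```python
# A few entries copied from the cluster rows above (abbreviated).
CLUSTERS = {
    "010011000": ["got", "qot", "gott", "g0t"],
    "111010100010": ["lmao", "lmfao", "lool", "rofl"],
    "111010100011": ["haha", "hahaha", "hehe"],
}

# Invert the table: word -> bit-string path.
WORD_TO_PATH = {word: path for path, words in CLUSTERS.items() for word in words}


def cluster_features(token, prefix_lengths=(4, 8, 12)):
    """Return one feature per path prefix, or an empty list for unknown tokens."""
    path = WORD_TO_PATH.get(token)
    if path is None:
        return []
    # Slicing past the end of a shorter path simply returns the whole path.
    return [f"brown_{n}_{path[:n]}" for n in prefix_lengths]


print(cluster_features("lmao"))
print(cluster_features("haha"))
```

With these prefix lengths, `lmao` and `haha` agree on the 4-bit and 8-bit features and differ only on the full path.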
Model | Precision | Recall | F1 Macro (%) |
Naive Bayes | 1.00 | 0.21 | 1.763 |
SVM (non-linear) | 1.00 | 0.21 | 1.763 |
Random Forest | 0.57 | 0.27 | 14.979 |
MLP | 0.41 | 0.26 | 17.173 |
StarSpace + NN |
(10k train, 1k test)
StarSpace
GloVe
Word2Vec
Model | Precision | Recall | F1 Macro (%) |
SVM (linear kernel, SGD) | 0.65 | 0.61 | 59.171 |
Standard BiLSTM (10 epochs) | 0.60 | 0.35 | 39.28 |
Convolutional LSTM | 0.60 | 0.30 | 40 |
Hierarchical Attention | 0.76 | 0.48 | 49 |
(488k train, 50k test)
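Macro F1 is the unweighted mean of the per-class F1 scores, so a model that predicts only a few classes scores low however accurate it is on those. The sketch below computes it in plain Python and reports it as a percentage, which is how the F1 Macro values in the tables appear to be scaled (an assumption); `macro_f1` is a hypothetical helper, not the official scorer.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1, as a percentage."""
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        scores.append(2 * precision * recall / denom if denom else 0.0)
    return 100.0 * sum(scores) / len(scores)


# A classifier that always predicts class 0 on three balanced classes.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 0, 0, 0, 0]
print(round(macro_f1(y_true, y_pred), 2))
```

In this toy case class 0 gets F1 = 0.5 and the other two classes get 0, so the macro average is one third of 0.5.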
valentine, loveofmylife, heart full, heart
cool kid, sunglasses, coolin, shade, cool, sunglass
ti season, christmastree, tree, christmas tree, merry christmas, merry, christmas
pretty pink, breast, pink, breast cancer
daze, beachin, sunshine state, fun sun, sunny day, sun, sunny, sunshine
veteran day, murica, veteran, america, ivoted, election, merica, vote, usa