Update #4: CNN results and answer extraction methodology
August 11th, 2016
Results
First Run:
MAP = 0.2503
MRR = 0.2503
Results
MAP = 0.2394
MRR = 0.2394
Second Run:
General Metodologies
Metodology 1.1 (almost question "agnostic")
Idea: Use features that extract lexical, syntactical and semantical structure of sentence, question and answer to train a classifier.
For each word in answers:
Metodology 1.1 (almost question "agnostic")
Idea: Use features that extract lexical, syntactical and semantical structure of sentence, question and answer to train a classifier.
Example: "it"
(False, u'It', u'PRP', u'O', 'whom', '', '', '', u'is', u'VBZ', u'O', u'INANIMATE', u'SINGULAR', u'NEUTRAL', u'PRONOMINAL')
Metodology 1.1 (almost question "agnostic")
Random forest classifier
Parameters:
Results:
502,419 | 5,853 |
60,095 | 5,335 |
True
Pred
0
1
0
1
Metodology 1.1 (almost question "agnostic")
Random forest classifier
Parameters:
Results:
502,671 | 5,644 |
60,352 | 5,345 |
True
Pred
0
1
0
1
Metodology 1.2 (question sensitive)
Idea: Use features that extract lexical, syntactical and semantical structure of sentence, question and answer to train a classifier.
For each word in answers:
Metodology 1.2 (question sensitive)
Idea: Use features that extract lexical, syntactical and semantical structure of sentence, question and answer to train a classifier.
Example: "it"
(False, u'It', u'PRP', u'O', False, 'whom', '', '', '', '', '', u'is', u'VBZ', u'O', False, False, u'replica', u'NN', u'O', u'nsubj', False, False, u'INANIMATE', u'SINGULAR', u'NEUTRAL', u'PRONOMINAL')
What's next?