Zhang et al., ACL 2018: "Learning to Control the Specificity in Neural Response Generation"
Paper reading fest 20180819
Two major streams of research in NLP conversation modeling:
=> this paper: the generative conversational model
Approach
A conversation is a continuous sequence of utterance-response pairs, where the model tries to "translate" each input utterance into a response.
=> best case: a one-to-one match between utterance and response; in practice, models collapse to generic "safe" responses:
H: What's your name?
B: My name is B.
H: What's the weather like today?
B: I don't know.
H: Do you like her?
B: I don't know...
H: What do you know?
B: I don't kno....
Two major ways to go:
Retrieval-based: find the best-fit response from a pre-built collection
=> weak point: relies entirely on pre-existing responses
Generation-based: generate a new response word by word with a Seq2Seq model (this paper)
Response Specificity
Idea: introduce an explicit specificity control variable s into a Seq2Seq model to handle different utterance-response relationships in terms of specificity.
Using: GRU encoder-decoder.
The generation probability of each word combines two parts, renormalized over the vocabulary:

p(y_t = w | y_<t, X, s) ∝ p_se(w | y_<t, X) * p_sp(w | s)

- p_se denotes the semantic-based generation probability
- p_sp denotes the specificity-based generation probability
Each word w in the dataset has two representations:
- e_w: semantic representation (the regular word embedding)
- u_w: usage representation, mapped by a usage embedding matrix U

Decoder state update, with f(.) a GRU unit and e_{y_{t-1}} the semantic representation of the (t-1)-th generated word:

h_t = f(h_{t-1}, e_{y_{t-1}})

Semantic-based generation probability, with e_w the vector of the word w:

p_se(y_t = w | y_<t, X) ∝ exp(e_w . h_t)
Specificity-based generation probability, using a Gaussian kernel:

p_sp(y_t = w | s) ∝ exp( -(u_w - s)^2 / (2 * sigma^2) )

- u_w: usage score of word w, squashed into [0, 1] with a sigmoid function
- s: the specificity control variable, value in [0, 1]
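A minimal NumPy sketch of one decoding step under the reconstruction above; the projection w_u, the kernel width sigma, and the sigmoid mapping of usage embeddings to a scalar are illustrative assumptions, not the paper's exact parametrization:

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def word_distribution(h_t, E, U, w_u, s, sigma=0.3):
    # h_t : (d,)   decoder GRU state at step t
    # E   : (V, d) semantic embeddings e_w, one row per vocabulary word
    # U   : (V, k) usage embeddings u_w from the usage embedding matrix
    # w_u : (k,)   hypothetical projection of usage embeddings to a scalar
    # s   : specificity control variable in [0, 1]; sigma: assumed kernel width
    sem_score = E @ h_t                                 # semantic score e_w . h_t
    usage = sigmoid(U @ w_u)                            # per-word usage score in [0, 1]
    spec_score = -(usage - s) ** 2 / (2 * sigma ** 2)   # log of the Gaussian kernel
    logits = sem_score + spec_score                     # product of the two parts in prob space
    logits -= logits.max()                              # numerical stability
    p = np.exp(logits)
    return p / p.sum()                                  # renormalize over the vocabulary

rng = np.random.default_rng(0)                          # toy check: V=5, d=4, k=3
p = word_distribution(rng.normal(size=4), rng.normal(size=(5, 4)),
                      rng.normal(size=(5, 3)), rng.normal(size=3), s=0.9)
print(p.sum())                                          # 1.0: a valid distribution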
Training: maximize the log-likelihood over the training set D:

L(θ) = Σ_{(X,Y) in D} log p(Y | X, s; θ)

- θ denotes all the model parameters
- X, Y denote an utterance-response pair from the training set D
- s denotes the specificity control variable => needs to be computed for each (X, Y) pair; two corpus-based estimates below
Estimate 1 - Normalized Inverse Response Frequency (NIRF). Inverse response frequency of a response Y:

IRF_Y = log( |R| / f_Y )

where:
- |R| denotes the size of the response collection R
- f_Y denotes the corpus frequency of response Y in R
- Y is a response in the response collection R
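A small Python sketch of the NIRF estimate, including the min-max normalization noted below (hypothetical helper; exact-string matching assumed for response frequency):

import math
from collections import Counter

def nirf_scores(responses):
    freq = Counter(responses)                    # f_Y: corpus frequency of Y in R
    n = len(responses)                           # |R|: size of the collection
    irf = {y: math.log(n / f) for y, f in freq.items()}
    lo, hi = min(irf.values()), max(irf.values())
    span = (hi - lo) or 1.0                      # guard against a constant corpus
    return {y: (v - lo) / span for y, v in irf.items()}  # min-max into [0, 1]

R = ["i don't know", "i don't know", "i don't know",
     "my name is b", "the weather is sunny today"]
print(nirf_scores(R))  # generic responses get low s, specific ones high s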
Normalization: both estimates are min-max scaled into [0, 1] over the whole collection.

Estimate 2 - Normalized Inverse Word Frequency (NIWF). Inverse word frequency of a word y, with y a word in a response Y in collection R:

IWF_y = log(1 + |R|) / f_y

where:
- f_y denotes the number of responses in R containing the word y

So the IWF of a response Y is the maximum over its words:

IWF_Y = max_{y in Y} IWF_y
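And the matching sketch for the NIWF estimate (hypothetical helper; whitespace tokenization assumed):

import math
from collections import Counter

def niwf_scores(responses):
    tokenized = [y.split() for y in responses]
    n = len(responses)                                        # |R|
    df = Counter(w for toks in tokenized for w in set(toks))  # f_y: responses containing y
    raw = [max(math.log(1 + n) / df[w] for w in toks)         # max IWF over words in Y
           for toks in tokenized]
    lo, hi = min(raw), max(raw)
    span = (hi - lo) or 1.0
    return [(v - lo) / span for v in raw]                     # min-max into [0, 1]

R = ["i don't know", "i don't know", "my name is b",
     "the weather is sunny today"]
print(list(zip(R, niwf_scores(R))))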
Evaluation points:
- distinct-1 & distinct-2: number of distinct unigrams and bigrams in the generated responses (higher = more diverse, fewer generic replies)
- BLEU score: n-gram overlap between generated and reference responses
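The notes only say "count"; the common distinct-n formulation divides the distinct count by the total number of generated n-grams, as in this sketch (hypothetical helper, not the paper's evaluation script):

def distinct_n(responses, n):
    ngrams = []
    for r in responses:
        toks = r.split()
        ngrams += [tuple(toks[i:i + n]) for i in range(len(toks) - n + 1)]
    return len(set(ngrams)) / max(len(ngrams), 1)   # distinct / total n-grams

gen = ["i don't know", "i don't know", "the weather is sunny today"]
print(distinct_n(gen, 1), distinct_n(gen, 2))       # distinct-1, distinct-2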