论文

Question Generation for Question Answering

Nan Duan
Microsoft Research Asia
2017
原文链接

这篇是从长文章中利用两种方式CNN和RNN来生成高质量的问题，并利用生成的这些问题来进一步提高QA的表现，这篇和下一篇是一个作者的文章。

数据集：SQuAD, MS MARCO, and WikiQA

结构：
整个QG结构分为四个部分，Question Pattern Mining，Question Pattern Prediction，Question Topic Selection，Question Ranking

Question Pattern Mining
将问句输入YahooAnswers获得一系列相关的问题，这些问题构成了一个question cluster。在这个cluster中，n gram出现的频率越高越有可能是topic word
Question Pattern Prediction
给定一篇文章，利用一些关键短语，eg co-founded by -> who found # ?，预测相关的question pattern有哪些。训练数据格式为 ${<A,Q_p,Q_t>}$ ，预测时输入为 ${<S,Q_p>}$ 。
A是answer， ${Q_p}$ 是question pattern， ${Q_t}$ 是question type

– Attention + CNN 将predict视为一个ranking的任务
– Attention + Bi-GRU RNN 将predict视为一个生成任务
Question Topic Selection
– 对于第一种attention + CNN，先利用freebase提取实体和stanford sparser提取名词词组作为question type的候选，然后计算这些候选qt与预测的question type对应的每个historical question topic的相似度，取相似度较高的topic。
– 对于第二种attention + Bi-GRU RNN，对于S中所有单词计算S与 ${Q_p}$ 的attention概率来选取topic。这里softmax输出维度只用前一部分频率最高的question word，对于OOV（out of vocabulary），用attention probability最高的单词替换，类似一种copy机制。
Question Ranking
question pattern prediction score + question topic selection score + QA matching score + word overlap between Q and S + question pattern frequency这五个分数的线性之和