arxiv
Facebook AI
2015.02
主要工作
- 构建数据集 bAbI-tasks 使用 QA 的方式评估阅读理解能力
dataset:babi
source:bAbI-tasks
Tasks
- answer 都是单词或者单词列表
- 无噪声,人类水准可以达到 100% 准确率
- 每个任务有 1000 个 question 用于训练,1000 个 question 用于测试
| # | Task | Class name |
|---|---|---|
| 1 | Basic factoid QA with single supporting fact | WhereIsActor |
| 2 | Factoid QA with two supporting facts | WhereIsObject |
| 3 | Factoid QA with three supporting facts | WhereWasObject |
| 4 | Two argument relations: subject vs. object | IsDir |
| 5 | Three argument relations | WhoWhatGave |
| 6 | Yes/No questions | IsActorThere |
| 7 | Counting | Counting |
| 8 | Lists/Sets | Listing |
| 9 | Simple Negation | Negation |
| 10 | Indefinite Knowledge | Indefinite |
| 11 | Basic coreference | BasicCoreference |
| 12 | Conjunction | Conjunction |
| 13 | Compound coreference | CompoundCoreference |
| 14 | Time manipulation | Time |
| 15 | Basic deduction | Deduction |
| 16 | Basic induction | Induction |
| 17 | Positional reasoning | PositionalReasoning |
| 18 | Reasoning about size | Size |
| 19 | Path finding | PathFinding |
| 20 | Reasoning about agent’s motivation | Motivations |
- Single Supporting Fact (1):用 1 个 supporting fact 就可以回答的 question
- Two or Three Supporting Facts (2, 3):用 2 个或者 3 个 supporting facts 可以回答的 question
- Two or Three Argument Relations (4, 5):回答 question 的关键是能分清楚 subjects 和 objects
- Yes/No Questions (6):回答 Yes/No 类型的问题的能力
- Counting and Lists/Sets (7, 8):计数、回答列表物件的能力
- Simple Negation and Indefinite Knowledge (9, 10):对否定句以及可能性的理解能力
- Basic Coreference, Conjunctions and Compound Coreference (11, 12, 13):对代指、多 subject 的理解能力
- Time Reasoning (14):时间推理能力
- Basic Deduction and Induction (15, 16):推理归纳能力
- Positional and Size Reasoning (17, 18):空间推理
- Path Finding (19):根据地理位置进行路径推理
- Agent’s Motivations (20):主体动机推理
提供了 3 种版本
- 正常英文
- Hindi 文
- 顺序打乱的英文
Example

训练
各种 baseline 
- AM:adaptive memory
- NG:N-grams
- NL:nonlinear matching function
读后感
就是提出了 babi-tasks 数据集,20 个 tasks 构造的思路很有意思,涵盖了语言理解能力的各种方向