Order-Planning Neural Text Generation from Structured Data

Lei Sha, Lili Mou, Tianyu Liu, Pascal Poupart, Sujian Li, Baobao Chang, Zhifang Sui
Institute of Computational Linguistics, Peking University
David R. Cheriton School of Computer Science, University of Waterloo
February 5, 2018

Table of Contents
1 Introduction
2 Generating Text from Structured Data
3 Experiments
4 Conclusion

Table-to-Text Brief Summary Generation
A table can be a list of RDF tuples:
John E Blaha    birthdate     1942,08,26
John E Blaha    birthplace    San Antonio
John E Blaha    occupation    Fighter pilot
San Antonio     located in    USA

Table-to-Text Brief Summary Generation
A table can also be a list of attributes (like a Wikipedia infobox).
Figure: An example of a Wikipedia infobox.

Table-to-Text Brief Summary Generation
Generating a brief summary from structured data is useful.
In the last step of a QA system, table-to-text is used to generate the answer.
Figure: Table-to-text in a question answering system. (Diagram labels: User, Question, Question Analysis, Search from KB, Table, Answer Generation, Answer Text.)

Table-to-Text Brief Summary Generation
Table-to-text can also be used to generate responses in a dialogue system.
Figure: Table-to-text in a dialogue system. (Diagram labels: Slot extraction, Intent tracking, State tracking, Search KB, Table-to-Text Generator.)

Table-to-Text Brief Summary Generation
We generate a brief summary for a Wikipedia infobox:
Sir Arthur Ignatius Conan Doyle (22 May 1859 – 7 July 1930) was a British writer best known for his detective fiction featuring the character Sherlock Holmes.

Table-to-Text Brief Summary Generation
Motivation:
Traditional: language-model-based generator
  Uses word-by-word probability: $P(w_t \mid w_{t-1})$
  This differs from the way humans generate text
Human: first plan the order, then write
  Uses field-by-field probability: $P(f_t \mid f_{t-1})$
We propose to build this human-like planning into machine learning models.
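
To make the field-by-field idea concrete, the notation $P(f_t \mid f_{t-1})$ can be read as a first-order chain over the sequence of fields mentioned in a summary. The factorization below is a sketch under that assumption (the slides only give the conditional itself; $T$ and $P(f_1)$ are introduced here for illustration):

```latex
P(f_1, \dots, f_T) \approx P(f_1) \prod_{t=2}^{T} P(f_t \mid f_{t-1})
```

Under this reading, a plan such as name, born, nationality, occupation is scored by multiplying the probabilities of its consecutive field transitions.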

Table-to-Text Brief Summary Generation
In our work, we use the attention mechanism to assist the generation process.
Content-based attention
  Use the last output word $y_{t-1}$ to predict the importance of each table content for the next output.
Link-based attention
  Determine which field we are going to generate at this step.
Hybrid attention
  Combine content-based and link-based attention.

Table-to-Text Brief Summary Generation
How do we build the field-by-field probability $P(f_t \mid f_{t-1})$?
The element in the i-th row and j-th column is the probability that field j occurs after field i.
Figure: Field-by-field probability matrix (link matrix).
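
One way to picture such a matrix is to estimate it from observed field orders. The sketch below is only an illustration: the slides indicate that the link matrix is tuned during training rather than counted, and the function name `estimate_link_matrix`, the `smoothing` parameter, and the toy sequences are all assumptions made for this example.

```python
import numpy as np

def estimate_link_matrix(field_sequences, fields, smoothing=1.0):
    """Return a matrix whose entry [i, j] estimates P(field j follows field i).

    Hypothetical count-based illustration; not the authors' learned matrix.
    """
    index = {name: k for k, name in enumerate(fields)}
    # Start from a small constant so that unseen transitions keep non-zero mass.
    counts = np.full((len(fields), len(fields)), smoothing, dtype=float)
    for seq in field_sequences:
        for prev, nxt in zip(seq, seq[1:]):
            counts[index[prev], index[nxt]] += 1.0
    # Normalize each row so it is a probability distribution over next fields.
    return counts / counts.sum(axis=1, keepdims=True)

fields = ["name", "born", "occupation", "nationality"]
# Toy field orders, invented for the example.
sequences = [
    ["name", "born", "nationality", "occupation"],
    ["name", "born", "occupation"],
    ["name", "nationality", "occupation"],
]
link = estimate_link_matrix(sequences, fields)
```

Each row of `link` sums to one, and in this toy data the transition from "name" to "born" receives the largest share of the "name" row.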

Table-to-Text Brief Summary Generation
However, we have more than 1400 different fields in our dataset, so tuning the full field-by-field matrix for every example is expensive.
Therefore, we extract a link sub-matrix for each input example.

Table-to-Text Brief Summary Generation
How do we build the link sub-matrix?
Figure: The process of selecting the link sub-matrix. The rows and columns of the full link matrix that correspond to the fields in the input table (in the example: Name, Born, Occupation, Nationality) form the link sub-matrix.
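
The selection step can be sketched with NumPy indexing. This is a minimal illustration, assuming the full link matrix is stored as a dense array and that a dictionary maps field names to row/column positions; `link_submatrix`, `field_index`, and the placeholder values are hypothetical names and data, not taken from the slides.

```python
import numpy as np

def link_submatrix(link, field_index, table_fields):
    """Pick the rows and columns of `link` for the fields present in one table."""
    idx = [field_index[f] for f in table_fields]
    # np.ix_ builds an open mesh so that rows and columns are selected together.
    return link[np.ix_(idx, idx)]

all_fields = ["name", "born", "died", "occupation", "nationality", "spouse"]
field_index = {f: k for k, f in enumerate(all_fields)}
# Placeholder values only, so that the selected entries are easy to trace.
link = np.arange(36, dtype=float).reshape(6, 6)

table_fields = ["name", "born", "occupation", "nationality"]
sub = link_submatrix(link, field_index, table_fields)
```

Only the 4 x 4 block for the four fields in this table is touched, which is the point of the sub-matrix when the full matrix covers more than 1400 fields.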

Table-to-Text Brief Summary Generation
We calculate the hybrid attention as follows:
(a) Encoder: Table Representation (LSTM)
(b) Dispatcher: Planning What to Generate Next
Example table in the figure:
Field         Content
Name          Arthur
Name          Ignatius
Name          Conan
Name          Doyle
Born          22
Born          May
Born          1859
Occupation    writer
Occupation    physician
Nationality   British
Figure: Illustration of content-based attention and link-based attention. (Diagram labels: content-based attention; last step's attention combined with the link (sub)matrix gives link-based attention; hybrid attention; weighted sum; attention vector.)
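
The dispatcher step can be sketched as follows. This is a simplified illustration under several assumptions that the slides do not spell out: the link-based scores are normalized with a softmax, the two attentions are mixed with a fixed scalar `gate`, and the content-based attention is given as an input rather than computed. The names `link_attention`, `hybrid_attention`, `row_fields`, and all numbers are hypothetical.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))
    return e / e.sum()

def link_attention(prev_attention, row_fields, link_sub):
    """Link-based attention over table rows (one row per field:content pair).

    row_fields[i] is the index (into link_sub) of the field of row i.
    """
    # pairwise[j, i] = link score for moving from the field of row j to the field of row i.
    pairwise = link_sub[np.ix_(row_fields, row_fields)]
    # Weight each transition by how much the previous step attended to row j.
    scores = prev_attention @ pairwise
    return softmax(scores)  # assumption: softmax normalization

def hybrid_attention(content_attention, link_att, gate):
    """Convex combination of the two attentions (assumption: fixed scalar gate)."""
    return gate * content_attention + (1.0 - gate) * link_att

# Rows: name, name, born, born, born, occupation (field indices 0, 1, 2).
row_fields = np.array([0, 0, 1, 1, 1, 2])
link_sub = np.array([[0.1, 0.7, 0.2],
                     [0.1, 0.2, 0.7],
                     [0.3, 0.3, 0.4]])
prev_attention = np.array([0.4, 0.4, 0.05, 0.05, 0.05, 0.05])
content_attention = softmax(np.array([0.1, 0.2, 1.5, 0.3, 0.2, 0.4]))

alpha_link = link_attention(prev_attention, row_fields, link_sub)
alpha_hybrid = hybrid_attention(content_attention, alpha_link, gate=0.6)
```

Because the link score depends only on the field of each row, all rows that share a field receive the same link-based weight; the content-based term is what distinguishes individual words within a field.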

Table-to-Text Brief Summary Generation
Then we generate text according to the hybrid attention.
Figure: The decoder in our model, which incorporates a copying mechanism. (Diagram labels: a chain of LSTM units running from <start> to <eos>; inputs to each LSTM step: table content, last LSTM state, embedding of the word generated in the last step, attention vector.)

Experiments
Overall performance of our model:
Figure: Comparison of the overall performance between our model and previous methods. (Note in the figure: best results in Lebret, Grangier, and Auli (2016).)

Experiments
Simple case study:
Figure: Case study. Left: Wikipedia infobox. Right: A reference and two sentences generated with different attention mechanisms (both with the copy mechanism).

Experiments
Visualization of attention probabilities in our model.
x-axis: generated words ("... ) was an american economist ...").
y-axis: field:content word pairs in the table (death place:united, death place:states, nationality:american, occupation:governor, occupation:of, occupation:the, occupation:federal, occupation:reserve, occupation:system, occupation:",", occupation:economics, occupation:professor, known for:expert).
Panels: (a) $\alpha_{content}$, (b) $\alpha_{link}$, (c) $\alpha_{hybrid}$.
Figure: Subplot (b) exhibits stripes because, by definition, link-based attention yields the same score for all content words with the same field.

Conclusion
We propose to add a human-like trait, namely field-by-field generation, to neural network models.
We propose the link-based attention mechanism to model the generation order of the fields.
We conduct a series of experiments and ablation tests to demonstrate our model's effectiveness.

Thank you. Any questions?