Reference Resolution. Announcements. Last Time. 3/3 first part of the projects Example topics

Similar documents
Reference Resolution. Regina Barzilay. February 23, 2004

Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution

Coreference Resolution Lecture 15: October 30, Reference Resolution

08 Anaphora resolution

A Machine Learning Approach to Resolve Event Anaphora

Outline of today s lecture

807 - TEXT ANALYTICS. Anaphora resolution: the problem

Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases

Anaphora Resolution in Biomedical Literature: A

Anaphora Resolution in Hindi Language

ANAPHORIC REFERENCE IN JUSTIN BIEBER S ALBUM BELIEVE ACOUSTIC

Automatic Evaluation for Anaphora Resolution in SUPAR system 1

Hybrid Approach to Pronominal Anaphora Resolution in English Newspaper Text

Pronominal, temporal and descriptive anaphora

Towards a more consistent and comprehensive evaluation of anaphora resolution algorithms and systems

Dialogue structure as a preference in anaphora resolution systems

Anaphora Resolution. Nuno Nobre

The Reliability of Anaphoric Annotation, Reconsidered: Taking Ambiguity into Account

TEXT MINING TECHNIQUES RORY DUTHIE

Keywords Coreference resolution, anaphora resolution, cataphora, exaphora, annotation.

TURCOLOGICA. Herausgegeben von Lars Johanson. Band 98. Harrassowitz Verlag Wiesbaden

HS01: The Grammar of Anaphora: The Study of Anaphora and Ellipsis An Introduction. Winkler /Konietzko WS06/07

INFORMATION EXTRACTION AND AD HOC ANAPHORA ANALYSIS

ANAPHORA RESOLUTION IN HINDI LANGUAGE USING GAZETTEER METHOD

Houghton Mifflin English 2004 Houghton Mifflin Company Level Four correlated to Tennessee Learning Expectations and Draft Performance Indicators

Discourse Constraints on Anaphora Ling 614 / Phil 615 Sponsored by the Marshall M. Weinberg Fund for Graduate Seminars in Cognitive Science

Factivity and Presuppositions David Schueler University of Minnesota, Twin Cities LSA Annual Meeting 2013

CAS LX 522 Syntax I Fall 2000 November 6, 2000 Paul Hagstrom Week 9: Binding Theory. (8) John likes him.

Anaphora Resolution Exercise: An overview

Anaphora Resolution in Biomedical Literature: A Hybrid Approach

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

1. Read, view, listen to, and evaluate written, visual, and oral communications. (CA 2-3, 5)

An Introduction to Anaphora

Statistical anaphora resolution in biomedical texts

Natural Language Processing

Palomar & Martnez-Barco the latter being the abbreviating form of the reference to an entity. This paper focuses exclusively on the resolution of anap

Presupposition and Rules for Anaphora

Semantics and Pragmatics of NLP DRT: Constructing LFs and Presuppositions

Performance Analysis of two Anaphora Resolution System for Hindi Language

A Survey on Anaphora Resolution Toolkits

Paninian Grammar Based Hindi Dialogue Anaphora Resolution

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES. Design of Amharic Anaphora Resolution Model. Temesgen Dawit

Some observations on identity, sameness and comparison

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three Grade Five

SEVENTH GRADE RELIGION

Russell s Problems of Philosophy

Long-distance anaphora: comparing Mandarin Chinese with Iron Range English 1

Anaphora Resolution in Hindi: Issues and Directions

What is the Frege/Russell Analysis of Quantification? Scott Soames

1. Introduction. Against GMR: The Incredulous Stare (Lewis 1986: 133 5).

Early Russell on Philosophical Grammar

StoryTown Reading/Language Arts Grade 2

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Five. correlated to. TerraNova, Second Edition Level 15

A Linguistic Interlude

ELA CCSS Grade Three. Third Grade Reading Standards for Literature (RL)

ANAPHORA RESOLUTION IN MACHINE TRANSLATION

15 DEPENDENT CLAUSES. 1 Note that other alternatives than those shown here may be possible:

Anaphoric Deflationism: Truth and Reference

Comments on Lasersohn

Kai von Fintel (MIT)

AliQAn, Spanish QA System at multilingual

The Interpretation of Complement Anaphora: The Case of The Others

Models of Anaphora Processing and the Binding Constraints

An Empirical Study on the Generation of Anaphora in Chinese

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level 2002 Correlated to: West Virginia English Language Arts IGO s (Grade 8)

Bertrand Russell Proper Names, Adjectives and Verbs 1

Exercises Introduction to morphosyntax

ADAIR COUNTY SCHOOL DISTRICT GRADE 03 REPORT CARD Page 1 of 5

Houghton Mifflin Harcourt Collections 2015 Grade 8. Indiana Academic Standards English/Language Arts Grade 8

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Six. correlated to. TerraNova, Second Edition Level 16

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 8)

Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 7)

The Semantics and Pragmatics of Presupposition

Solutions for Assignment 1

Introduction to the Special Issue on Computational Anaphora Resolution

4) When are complex discourse entities constructed in the process of text comprehension?

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

9 th Grade English Placement Test

Correlation to Georgia Quality Core Curriculum

PAGE(S) WHERE TAUGHT (If submission is not text, cite appropriate resource(s))

Category Mistakes in M&E

Article selection and anaphora in the German relative clause Julian Grove and Emily Hanink University of Chicago

Coordination Problems

Circularity in ethotic structures

Introduction to Koiné Greek

What would count as Ibn Sīnā (11th century Persia) having first order logic?

CHAPTER III RESEARCH METHOD. source, data collection, subject of the research, and data analysis.

Resolving This-issue Anaphora

That's Your Evidence?: Using Mechanical Turk To Develop A Computational Account Of Debate And Argumentation In Online Forums

A Typology of Clause Combining

finagling frege Mark Schroeder University of Southern California September 25, 2007

By the Time Viewing relative progress or completion

Mandy Simons Carnegie Mellon University June 2010

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three. correlated to. IOWA TESTS OF BASIC SKILLS Forms M Level 9

An Analysis of Reference in J.K. Rowling s Novel: Harry Potter and the Half-Blood Prince

A Correlation of. To the. Language Arts Florida Standards (LAFS) Grade 3

RECIPIENT ENCODING IN SOUTHERN SELKUP ANJA HARDER, UNIVERSITY OF HAMBURG

III Knowledge is true belief based on argument. Plato, Theaetetus, 201 c-d Is Justified True Belief Knowledge? Edmund Gettier

On "deep and surface. anaphora. Eunice Pontes

Transcription:

Announcements Last Time 3/3 first part of the projects Example topics Segmentation Symbolic Multi-Strategy Anaphora Resolution (Lappin&Leass, 1994) Identification of discourse structure Summarization Anaphora resolution Clustering-based Coreference Resolution (Cardie&Wagstaff, 1999) Supervised ML Coreference Resolution + Clustering (Soon et al, 2001), (Ng&Cardie, 2002) Cue phrase selection Reference Resolution 1/30 Reference Resolution 3/30 Reference Resolution Reference Resolution Regina Barzilay regina@csail.mit.edu February 23, 2004 Captain Farragut was a good seaman, worthy of the frigate he commanded. His vessel and he were one. He was the soul of it. Coreference resolution: {the frigate, his vessel, it} Anaphora resolution: {his vessel, it} Coreference is a harder task! Reference Resolution 2/30

Observations Observations (Ng&Cardie 2002) 0,76,83,C,D,C,D,D,D,D,D,I,I,C,I,I,D,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,D,1,D,D,C,N,Y,D,D,D,20,20,D,D,-. 0,75,83,C,D,C,D,D,D,C,D,I,I,C,I,I,C,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,C,1,D,D,C,Y,Y,D,D,D,20,20,D,D,+. 0,74,83,C,D,C,D,D,D,D,D,I,I,C,I,I,D,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,D,1,D,D,C,N,Y,D,D,D,20,20,D,D,-. Feature selection plays an important role in classification accuracy: MUC-6 62.6% (Soon et al., 2001) Ng&Cardie, 2002) 69.1% Clustering operates over the results of hard clustering, which may negatively influence the final results Machine learning techniques rely on large amounts of annotated data: 30 texts All the methods are developed on the same corpus of newspaper articles Reference Resolution 5/30 Reference Resolution 7/30 Features (Soon et al, 2001) distance in sentences between anaphora and antecedent? Classification Rules antecedent in a pronoun? weak string identity between anaphora and antecedent? anaphora is a definite noun phrase? anaphora is a demonstrative pronoun? number agreement between anaphora and antecedent semantic class agreement anaphora and antecedent gender agreement between anaphora and antecedent anaphora and antecedent are both proper names? + 786 59 IF SOON-WORDS-STR = C + 73 10 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C SENTNUM <= 1 PRO- RESOLVE = C ANIMACY = C + 40 8 IF WNCLASS = C CONSTRAINTS = D PARANUM <= 0 PRO-RESOLVE = C + 16 0 IF WNCLASS = C CONSTRAINTS = D SENTNUM <= 1 BOTH-IN-QUOTES = I APPOSITIVE = C + 17 0 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C PARANUM <= 1 BPRONOUN-1 = Y AGREEMENT = C CONSTRAINTS = C BOTH-PRONOUNS = C + 38 24 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C SENTNUM <= 2 BOTH- PRONOUNS = D AGREEMENT = C SUBJECT-2 = Y + 36 8 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C BOTH-PROPER-NOUNS = C + 11 0 IF WNCLASS = C CONSTRAINTS = D SENTNUM <= 3 SUBJECT-1 = Y SUBJECT- 2 = Y SUBCLASS = D IN-QUOTE-2 = N BOTH-DEFINITES = I an alias feature an appositive feature Reference Resolution 4/30 Reference Resolution 6/30

Co-training Results Improvements for some types of references (Blum&Mitchell, 1998) 1. Given a small amount of training data, train two classifiers based on orthogonal set of features 2. Add to training set n instances on which both classifiers agree Definite noun phrases: from 19% to 28% (2000 training instances) No improvements for possessives, proper names and possessive pronouns Study of learning curves 3. Retrain both classifiers on the extended set 4. Return to step 2 Personal and possessive pronoun can be trained from very small training data (100 instances) Other types of references require large amounts of training data Reference Resolution 9/30 Reference Resolution 11/30 Today Co-training for Coreference Coreference does not support natural split of features Algorithm for feature splitting Minimizing amounts of training data: Train a classifier on each feature separately Co-training Weakly-supervised learning Hobbs algorithm Anaphora resolution in dialogs Select the best feature and assign it to the first view, and the second best feature assign to the second view Iterate over the remaining feature, and add them to one of the views Separate training for each reference type (personal pronouns, possessives,...) Reference Resolution 8/30 Reference Resolution 10/30

Example of Dialog Abstract Referents A1:..[he] i s nine months old... A2:..[He] i likes to dig around a little bit. A3:..[His mother] i mother comes in and says, why did you let [him] i [plays in the dirt] j. Webber (1990): each discourse unit produces a pseudo discourse entity proxy for its propositional content Abstract Pronoun interpretation: requires presentation of fact referents A4: I guess [[he] i s enjoying himself] k. B5: [That] k s right. B6: [It] j s healthy... Walker&Whittaker (1990): in problem-solving dialogs, people refer to aspects of the solution that were not explicitly mentioned (Byron, 2002) A1 Send engine to Elmira. A2 That s six hours. Reference Resolution 13/30 Reference Resolution 15/30 Anaphora In Spoken Dialogue Abstract Referents Differences between spoken and written text High frequency of anaphora Presence of Vague anaphora (Eckert&Strube 2000) 33% Presence of non-np-antecedents (Byron&Allen 1998) TRAINS93: 50% (Eckert&Strube 2000) SwitchBoard: 22% (Webber, 1988) (A0) Each Fall, penguins migrate to Fiji. (A1) That s where they wait out the winter. (A2) That s when it s cold even for them. (A3) That s why I m going there next month. (A4) It happens just before the eggs hutch. Presence of repairs, disfluences, abandoned utterances and so on... Reference Resolution 12/30 Reference Resolution 14/30

Activated Entities Semantic Constraints Generation of Multiple Proxies To load the boxcars/loading them takes an hour (infinitive or gerund phrase) I think he that he s an alien (the entire clause) Heavily-typed system Verb Senses (selectional restrictions) Load them into the boxcar (them has to be CARGO) I think that he s an alien (sentential) Predicate NPs That s a good route (that has to be a ROUTE) If he s an alien (Subordinate clause) Predicate Adjectives It s right (it has to be a proposition) Reference Resolution 17/30 Reference Resolution 19/30 Symbolic Approach Types of Speech Acts Pronominal Anaphora Resolution (Byron, 2002) Mentioned Entities referents nouns phrases Activated Entities entire sentences and nominals Discourse Entity attributes: Input: The surface linguistic constituent Type: ENGINE, PERSON,... Composition: hetero- or homogeneous Tell, Request, Wh-Questions, YN-Question, Confirm (1) The highway is closed (Tell) (2) Is the highway closed? (Y/N Question) (3) That s right. (4) Why is the highway closed? (WH-Q) (5) *That s right. Specificity: individual or kind Reference Resolution 16/30 Reference Resolution 18/30

Evaluation Features 10 dialogues, 557 utterances, 180 test pronouns Salience-based resolution: 37% Features induced for spoken dialogue: ante-exp-type [type of antecedent (NP, S, VP)] ana-np-pref [preference for NP arguments] Adding Semantic constraints: 43% Adding Abstract referents: 67% mdist-3mf3p [the number of NP-markables between anaphora and potential antecedent] ante-tfidf [the relative importance of the expression in the Smart Search order: 72% dialogues] Domain Independent Semantics: 51% average-ic [information content: neg. log of the total frequency of the word divided by number of words ] Reference Resolution 21/30 Reference Resolution 23/30 Example Knowledge-Lean Approach Engine 1 goes to Avon to get the oranges. (Strube&Muller 2003) (TELL (MOVE :theme x :dest y :reason (LOAD :theme w))) (the x (refers-to x ENG1)) Switchboard: 3275 sentences, 1771 turns, 16601 markables (the y (refers-to y AVON)) (the w (refers-to w ORANGES)) So it ll get there at 3 p.m. Data annotated with disfluency information Problematic utterances were discarded (ARRIVE :theme x :dest: y :time z) get there requires MOVABLE-OBJECT Approach: ML combines standard features with dialogue specific features Reference Resolution 20/30 Reference Resolution 22/30

Observations Example Coreference for speech processing is hard! New features for dialogue are required U1: Lyn s mother is a gardener. U2: Craige likes her. Prosodic featires seems to be useful Reference Resolution 25/30 Reference Resolution 27/30 Features Hobbs Algorithm F-measure: Fem&Masc Pronoun: 17.4% baseline, 17.25% Third Person Neuter Pronoun: 14.68%, 19.26% Third Person Plural: 28.30%, 28.70% Task: Pronoun resolution Features: Fully Syntactic Accuracy: 82% Reference Resolution 24/30 Reference Resolution 26/30

Algorithm Check Success: see if the contracted description picks up one entity from the context Choose Property: determine which properties of the referent would rule out the largest number of entities Extend Description: add the chosen properties to the description being constructed and remove relevant entities from the discourse. Reference Resolution 29/30 Anaphora Generation Statistical Generation (Reiter&Dale 1995) Application: Lexical choice for generation Framework: Context Set C = a 1, a 2,..., a n Properties: p k1, p k2,..., p km (Radev,1998): classification-based (Nenkova&McKeown,2003): HMM-based Goal: Distinguish Referent from the Rest Reference Resolution 28/30 Reference Resolution 30/30