Reference Resolution. Regina Barzilay. February 23, 2004

Similar documents
Reference Resolution. Announcements. Last Time. 3/3 first part of the projects Example topics

Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution

Coreference Resolution Lecture 15: October 30, Reference Resolution

08 Anaphora resolution

A Machine Learning Approach to Resolve Event Anaphora

Outline of today s lecture

807 - TEXT ANALYTICS. Anaphora resolution: the problem

Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases

Anaphora Resolution in Hindi Language

ANAPHORIC REFERENCE IN JUSTIN BIEBER S ALBUM BELIEVE ACOUSTIC

Anaphora Resolution in Biomedical Literature: A

Automatic Evaluation for Anaphora Resolution in SUPAR system 1

Pronominal, temporal and descriptive anaphora

Hybrid Approach to Pronominal Anaphora Resolution in English Newspaper Text

Dialogue structure as a preference in anaphora resolution systems

Anaphora Resolution. Nuno Nobre

Towards a more consistent and comprehensive evaluation of anaphora resolution algorithms and systems

The Reliability of Anaphoric Annotation, Reconsidered: Taking Ambiguity into Account

TEXT MINING TECHNIQUES RORY DUTHIE

Keywords Coreference resolution, anaphora resolution, cataphora, exaphora, annotation.

TURCOLOGICA. Herausgegeben von Lars Johanson. Band 98. Harrassowitz Verlag Wiesbaden

INFORMATION EXTRACTION AND AD HOC ANAPHORA ANALYSIS

ANAPHORA RESOLUTION IN HINDI LANGUAGE USING GAZETTEER METHOD

Houghton Mifflin English 2004 Houghton Mifflin Company Level Four correlated to Tennessee Learning Expectations and Draft Performance Indicators

Discourse Constraints on Anaphora Ling 614 / Phil 615 Sponsored by the Marshall M. Weinberg Fund for Graduate Seminars in Cognitive Science

Factivity and Presuppositions David Schueler University of Minnesota, Twin Cities LSA Annual Meeting 2013

HS01: The Grammar of Anaphora: The Study of Anaphora and Ellipsis An Introduction. Winkler /Konietzko WS06/07

Anaphora Resolution Exercise: An overview

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

1. Read, view, listen to, and evaluate written, visual, and oral communications. (CA 2-3, 5)

Statistical anaphora resolution in biomedical texts

Natural Language Processing

Paninian Grammar Based Hindi Dialogue Anaphora Resolution

Palomar & Martnez-Barco the latter being the abbreviating form of the reference to an entity. This paper focuses exclusively on the resolution of anap

Anaphora Resolution in Biomedical Literature: A Hybrid Approach

Presupposition and Rules for Anaphora

An Introduction to Anaphora

CAS LX 522 Syntax I Fall 2000 November 6, 2000 Paul Hagstrom Week 9: Binding Theory. (8) John likes him.

Performance Analysis of two Anaphora Resolution System for Hindi Language

A Survey on Anaphora Resolution Toolkits

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

Semantics and Pragmatics of NLP DRT: Constructing LFs and Presuppositions

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES. Design of Amharic Anaphora Resolution Model. Temesgen Dawit

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three Grade Five

SEVENTH GRADE RELIGION

Russell s Problems of Philosophy

Long-distance anaphora: comparing Mandarin Chinese with Iron Range English 1

Anaphora Resolution in Hindi: Issues and Directions

What is the Frege/Russell Analysis of Quantification? Scott Soames

1. Introduction. Against GMR: The Incredulous Stare (Lewis 1986: 133 5).

Early Russell on Philosophical Grammar

StoryTown Reading/Language Arts Grade 2

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Five. correlated to. TerraNova, Second Edition Level 15

A Linguistic Interlude

ELA CCSS Grade Three. Third Grade Reading Standards for Literature (RL)

15 DEPENDENT CLAUSES. 1 Note that other alternatives than those shown here may be possible:

Some observations on identity, sameness and comparison

Anaphoric Deflationism: Truth and Reference

Comments on Lasersohn

ANAPHORA RESOLUTION IN MACHINE TRANSLATION

Kai von Fintel (MIT)

AliQAn, Spanish QA System at multilingual

Models of Anaphora Processing and the Binding Constraints

An Empirical Study on the Generation of Anaphora in Chinese

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level 2002 Correlated to: West Virginia English Language Arts IGO s (Grade 8)

Bertrand Russell Proper Names, Adjectives and Verbs 1

Exercises Introduction to morphosyntax

ADAIR COUNTY SCHOOL DISTRICT GRADE 03 REPORT CARD Page 1 of 5

Houghton Mifflin Harcourt Collections 2015 Grade 8. Indiana Academic Standards English/Language Arts Grade 8

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Six. correlated to. TerraNova, Second Edition Level 16

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 8)

Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 7)

Introduction to the Special Issue on Computational Anaphora Resolution

4) When are complex discourse entities constructed in the process of text comprehension?

Visual Analytics Based Authorship Discrimination Using Gaussian Mixture Models and Self Organising Maps: Application on Quran and Hadith

9 th Grade English Placement Test

Correlation to Georgia Quality Core Curriculum

PAGE(S) WHERE TAUGHT (If submission is not text, cite appropriate resource(s))

Category Mistakes in M&E

Article selection and anaphora in the German relative clause Julian Grove and Emily Hanink University of Chicago

The Interpretation of Complement Anaphora: The Case of The Others

Coordination Problems

Circularity in ethotic structures

Introduction to Koiné Greek

The Semantics and Pragmatics of Presupposition

Resolving This-issue Anaphora

That's Your Evidence?: Using Mechanical Turk To Develop A Computational Account Of Debate And Argumentation In Online Forums

A Typology of Clause Combining

finagling frege Mark Schroeder University of Southern California September 25, 2007

Mandy Simons Carnegie Mellon University June 2010

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three. correlated to. IOWA TESTS OF BASIC SKILLS Forms M Level 9

An Analysis of Reference in J.K. Rowling s Novel: Harry Potter and the Half-Blood Prince

A Correlation of. To the. Language Arts Florida Standards (LAFS) Grade 3

RECIPIENT ENCODING IN SOUTHERN SELKUP ANJA HARDER, UNIVERSITY OF HAMBURG

III Knowledge is true belief based on argument. Plato, Theaetetus, 201 c-d Is Justified True Belief Knowledge? Edmund Gettier

On "deep and surface. anaphora. Eunice Pontes

Argument Harvesting Using Chatbots

Recall. Validity: If the premises are true the conclusion must be true. Soundness. Valid; and. Premises are true

Correlates to Ohio State Standards

GRAMMAR IV HIGH INTERMEDIATE

Transcription:

Reference Resolution Regina Barzilay February 23, 2004

Announcements 3/3 first part of the projects Example topics Segmentation Identification of discourse structure Summarization Anaphora resolution Cue phrase selection Reference Resolution 1/30

Reference Resolution Captain Farragut was a good seaman, worthy of the frigate he commanded. His vessel and he were one. He was the soul of it. Coreference resolution: {the frigate, his vessel, it} Anaphora resolution: {his vessel, it} Coreference is a harder task! Reference Resolution 2/30

Last Time Symbolic Multi-Strategy Anaphora Resolution (Lappin&Leass, 1994) Clustering-based Coreference Resolution (Cardie&Wagstaff, 1999) Supervised ML Coreference Resolution + Clustering (Soon et al, 2001), (Ng&Cardie, 2002) Reference Resolution 3/30

Features (Soon et al, 2001) distance in sentences between anaphora and antecedent? antecedent in a pronoun? weak string identity between anaphora and antecedent? anaphora is a definite noun phrase? anaphora is a demonstrative pronoun? number agreement between anaphora and antecedent semantic class agreement anaphora and antecedent gender agreement between anaphora and antecedent anaphora and antecedent are both proper names? an alias feature an appositive feature Reference Resolution 4/30

Observations (Ng&Cardie 2002) 0,76,83,C,D,C,D,D,D,D,D,I,I,C,I,I,D,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,D,1,D,D,C,N,Y,D,D,D,20,20,D,D,-. 0,75,83,C,D,C,D,D,D,C,D,I,I,C,I,I,C,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,C,1,D,D,C,Y,Y,D,D,D,20,20,D,D,+. 0,74,83,C,D,C,D,D,D,D,D,I,I,C,I,I,D,N,N,D,C,D,D,N,N,N,N,N,C,Y, Y,D,D,D,C,0,D,D,D,D,D,D,D,1,D,D,C,N,Y,D,D,D,20,20,D,D,-. Reference Resolution 5/30

Classification Rules + 786 59 IF SOON-WORDS-STR = C + 73 10 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C SENTNUM <= 1 PRO RESOLVE = C ANIMACY = C + 40 8 IF WNCLASS = C CONSTRAINTS = D PARANUM <= 0 PRO-RESOLVE = C + 16 0 IF WNCLASS = C CONSTRAINTS = D SENTNUM <= 1 BOTH-IN-QUOTES = I APPOSITIVE = C + 17 0 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C PARANUM <= 1 BPRONOUN-1 = Y AGREEMENT = C CONSTRAINTS = C BOTH-PRONOUNS = C + 38 24 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C SENTNUM <= 2 BOTH PRONOUNS = D AGREEMENT = C SUBJECT-2 = Y + 36 8 IF WNCLASS = C PROPER-NOUN = D NUMBERS = C BOTH-PROPER-NOUNS = C + 11 0 IF WNCLASS = C CONSTRAINTS = D SENTNUM <= 3 SUBJECT-1 = Y SUBJECT 2 = Y SUBCLASS = D IN-QUOTE-2 = N BOTH-DEFINITES = I Reference Resolution 6/30

Observations Feature selection plays an important role in classification accuracy: MUC-6 62.6% (Soon et al., 2001) Ng&Cardie, 2002) 69.1% Clustering operates over the results of hard clustering, which may negatively influence the final results Machine learning techniques rely on large amounts of annotated data: 30 texts All the methods are developed on the same corpus of newspaper articles Reference Resolution 7/30

Today Minimizing amounts of training data: Co-training Weakly-supervised learning Hobbs algorithm Anaphora resolution in dialogs Reference Resolution 8/30

Co-training (Blum&Mitchell, 1998) 1. Given a small amount of training data, train two classifiers based on orthogonal set of features 2. Add to training set n instances on which both classifiers agree 3. Retrain both classifiers on the extended set 4. Return to step 2 Reference Resolution 9/30

Co-training for Coreference Coreference does not support natural split of features Algorithm for feature splitting Train a classifier on each feature separately Select the best feature and assign it to the first view, and the second best feature assign to the second view Iterate over the remaining feature, and add them to one of the views Separate training for each reference type (personal pronouns, possessives,...) Reference Resolution 10/30

Results Improvements for some types of references Definite noun phrases: from 19% to 28% (2000 training instances) No improvements for possessives, proper names and possessive pronouns Study of learning curves Personal and possessive pronoun can be trained from very small training data (100 instances) Other types of references require large amounts of training data Reference Resolution 11/30

Anaphora In Spoken Dialogue Differences between spoken and written text High frequency of anaphora Presence of Vague anaphora (Eckert&Strube 2000) 33% Presence of non-np-antecedents (Byron&Allen 1998) TRAINS93: 50% (Eckert&Strube 2000) SwitchBoard: 22% Presence of repairs, disfluences, abandoned utterances and so on... Reference Resolution 12/30

Example of Dialog A1:..[he] i s nine months old... A2:..[He] i likes to dig around a little bit. A3:..[His mother] i mother comes in and says, why did you let [him] i [plays in the dirt] j. A4: I guess [[he] i s enjoying himself] k. B5: [That] k s right. B6: [It] j s healthy... Reference Resolution 13/30

Abstract Referents (Webber, 1988) (A0) Each Fall, penguins migrate to Fiji. (A1) That s where they wait out the winter. (A2) That s when it s cold even for them. (A3) That s why I m going there next month. (A4) It happens just before the eggs hutch. Reference Resolution 14/30

Abstract Referents Webber (1990): each discourse unit produces a pseudo discourse entity proxy for its propositional content Abstract Pronoun interpretation: requires presentation of fact referents Walker&Whittaker (1990): in problem-solving dialogs, people refer to aspects of the solution that were not explicitly mentioned (Byron, 2002) A1 Send engine to Elmira. A2 That s six hours. Reference Resolution 15/30

Symbolic Approach Pronominal Anaphora Resolution (Byron, 2002) Mentioned Entities referents nouns phrases Activated Entities entire sentences and nominals Discourse Entity attributes: Input: The surface linguistic constituent Type: ENGINE, PERSON,... Composition: hetero- or homogeneous Specificity: individual or kind Reference Resolution 16/30

Activated Entities Generation of Multiple Proxies To load the boxcars/loading them takes an hour (infinitive or gerund phrase) I think he that he s an alien (the entire clause) I think that he s an alien (sentential) If he s an alien (Subordinate clause) Reference Resolution 17/30

Types of Speech Acts Tell, Request, Wh-Questions, YN-Question, Confirm (1) The highway is closed (Tell) (2) Is the highway closed? (Y/N Question) (3) That s right. (4) Why is the highway closed? (WH-Q) (5) *That s right. Reference Resolution 18/30

Semantic Constraints Heavily-typed system Verb Senses (selectional restrictions) Load them into the boxcar (them has to be CARGO) Predicate NPs That s a good route (that has to be a ROUTE) Predicate Adjectives It s right (it has to be a proposition) Reference Resolution 19/30

Example Engine 1 goes to Avon to get the oranges. (TELL (MOVE :theme x :dest y :reason (LOAD :theme w))) (the x (refers-to x ENG1)) (the y (refers-to y AVON)) (the w (refers-to w ORANGES)) So it ll get there at 3 p.m. (ARRIVE :theme x :dest: y :time z) get there requires MOVABLE-OBJECT Reference Resolution 20/30

Evaluation 10 dialogues, 557 utterances, 180 test pronouns Salience-based resolution: 37% Adding Semantic constraints: 43% Adding Abstract referents: 67% Smart Search order: 72% Domain Independent Semantics: 51% Reference Resolution 21/30

Knowledge-Lean Approach (Strube&Muller 2003) Switchboard: 3275 sentences, 1771 turns, 16601 markables Data annotated with disfluency information Problematic utterances were discarded Approach: ML combines standard features with dialogue specific features Reference Resolution 22/30

Features Features induced for spoken dialogue: ante-exp-type [type of antecedent (NP, S,VP)] ana-np-pref [preference for NP arguments] mdist-3mf3p [the number of NP-markables between anaphora and potential antecedent] ante-tfidf [the relative importance of the expression in the dialogues] average-ic [information content: neg. log of the total frequency of the word divided by number of words ] Reference Resolution 23/30

Features F-measure: Fem&Masc Pronoun: 17.4% baseline, 17.25% Third Person Neuter Pronoun: 14.68%, 19.26% Third Person Plural: 28.30%, 28.70% Reference Resolution 24/30

Observations Coreference for speech processing is hard! New features for dialogue are required Prosodic featires seems to be useful Reference Resolution 25/30

Hobbs Algorithm Task: Pronoun resolution Features: Fully Syntactic Accuracy: 82% Reference Resolution 26/30

Example U1: Lyn s mother is a gardener. U2: Craige likes her. Reference Resolution 27/30

Anaphora Generation (Reiter&Dale 1995) Application: Lexical choice for generation Framework: Context Set C = a 1,a 2,...,a n Properties: p k1,p k2,...,p km Goal: Distinguish Referent from the Rest Reference Resolution 28/30

Algorithm Check Success: see if the contracted description picks up one entity from the context Choose Property: determine which properties of the referent would rule out the largest number of entities Extend Description: add the chosen properties to the description being constructed and remove relevant entities from the discourse. Reference Resolution 29/30

Statistical Generation (Radev,1998): classification-based (Nenkova&McKeown,2003): HMM-based Reference Resolution 30/30