AliQAn, Spanish QA System at multilingual

Similar documents
The UPV at 2007

Question Answering. CS486 / 686 University of Waterloo Lecture 23: April 1 st, CS486/686 Slides (c) 2014 P. Poupart 1

Dialogue structure as a preference in anaphora resolution systems

Automatic Evaluation for Anaphora Resolution in SUPAR system 1

Anaphora Resolution in Hindi Language

ANAPHORIC REFERENCE IN JUSTIN BIEBER S ALBUM BELIEVE ACOUSTIC

Palomar & Martnez-Barco the latter being the abbreviating form of the reference to an entity. This paper focuses exclusively on the resolution of anap

Identifying Anaphoric and Non- Anaphoric Noun Phrases to Improve Coreference Resolution

PAGE(S) WHERE TAUGHT (If submission is not text, cite appropriate resource(s))

Anaphora Resolution in Biomedical Literature: A

TEXT MINING TECHNIQUES RORY DUTHIE

1. Read, view, listen to, and evaluate written, visual, and oral communications. (CA 2-3, 5)

Reference Resolution. Regina Barzilay. February 23, 2004

Reference Resolution. Announcements. Last Time. 3/3 first part of the projects Example topics

SEVENTH GRADE RELIGION

Performance Analysis of two Anaphora Resolution System for Hindi Language

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three Grade Five

Anaphora Resolution. Nuno Nobre

Outline of today s lecture

08 Anaphora resolution

Hybrid Approach to Pronominal Anaphora Resolution in English Newspaper Text

Artificial Intelligence. Clause Form and The Resolution Rule. Prof. Deepak Khemani. Department of Computer Science and Engineering

ANAPHORA RESOLUTION IN HINDI LANGUAGE USING GAZETTEER METHOD

ELA CCSS Grade Three. Third Grade Reading Standards for Literature (RL)

Correlates to Ohio State Standards

Natural Language Processing (NLP) 10/30/02 CS470/670 NLP (10/30/02) 1

Discourse Constraints on Anaphora Ling 614 / Phil 615 Sponsored by the Marshall M. Weinberg Fund for Graduate Seminars in Cognitive Science

ANAPHORA RESOLUTION IN MACHINE TRANSLATION

Resolving Direct and Indirect Anaphora for Japanese Definite Noun Phrases

Coreference Resolution Lecture 15: October 30, Reference Resolution

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Five. correlated to. TerraNova, Second Edition Level 15

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 8)

Prentice Hall Literature: Timeless Voices, Timeless Themes, Bronze Level '2002 Correlated to: Oregon Language Arts Content Standards (Grade 7)

Towards a more consistent and comprehensive evaluation of anaphora resolution algorithms and systems

Georgia Quality Core Curriculum 9 12 English/Language Arts Course: American Literature/Composition

Anaphora Resolution Exercise: An overview

INFORMATION EXTRACTION AND AD HOC ANAPHORA ANALYSIS

TURCOLOGICA. Herausgegeben von Lars Johanson. Band 98. Harrassowitz Verlag Wiesbaden

4.7 Constructing Categorical Propositions

Champion Teacher Index

Semantics and Pragmatics of NLP DRT: Constructing LFs and Presuppositions

A Correlation of. To the. Language Arts Florida Standards (LAFS) Grade 3

Prentice Hall The American Nation: Beginnings Through 1877 '2002 Correlated to: Chandler USD Social Studies Textbook Evaluation Instrument (Grade 8)

***** [KST : Knowledge Sharing Technology]

Houghton Mifflin Harcourt Collections 2015 Grade 8. Indiana Academic Standards English/Language Arts Grade 8

Reductio ad Absurdum, Modulation, and Logical Forms. Miguel López-Astorga 1

StoryTown Reading/Language Arts Grade 3

Early Russell on Philosophical Grammar

Houghton Mifflin English 2004 Houghton Mifflin Company Grade Six. correlated to. TerraNova, Second Edition Level 16

Introduction. I. Proof of the Minor Premise ( All reality is completely intelligible )

A New Parameter for Maintaining Consistency in an Agent's Knowledge Base Using Truth Maintenance System

DOWNLOAD OR READ : THE LOGIC BOOK PDF EBOOK EPUB MOBI

A Correlation of. To the. Language Arts Florida Standards (LAFS) Grade 5

Strand 1: Reading Process

Coordination Problems

Arkansas English Language Arts Standards

Information Extraction. CS6200 Information Retrieval (and a sort of advertisement for NLP in the spring)

CURRICULUM MAP. Spanish Three

ADAIR COUNTY SCHOOL DISTRICT GRADE 03 REPORT CARD Page 1 of 5

Pronominal, temporal and descriptive anaphora

StoryTown Reading/Language Arts Grade 2

Georgia Quality Core Curriculum 9 12 English/Language Arts Course: Ninth Grade Literature and Composition

Houghton Mifflin ENGLISH Grade 5 correlated to West Virginia Instructional Goals and Objectives

SB=Student Book TE=Teacher s Edition WP=Workbook Plus RW=Reteaching Workbook 47

9 Knowledge-Based Systems

Universitas Saraviensis Project Seminar Text Mining for Historical Documents Antonia Scheidel February An Introduction To Ontologies

The Structure Of Argument 8th Edition

ELA CCSS Grade Five. Fifth Grade Reading Standards for Literature (RL)

In this section you will learn three basic aspects of logic. When you are done, you will understand the following:

Keywords Coreference resolution, anaphora resolution, cataphora, exaphora, annotation.

2.1 Review. 2.2 Inference and justifications

Prentice Hall Literature: Timeless Voices, Timeless Themes, Silver Level 2002 Correlated to: West Virginia English Language Arts IGO s (Grade 8)

South Carolina English Language Arts / Houghton Mifflin English Grade Three

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Four. correlated to. IOWA TESTS OF BASIC SKILLS Forms M Level 10

Instructor s Manual 1

Some observations on identity, sameness and comparison

Proceedings of the Meeting & workshop on Development of a National IT Strategy Focusing on Indigenous Content Development

1. Introduction Formal deductive logic Overview

Gordon-Conwell Theological Seminary Boston Campus Center for Urban Ministerial Education

Anaphoric Deflationism: Truth and Reference

Houghton Mifflin ENGLISH Grade 5 correlated to Indiana Language Arts Standard

ASSEMBLIES OF GOD THEOLOGICAL SEMINARY BGR 611 INDUCTIVE STUDIES IN THE GREEK NEW TESTAMENT. Professor: James D. Hernando Fall, 2008.

Modal verbs. Certain, probable or possible

UNIVERSIDAD AMÉRICA LATINA Estudios Universitarios Abiertos de México Bachillerato General en la Modalidad No Escolarizada

Reflexive Pronouns GRAMMAR

Trinity First Lutheran School 3 rd Grade Curriculum Plan Ms. Anna Schield

Correlation to Georgia Quality Core Curriculum

Artificial Intelligence: Valid Arguments and Proof Systems. Prof. Deepak Khemani. Department of Computer Science and Engineering

Artificial Intelligence Prof. P. Dasgupta Department of Computer Science & Engineering Indian Institute of Technology, Kharagpur

ADDIS ABABA UNIVERSITY SCHOOL OF GRADUATE STUDIES. Design of Amharic Anaphora Resolution Model. Temesgen Dawit

Anaphora Resolution in Biomedical Literature: A Hybrid Approach

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 3 Correlated with Common Core State Standards, Grade 3

PHILOSOPHY AND RELIGIOUS STUDIES

Artificial Intelligence Prof. Deepak Khemani Department of Computer Science and Engineering Indian Institute of Technology, Madras

A Survey on Anaphora Resolution Toolkits

Grade 7. correlated to the. Kentucky Middle School Core Content for Assessment, Reading and Writing Seventh Grade

A Machine Learning Approach to Resolve Event Anaphora

Front Range Bible Institute

Macmillan/McGraw-Hill SCIENCE: A CLOSER LOOK 2011, Grade 1 Correlated with Common Core State Standards, Grade 1

Houghton Mifflin English 2001 Houghton Mifflin Company Grade Three. correlated to. IOWA TESTS OF BASIC SKILLS Forms M Level 9

Transcription:

AliQAn, Spanish QA System at multilingual QA@CLEF-2008 R. Muñoz-Terol, M.Puchol-Blasco, M. Pardiño, J.M. Gómez, S.Roger, K. Vila, A. Ferrández, J. Peral, P. Martínez-Barco Grupo de Investigación en Procesamiento del Lenguaje Natural y Sistemas de Información Natural Language Processing and Information Systems Group Department of Software and Computing Systems University of Alicante, Spain rafamt,marcel,maria,jmgomez,sroger,kvila,antonio,jperal,patricio@dlsi.ua.es Abstract In QA@CLEF 2008, we participate in monolingual (Spanish) and multilingual (English - Spanish) tasks. Specifically, in this paper, we will tackle with the English - Spanish QA task. In this edition we will deal with two main problems: an heterogeneous document collection (news articles and Wikipedia) and a large number of topic-related questions, which make somewhat difficult our participation. We want to highlight in the translation module in our system two possible mechanisms: one based on logic forms, and the other, on machine translation techniques. In addition, it has also been used a system of anaphora resolution that it is described below and a QA System, AliQAn (also used this year in the monolingual task). 1 Introduction For our participation in this task we have used a monolingual QA system: AliQAn; a translation system based on logic forms and an anaphora resolution system to address the interrelated questions and answers. The system AliQAn was widely described by Roger et al. [8] after their participation in the competition of the Spanish CLEF 2005. It is important to emphasize that this system has been adapted to work in this year s task monolingual too, since there are a great number of changes made in Roger et al. [9]. The main problems that contributed to make difficult our participation were the processing of Wikipedia (this corpus is different from those used to work with AliQAn) and the large number of questions related. In this research work, our QA system applies two different methods to perform the translation of the questions from one language to another. The first one consists in applying natural language processing techniques based on the formal representation of questions, by using logic forms [10] and, the second one, simply consists in using machine translation techniques. So, the main goal consists in comparing both methods in the framework of the efficiency of the question answering system to solve the question answering problem. The anaphora resolution problem has been introduced in the last two years in QA task, increasing the number of questions with anaphoric relationships since last year. As a consequence, a This paper has been partially supported by the Spanish government, project TIN-2006-15265-C06-01 and project GV06/028, and by the framework of the project QALL-ME, which is a 6th Framework Research Programme of the European Union (EU), contract number: FP6-IST-033860, by the Generalitat Valenciana throught the research grants BFPI/2008/093 and BFPI06/182 and by the University of Comahue under the project 04/E062. 1

spanish rule-based anaphora resolution system for QA has been implemented to solve this problem. In our system, three kinds of anaphora can be solved: pronoun anaphora, definite descriptions, and zero anaphora. The results obtained by our anaphora resolution system are around 40% precision, using the questions translated from English into Spanish. 2 Background Previous research works such as Bowden et al. [1] directly apply machine translation methods to translate the questions. On the contrary, and in order to perform the translation of the questions from one language to other one, the research work developed by Ferrández and Ferrández [4] and the PhD thesis defended by Ferrández [3] exhibits that the use of machine translation techniques to perform the translation of questions from one language to another decreases the efficiency of the multilingual question answering system versus other approaches based on natural language processing techniques. 3 Description of the System 3.1 English-Spanish Translation based on Logic Forms and lexical resources The translation of questions from English into Spanish is performed by inferring the logic form of the questions and using lexical resources to translate the predicates of the logic form. The technique applied to infer the logic form of the questions is the one developed by Terol et al. [10]. Consequently, this translation technique is performed as follows: The predicates whose types are corresponded to noun or verb are translated using the EuroWordNet [11] lexical resource. The connection between the synsets of the English and Spanish WordNets is performed in a similar way as treated by Ferrandez et al. [5] using the Inter-Lingual-Index (ILI). Each one of the synsets that are mapped from English into Spanish towards ILI contain a set of concepts. The processing consists in counting the occurrences of the different concepts that appear in the mapping process and, finally, the concept with the highest number of occurrences is chosen as the translation of the predicate. The predicates whose type is instantiated to adjective, adverb or preposition, and the predicates treated in the previous step that are not translated by EuroWordNet are translated applying the English-Spanish Babylon dictionary 1. As it occurred in the previous step, the dictionary can return a set of different translations grouped in synsets (different from WordNet synsets). Thus, the processing consists in counting the occurrences of the different translations that the dictionary returns and, finally, the translation with the highest number of occurrences is chosen as the translation of the predicate. Finally, the rest of predicates and the ones than are not translated in the two previous steps are definitely translated by using the Google Translation Toolkit 2. Once the predicates of the logic form are translated according to the recently detailed rules, the last translation task consists in translating the question as a result of applying some English- Spanish contrastive grammar rules to the sequence of translations of predicates of the logic form. The applied English-Spanish contrastive grammar rules are based on the ones derived from the 1 http://www.babylon.com 2 http://www.google.com/translate t 2

previous study developed by Fernandez et al. [2] and Martinez-Vazquez [6]. The applied English- Spanish contrastive grammar rules are detailed in Table 1, and an example of translation, applying each one of these rules, is explained in Table 2. Note that the rule number four is recurrent because a complex nominal (NNC) can recurringly derive others. Rule Id. English Structure Spanish Translation 1 JJ + NN Translation(NN) + Translation(JJ) 2 JJ 1 + JJ 2 + NN Translation(JJ 1 ) + Translation(NN) + Translation(JJ 2 ) 3 NN 1 + NN 2 Translation(NN 1 ) + Translation(NN 2 ) 4 NN + NNC Translation(NNC) + de + Translation(NN) 5 JJ + NN 1 + NN 2 Translation(NN 2 ) + Translation(JJ) + de + Translation(NN 1 ) Table 1: Applied English-Spanish contrastive grammar rules Rule Id. English Expression Spanish Translation 1 red car coche rojo 2 beautiful green eyes bonitos ojos verdes 3 electronic database base de datos electrnica 4 train station ticket office oficina de billete de estacin de tren 5 multiple regression model modelo mltiple de regresin Table 2: Example of the applied English-Spanish contrastive grammar rules Finally, the translation performed by the rest of predicates of the logic form, whose logic structure does not match these English-Spanish contrastive grammar rules, consists in the concatenation of the sequence of translations of these predicates. 3.2 Anaphora Resolution As last year, anaphora resolution problem has been introduced this year. Questions are grouped in topics. In those topics, the first question is anaphora-free, but the other questions may require information from data contained in the first question, or in the first answer (it can only be the first question or the first answer for a specific topic, exactly as described in the guidelines for this year). From our analysis of Spanish examples used last year, we have discovered three types of possible anaphora: pronoun anaphora, definite descriptions, and zero anaphora. These types of anaphora have been analyzed and are the basis for our system. Our approach for anaphora resolution is rule-based. Basically, it is based on the papers referenced in Mitkov [7]. The first step is to get the Part-of-Speech tagging for possible antecedents (the first question or the first answer for an specific topic) and a question with a possible anaphora. Later, an anaphora detection module is executed. This module can be considered as three different subsystems in which each subsystem corrsponds to: pronoun anaphora detection, definite description detection and zero anaphora detection. In the first one, pronouns are taken as anaphora. In definite descriptions, noun phrases are taken as a possible anaphora. And, in zero anaphora, the entire phrase can be taken as a possible anaphora. For all anaphora resolution modules, for each topic group, noun phrases are extracted from the first question-answer pair as possible antecedent. Then, when we have a pronoun anaphora, the possible anaphora is compared in gender and number with each possible antecedent. In definite descriptions and zero anaphora, for each noun phrase, a google search is launched joining the possible antecedent and the main words of the anaphora (or main words contained in the noun phrases, in case of zero anaphora). Later, for each noun phrase in the possible anaphora, 3

the relations between it and the main words contained in the antecedents are extracted using MultiWordNet 3. In all previous cases, a specific weight is assigned to each possible anaphora resolution. Later, those weights are ordered, and the best case is selected as anaphora resolution for the related question. It is important to mention that if the noun phrase is in the answer, the greater weight is assigned to it, due to the analysis done in the last year corpus. A few manual experiments are performed using the last year Spanish corpus. The results obtained are around 56% precision. The results obtained using the corpus for this year, once the questions are translated from English into Spanish, are worse, obtaining around 40% precision. 4 Results and Conclusions The scores obtained in the application of the machine translation techniques are a bit better than the ones obtained in the application of the techniques based on logic forms. This can be due to the fact that the use of logic forms is a good method to perform the language-independent knowledge representation, but this method must be improved to perform the translation of sentences from one language to another. As a further work, the next research goal will be to improve the logic form processing methods in the translation process. We have had some problems with the anaphora resolution system, because it is a rule-based system and it needs well-formed sentences to work. Due to the fact that translations offered by the system are not well-formed, in most cases, the precision obtained by the anaphora resolution system has decreased considerably, arriving at around 40% precision. Finally, we present the results obtained with our system in this task in Tables 3 and 4. Specifically, Tables 3 shows the number of right answers(r), wrong answers (W), inexact answers (X), unsupported answers (U) obtained by our system. Run ID #Questions #R #W #X #U Run 1 (machine translation) 200 25 173 0 2 Run 2 (logic forms) 200 18 176 3 3 Table 3: Results obtained by our system at English-Spanish QA task 2008 Table 4 shows the comparative of the values obtained for MRR, CSW and Accuracy with both runs. Run ID # Questions Accuracy(%) CSW MRR Run 1 (machine translation) 200 12.5 0.01114 0.17797 Run 2 (logic forms) 200 9.0 0.00626 0.11499 Table 4: Results obtained by our system at English-Spanish QA task 2008 References [1] Mitchell Bowden, Marian Olteanu, Pasin Suriyentrakorn, Thomas d Silva, and Dan Moldovan. Multilingual question answering through intermediate translation: Lcc s poweranswer at qa@clef 2007. In Working Notes for the CLEF 2007 Workshop, 2007. [2] F. Fernández and B. Montero-Fleta. La premodificación nominal en el ámbito de la informtica. Estudio contrastivo inglés-español. Universidad de Valencia, 2003. 3 http://multiwordnet.itc.it/online/multiwordnet.php 4

[3] S. Ferrández. Arquitectura multilingüe de sistemas de Búsqueda de Respuestas basada en ILI y Wikipedia. PhD thesis, University of Alicante, 2008. [4] S. Ferrández and A. Ferrández. The negative effect of machine translation on crosslingual question answering. In Computational Linguistics and Intelligent Text Processing, pages 494 505, 2007. [5] S. Ferrández, A. Ferrández, S. Roger, P. López-Moreno, and J. Peral. Brili, an english-spanish question answering system. In Proceedings of the International Multiconference on Computer Science and Information Technology, pages 23 29, 2006. [6] M. Martínez-Vázquez. Gramática contrastiva inglés-español. Servicio de publicaciones de la Universidad de Huelva, 1996. [7] Ruslan Mitkov. Anaphora Resolution. Longman, London, 2002. [8] S. Roger, S. Ferrández, A. Ferrández, J. Peral, F. Llopis, A. Aguilar, and D. Tomás. Aliqan, spanish qa system at clef-2005. In CLEF, pages 457 466, 2005. [9] S. Roger, K. Vila, A. Ferrández, M. Pardiño, J.M. Gómez, M. Puchol-Blasco, and J. Peral. Aliqan, spanish qa system at clef-2008. In CLEF, 2008. [10] R.M. Terol, P. Martínez-Barco, and M. Palomar. A knowledge based method for the medical question answering problem. In Computers in Biology and Medicine, volume 37, pages 1511 1521, 2007. [11] P. Vossen. EuroWordNet General Document. Part A. Final Document. EuroWordNet (LE2-4003, LE4-8328), 2002. 5