For a unified treatment of particle verbs

Similar documents
Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Double Double, Morphology and Trouble: Looking into Reduplication in Indonesian

LFG Semantics via Constraints

Adapting Stochastic Output for Rule-Based Semantics

A relational approach to translation

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

AN LFG ANALYSIS OF VERBAL MODIFIERS IN HUNGARIAN. Tibor Laczkó University of Debrecen. Proceedings of the LFG14 Conference

Improving coverage and parsing quality of a large-scale LFG for German

Switched Control and other 'uncontrolled' cases of obligatory control

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

The Development of Linking Theory in lfg

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

Interfacing Phonology with LFG

cmp-lg/ Jul 1995

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Construction Grammar. University of Jena.

Developing a TT-MCTAG for German with an RCG-based Parser

An Interactive Intelligent Language Tutor Over The Internet

The Pennsylvania State University. The Graduate School. College of the Liberal Arts THE TEACHABILITY HYPOTHESIS AND CONCEPT-BASED INSTRUCTION

THE VERB ARGUMENT BROWSER

Negation through reduplication and tone: implications for the LFG/PFM interface 1

Type-driven semantic interpretation and feature dependencies in R-LFG

Feature-Based Grammar

Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

PROJECTIONS AND GLUE FOR CLAUSE-UNION COMPLEX PREDICATES. Avery D Andrews The Australian National University. Proceedings of the LFG07 Conference

Heads and history NIGEL VINCENT & KERSTI BÖRJARS The University of Manchester

Derivational and Inflectional Morphemes in Pak-Pak Language

LING 329 : MORPHOLOGY

Constructions with Lexical Integrity *

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

! XLE: A First Walkthrough! Robustness techniques! Generation! Disambiguation! Applications: ! Provide detailed syntactic/semantic analyses

Procedia - Social and Behavioral Sciences 154 ( 2014 )

Age Effects on Syntactic Control in. Second Language Learning

Parsing of part-of-speech tagged Assamese Texts

Chapter 3: Semi-lexical categories. nor truly functional. As Corver and van Riemsdijk rightly point out, There is more

Specifying Logic Programs in Controlled Natural Language

Some Principles of Automated Natural Language Information Extraction

Building an HPSG-based Indonesian Resource Grammar (INDRA)

Control and Boundedness

The Interface between Phrasal and Functional Constraints

Indeterminacy by Underspecification Mary Dalrymple (Oxford), Tracy Holloway King (PARC) and Louisa Sadler (Essex) (9) was: ( case) = nom ( case) = acc

"f TOPIC =T COMP COMP... OBJ

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

Describing Motion Events in Adult L2 Spanish Narratives

Accurate Unlexicalized Parsing for Modern Hebrew

GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Pre-Processing MRSes

Which verb classes and why? Research questions: Semantic Basis Hypothesis (SBH) What verb classes? Why the truth of the SBH matters

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Aspectual Classes of Verb Phrases

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

A First-Pass Approach for Evaluating Machine Translation Systems

Coast Academies Writing Framework Step 4. 1 of 7

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Words come in categories

cambridge occasional papers in linguistics Volume 8, Article 3: 41 55, 2015 ISSN

Hindi Aspectual Verb Complexes

CS 598 Natural Language Processing

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

LINGUISTICS. Learning Outcomes (Graduate) Learning Outcomes (Undergraduate) Graduate Programs in Linguistics. Bachelor of Arts in Linguistics

The optimal placement of up and ab A comparison 1

Degree Qualification Profiles Intellectual Skills

The Discourse Anaphoric Properties of Connectives

Natural Language Processing. George Konidaris

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Efficient Normal-Form Parsing for Combinatory Categorial Grammar

An Introduction to the Minimalist Program

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

Pseudo-Passives as Adjectival Passives

Character Stream Parsing of Mixed-lingual Text

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Iraide Ibarretxe Antuñano Universidad de Zaragoza

Compositional Semantics

CHILDREN S POSSESSIVE STRUCTURES: A CASE STUDY 1. Andrew Radford and Joseph Galasso, University of Essex

A Grammar for Battle Management Language

On the Notion Determiner

Intension, Attitude, and Tense Annotation in a High-Fidelity Semantic Representation

English Language and Applied Linguistics. Module Descriptions 2017/18

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Dissertation Summaries. The Acquisition of Aspect and Motion Verbs in the Native Language (Aristotle University of Thessaloniki, 2014)

WHY SOLVE PROBLEMS? INTERVIEWING COLLEGE FACULTY ABOUT THE LEARNING AND TEACHING OF PROBLEM SOLVING

More Morphology. Problem Set #1 is up: it s due next Thursday (1/19) fieldwork component: Figure out how negation is expressed in your language.

Constraining X-Bar: Theta Theory

Procedia - Social and Behavioral Sciences 141 ( 2014 ) WCLTA Using Corpus Linguistics in the Development of Writing

Phonological and Phonetic Representations: The Case of Neutralization

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

Ch VI- SENTENCE PATTERNS.

CEFR Overall Illustrative English Proficiency Scales

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Lemmatization of Multi-word Lexical Units: In which Entry?

Spanish progressive aspect in stochastic OT

What Can Near Synonyms Tell Us? 1

Chapter 4: Valence & Agreement CSLI Publications

Transcription:

For a unified treatment of particle verbs Lionel Clément, Sekou Diao University of Bordeaux - France HeadLex 16, July 2016 Polish Academy of Sciences, Warsaw Introduction In English, as well as in other languages, there exists a class of verbs composed of a collocation between a verb, and a particle which appears in a distinct syntactic position: the particle verbs. It is not clear if particles are morphological or syntactic elements (Martin Forst, Tracy Holloway King and Tibor Laczkó (2010)). For the purpose of this paper, we will adopt a syntactic treatment of particles in this paper. Particle verb constructions can be compositional or idiomatic. In compositional constructions such as 1b, the meaning of the combination of the two morphosyntactic elements is partly predictable from the meaning of each separate element, whereas in idiomatic constructions 1a, the meaning of the combined elements is idiosyncratic, requiring a specific lexical entry for the idiomatic particle verbs. (1) a. The student gave it up. b. The student moved the box up. Following the analysis of verbal particles introduced in the English ParGram 1 grammar (Martin Forst, Tracy Holloway King and Tibor Laczkó (2010)), in this paper we try to develop an LFG analysis of particle verb constructions which seems to us to be closer to a general linguistic description of this phenomena and argue in favor of a lexically oriented approach to its formalization. We use the XLFG (Clément (2016)) parser/framework which provides us with tools to efficiently combine lexical entries as we will show in the next section. It is well known that compositional particle verb constructions may be productive (Villavicencio (2003)), a fact which can be difficult to handle in an electronic lexicon with a wide coverage. Our approach seems to gracefully handle this difficulty for computational linguistics: each lexical entry for non-compositional idiomatic particle verb contains only idiomatic information such as predicate argument structure and sub-categorization frame. It allows us to capture the fact that the argument structure of an idiomatic particle verb can differ from the argument structure of the same base verb without a particle. The major focus of our approach concerns the treatment of compositional particle verbs. To analyze these constructions, we use a ranking mechanism to select the non-compositional analysis by default when such an analysis is present in the lexicon. If such an entry is not found, the parser will generate a compositional analysis of the particle verb construction. Martin Forst, Tracy Holloway King and Tibor Laczkó note that systematically analyzing particle verbs as idiomatic constructions is a problem for the coverage of computational grammars, as every possible combination of a verb and a particle should explicitly be listed in the lexicon. However, some verb + particle combinations are highly productive and the particle may contribute the same meaning or the same discursive context in each case, it should therefore be more parsimonious if compositional constructions were generated on the fly by the parser. The approach we have taken to handle particle verb constructions combines these two strategies: we list all idiomatic particle verbs in the lexicon and generate compositional particle verb construction by combining the syntactic information contributed by both the verb and its particle. 1 Butt et al. (2002)

1 Idiomatic constructions Idiomatic constructions of particle verbs are those constructions where the meaning and the argument structure of the particle verb can not be derived from composing the meaning and argument structure of the verb and its particle. Idiomatic particle verbs must then be listed in the lexicon. (2) a. John gave Mary the book. b. John gave the book to Mary. c. John gave up playing the piano. d. John gave up his house. e. John gave up on her. As the argument structure of gave and gave up are different (as illustrated in 2) while the other morphosyntactic information such as tense, aspect, agreement etc.. are shared between the two verbs, only the sub-categorization frame of the particle verb gave up is listed in the lexicon. The remainder of the feature-structure of the particle verb is provided by the information in the lexical entry of gave. See section 3.1 for a technical account of this approach. 2 Productive constructions Productive constructions of particle verbs are those constructions where the meaning and argument structure of the particle verb is predictable from composing the meaning and argument structure of the verb and the particle. These constructions are highly productive in English, especially with adverbial particles such as up, down, by and new uses of verb + particle constructions in a productive setting are regularly appearing in corpora. It is therefore uneconomical to list all the potential uses of productive particle verb constructions in the lexicon. (3) a. John shot the ball. b. The pilot shot the plane down. The approach we have taken to handle the productive case is described in section 3.2. 3 Implementation with XLFG It 2 is impossible to unify two structures with distinct PRED features. This is the standard way of ensuring that each syntactic function is instantiated no more than once. This said, it is well known that complex predicate constructions is a phenomena where two distinct constituents contribute to the specification of a PRED value. To model such cases, XLFG supports the concatenation operator - that derives a PRED value by the combination of a lexeme and a prefix or a suffix lexeme. As a PRED is given by one lexical entry and corresponds to a specific predication for a verb, a combination between a prefix (resp. a suffix) for the particle and a PRED for the main verb corresponds to a particle verb. In order to describe only the lexeme of a PRED attribute but not the sub-categorization, XLFG allow us to use the attribute LEXEME instead of PRED. This attribute can combine with a PRED to form a complex predicate without altering the PRED sub-categorization frame. By contrast, the SUBCAT attribute is equivalent to the PRED attribute minus the lexeme. It allows us to describe only the sub-categorization of a complex predicate. In summary, PRED, LEXEME, and SUBCAT may be combined together to give a complex predicate. 3.1 Idiomatic particle verbs In the particular case where the meaning of a particle verb is not predictable from its components, namely the main verb and the particle, the sub-categorization may also not be always predictable. 2 This analysis is based on the 9.8.0 version of XLFG (Clément (2016)).

(4) a. He gave /a concert/a toy to a child/me his phone number/*on painting with oil/*on me/*fishing. b. They gave up /their personal possessions/*their personal possessions to a child/on painting with oil/on me/fishing. In such a case, XLFG provides for the possibility to override the sub-categorization given by the PRED attribute with the value of the SUBCAT attribute. Then, the SUBCAT attribute is used to describe the sub-categorization of an idiomatic entry. In order to introduce a new lexical entry for such a combination, XLFG makes available the # symbol followed by the new lexeme as follow: # GIVE UP [SUBCAT: < SUBJ, [XCOMP OBLon > Here, the # symbol allows us to create a lexical entry which has no associated morphological form in the lexicon, but a lexeme which is the result of the combination between the verb s lexeme and the particle lexeme part. Given the lexical entries for the main verb to give and the particle up encoded as follow: give V [PRED: GIVE<SUBJ, OBJ, [OBLto OBL>, tense: present up VERB PART [LEXEME: - UP The lexeme GIVE UP is generated on the fly by the parser thanks to the concatenation operator - presents on the particle s lexeme definition. The feature-structure resulting from unification is the following: [PRED: GIVE UP < SUBJ, [XCOMP OBLon >, tense: present the lexeme is GIVE UP, the concatenation of GIVE and UP, The sub-categorization is given by the # lexical entry, The others features are given by the unification of the three lexical entries (verb, particle, and verbal particle) Here 3, in summary, are the various combinations for unification between PRED, LEXEME and SUBCAT attributes in XLFG: PRED: X<Y>Z LEXEME: X SUBCAT: <Y>Z PRED: A<B>C PRED: X A<B U Y>C U Z PRED: X A<B>C PRED: A<Y>Z LEXEME: A PRED: X A<Y>Z LEXEME: X A PRED: A<Y>Z SUBCAT: <B>C PRED: X<B>C PRED: X<B>C none The lexical entries are ranked 4 in XLFG, and the idiomatic entries marked with a # symbol are preferred. This simple method allows us to override the compositional construction when an idiomatic entry exists. 3.2 Adverbial particle Another lexical entry is required to encode the adverbial particle. In such a case, the PRED value is the predicate of the verb itself without any modification. fly V [PRED: FLY<SUBJ>, tense: present; up VERB PART [locative: true; The result of unification for fly up is the following [PRED: FLY< SUBJ >, tense: present, locative: true 3 X A is the concatenation between X and A, depending on if X (vs. A) is a prefix or a suffix. 4 Rank is a value which permits to give precedence to a certain lexical entry over an other, with no statistical basis or formal links to Optimality Theory (we are planning to investigate the use of OT to resolve this issue). For now this value is assigned by the grammar engineer according to the specific language for which he is designing the grammar and relies on his linguistic intuitions.

3.3 An example of XLFG analysis Given the following sentences to parse: (5) a. He gave a toy to a child. b. He gave up on his car. c. The pilote flew up to 40,000 feet. A simplified 5 sample of XLFG lexicon entries are the following 6 : flew V [PRED: FLY<SUBJ, [OBLto>, tense: past; gave V [PRED: GIVE<SUBJ, OBJ, [OBLto OBL>, tense: past; up VERB PART [LEXEME: - UP // A particle which must be combined with another PRED VERB PART [locative: true; // A second entry for up : a locative particle # GIVE UP [SUBCAT: < SUBJ, [XCOMP OBLon > ; // A lexical entry for the idiomatic particle verb A simplified example of the XLFG analyses of these sentences are the following: [PRED GIVE<SUBJ, OBJ, [OBLto OBL> SUBJ [PRED PRO OBJ [PRED TOY OBLto [PRED CHILD [PRED GIVE UP<SUBJ, [XCOMP OBLon> SUBJ [PRED PRO OBLon [PRED CAR [PRED FLY<SUBJ, [OBLto> SUBJ [PRED PILOTE OBLto [PRED 40,000 FEET 4 Conclusion We have proposed an analysis of particle verbs which allows to construct with the same optimized lexicon non-compositional (idiomatic) and productive compositional particle verb constructions. In the latter case, we do not need any special mechanism to compose a verb and a semantically pertinent particle. In the case of idiomatic particle verb constructions, a mechanism specific to XLFG was used. This allowed us to propose a preferential choice of the idiomatic expression by adding a special entry in the lexicon. This way of prioritizing an idiomatic construction, if it exists, and fall back on a compositional construction by default, seems to be coherent with both the FLG formalism and an approach where the lexicon takes a prominent place in the analysis of such phenomena. The XLFG software allowed us to provide this analysis and to efficiently deal with the technical challenges involved in managing productive constructions of particle verbs in the development of an English LFG grammar. 5 We do not write the complete feature-structures to make the content more readable. 6 The XLFG comments start with // symbol.

References Alsina, Alex, Joan Bresnan, and Peter Sells. 1997. Complex predicates: Structure and theory. In A. Alsina, J. Bresnan, and P. Sells, eds., Complex Predicates, pages 1 12. Stanford, CA: CSLI Publications. Butt, Miriam, Tracy Holloway King, Hiroshi Masuichi, and Christian Rohrer. 2002. The parallel grammar project. In N. J. Carroll and R. Sutcliffe, eds., Proceedings of the Workshop on Grammar Engineering and Evaluation, pages 1 7. COLING02. Butt, Miriam, María-Eugenia Niño, and Frédérique Segond. 1999. Stanford, CA: CSLI Publications. A Grammar Writer s Cookbook. Clément, Lionel. 2016. XLFG Documentation https://hal.archives-ouvertes.fr/hal-01277648. Technical report, LaBRI. Kaplan, Ronald M., Tracy Holloway King, and John T. Maxwell III. 2002. Adapting existing grammars: The xle approach. In N. J. Carroll and R. Sutcliffe, eds., Proceedings of the Workshop on Grammar Engineering and Evaluation, pages 29 35. COLING02. Martin Forst, Tracy Holloway King and Tibor Laczkó. 2010. Particle Verbs in Computational LFGs: Issues from English, German, and Hungarian. In M. Butt and T. H. King, eds., The Proceedings of the LFG 10 Conference, pages 228 248. Ottawa, Canada. Villavicencio, Aline. 2003. Verb-particle constructions and lexical resources. In Proceedings of the ACL 2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment - Volume 18, MWE 03, pages 57 64. Stroudsburg, PA, USA: Association for Computational Linguistics.