The Alpino Grammar and Lexicon
|
|
- Arthur Cameron
- 5 years ago
- Views:
Transcription
1 The Alpino Grammar and Lexicon RSAVG, Section 243 & 244 Daniël de Kok
2 Overview Broad overview of Alpino The lexicon The grammar Problem section : Modifiers Verb movement
3 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators
4 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar
5 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer)
6 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer) Part-of-speech tagger (Hidden Markov Model)
7 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer) Part-of-speech tagger (Hidden Markov Model) Parser
8 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer) Part-of-speech tagger (Hidden Markov Model) Parser Generator
9 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer) Part-of-speech tagger (Hidden Markov Model) Parser Generator Treebanking
10 Broad overview of Alpino Hdrug: environment for developing grammars/parsers/generators Lexicon/Grammar Tokenizer (finite state transducer) Part-of-speech tagger (Hidden Markov Model) Parser Generator Treebanking Largely written in Prolog and C/C++
11 Parsing in Alpino Parsing in Alpino Lexical analysis Left-corner parser (with goal weakening and memoization) Disambiguation (with n-best unpacking)
12 Generation in Alpino Generation in Alpino: Lexical prediction Chart generator Fluency ranking (with n-best unpacking)
13 Lexicon
14 Introduction Alpino uses a strongly-lexicalized grammar: Word descriptions have detailed syntactic information A relatively small set of simple grammar rules Words are represented by attribute-value structures
15 Example v subj np agr sg case nom dt 1 sc np [ case acc dt 2 ] dt dt hd [ lex verft ] su 1 obj1 2 Figure 1:Simplified attribute-value structure for verft present tense second/third person inflection of verven to paint
16 Static lexicon M:M mapping: words < tag, stem > Each word is associated with a complex tag The attribute-value structure is constructed from the tag For example: # Inflection # Root # Tag advies advies noun(het,count,sg) adviezen advies noun(het,count,pl) Dictionary size (2012): ~180,000 mappings ~190,000 mappings for named entities Stored in a finite state automaton
17 Special entries Some specific combinations of words cannot be derived using generic grammar rules Consider: helemaal niemand (lit: at all nobody ) * helemaal iemand * helemaal hij helemaal is an intensifier for the pronoun niemand Since it cannot apply to other pronouns: no generalization
18 Special entries (2) The Alpino lexicon contains special entries for such word combinations Since the dependency structure cannot be derived productively, needs to be pre-packaged The tag of helemaal niemand: with_dt( pronoun(nwh,thi,sg,de,both,indef,strpro), dt(np,[ mod=l(helemaal,adverb,advp,0,1), hd=l(niemand, pronoun(nwh,thi,sg,de,both,indef,strpro),1,2)])) Require extra handling in parsing and generation
19 Productive lexicon The productive lexicon analyzes: Compounds Ordinals Unknown words
20 Grammar
21 Introduction The Alpino grammar is written as Prolog rules: %% Rule head template grammar_rule(identifier,lhs,rhs) %% Head for np -> det n grammar_rule(np_det_n,np,[det,n]) Approximately 850 construction-specific rules
22 Example rule grammar_rule(n_adj_n, NP, [ AP, N ] ) :- unmarked_n_adj_n_struct(n,ap,np) unmarked_n_adj_n_struct(n,ap,np) :- n_adj_n_struct(n,ap,np), AP:agr <=> N:agr n_adj_n_struct(n,ap,np) :- NP => n, AP => a, N => n, % reduce spur amb in 'ziek zijn' NP:subn => ~sub_indef_verb, ap_arg(ap), N:wh => nwh, % de hoeveelste overwinning was dat? NP:wh <=> AP:wh, %%
23 Principles Rules use general predicates/principles that are shared between different rules Example: percolate the dependency structure of the projected head on the left-hand side of the rule
24 Rule use Rules are purely declarative (besides a few exceptions) Calling the goal np_det_n grammar_rule(np_det_n,np,[det,n]) Will instantiate NP, Det, and N with attribute-value structures Consequence: we can store the grammar rules as Prolog facts Ideal: exploit first-argument indexing in parsing and generation
25 Handling modifiers
26 Introduction Unfortunately, sometimes a context-free backbone and dependency structure do not match as nicely as we would like Frequently occuring example: modifiers
27 Problem #1 Consider: (1) omdat hij met plezier een taart heeft gebakken because he with pleasure a cake has baked met plezier is a modifier of gebakken, however in the phrase structure it is attached to a phrase headed by the auxiliary heeft
28 Problem #1 34 CHAPTER 2 ATTRIBUTE-VALUE GRAMMAR IN ALPINO sbar comp vp omdat vproj np vproj hij pp vproj met plezier np een taart vproj vc v vc heeft gebakken Figure 222: Derivation tree of omdat hij met plezier een taart heeft gebakken because he with pleasure a cake has baked Category types are used as node
29 Problem #2 In cases where the syntactic head is also the head in the dependency structure, we want the head to have the full modifier list For example: (2) de mooie snelle groene auto the beautiful fast green car auto should have a modifier list containing mooie, snelle, and groene However, Prolog does not allow us to expand a well-formed list
30 Problem #2 24 AVG IN THE ALPINO SYSTEM 35 np det 4:n de a 3:n mooie a 2:n snelle a 1:n groene auto Figure 223: Derivation tree for the phrase de mooie snelle groene auto the beautiful fast green car Rule identifiers are replaced by category types
31 Solutions problem #2 1 Use a diference list for modifiers: apply a difference list append for each modifier that is found and unify the tail with the empty list at the maximal projection Mods = [Hole1] %% n:1 Hole1 = [groene Hole2] %% n:2 Hole2 = [snelle Hole3] %% n:3 Hole3 = [mooie Hole4] %% n:4 Hole4 = [] %% np
32 Solutions problem #2 1 Use a diference list for modifiers: apply a difference list append for each modifier that is found and unify the tail with the empty list at the maximal projection Mods = [Hole1] %% n:1 Hole1 = [groene Hole2] %% n:2 Hole2 = [snelle Hole3] %% n:3 Hole3 = [mooie Hole4] %% n:4 Hole4 = [] %% np 2 Use two separate attributes in the attribute-value structure for modifier collection (cmod) and the final list of modifiers (mod) The final list is reentrant among the categories and is unified at a maximal projection
33 Solution #2 used to collect modifiers and mod is the list of all modifiers that were collected at the maximal projection Figure 224 gives an impression of how these two attributes work for the derivation in Figure [ ] cmod 1 mod 1 np de cmod 1 mooie, snelle, groene mod 1 n mooie cmod snelle, groene mod 1 n snelle cmod groene mod 1 n groene auto
34 Solving problem #1 The second solution also solves problem #1: where appropriate syntactic heads should hand over modifiers appropriately Example: add_modifier_to_dt([],sign) :- Sign => v, Sign:vtype => vaux, Sign:dt:mod => [], Sign:mods <=> GiveMods, Sign:deps <=> [VC _], VC => vc, VC:mods <=> VCMods, VC:cmods <=> VCCMods, alpino_wappend:wappend(givemods,vccmods,vcmods)
35 As an attribute-value structure v deps vc [ mods 1 cmods 2 ] _ mods 3 vtype vaux dt dt [ mod ] wappend( 3, 2, 1 )
36 Verb gaps
37 Finite verb movement (3) omdat ik hem het boek heb gegeven because I him the book have given
38 Finite verb movement (5) omdat ik hem het boek heb gegeven because I him the book have given (6) ik heb hem het boek gegeven I have him the book given
39 Finite verb movement (7) omdat ik hem het boek heb gegeven because I him the book have given (8) ik heb hem het boek gegeven I have him the book given Usual analysis: Dutch has a verb-final word order, in main clauses the finite verb moves to the second position
40 Verb movement in Alpino Many different approaches: Continuous constituents Discontinuous constituents Approach in Alpino: When a finite verb is found, assert a verb gap item with the necessary syntactic information Not very declarative, but efficient
41 Subordinate clause max xp(sbar) sbar(vp) omdat vp vpx vpx vproj vp arg v(np) np pron weak vp arg v(np) ik np pron weak vp arg v(np) hem np det n vproj vc het boek v v v heb vc vb vb v gegeven
42 Main clause max xp(root) non wh topicalization(np) np pron weak o(e) imp ik heb v2 vp vproj vpx vproj vp arg v(np) np pron weak vp arg v(np) hem np det n vproj vc het boek v v v vgap vc vb vb v gegeven
43 Main clause without auxiliary max xp(root) non wh topicalization(np) np pron weak o(e) imp ik geef v2 vp vproj vpx vproj vp arg v(np) np pron weak vp arg v(np) hem np det n vproj vc het boek vc vb vb v vgap
44 The end
Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities
Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Yoav Goldberg Reut Tsarfaty Meni Adler Michael Elhadad Ben Gurion
More informationDeveloping a TT-MCTAG for German with an RCG-based Parser
Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert University of Tübingen, Germany CNRS-LORIA, France LREC 2008,
More informationGrammars & Parsing, Part 1:
Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review
More informationIntroduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.
to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about
More informationThe presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.
Lecture 4: OT Syntax Sources: Kager 1999, Section 8; Legendre et al. 1998; Grimshaw 1997; Barbosa et al. 1998, Introduction; Bresnan 1998; Fanselow et al. 1999; Gibson & Broihier 1998. OT is not a theory
More informationBasic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1
Basic Parsing with Context-Free Grammars Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Announcements HW 2 to go out today. Next Tuesday most important for background to assignment Sign up
More informationChapter 4: Valence & Agreement CSLI Publications
Chapter 4: Valence & Agreement Reminder: Where We Are Simple CFG doesn t allow us to cross-classify categories, e.g., verbs can be grouped by transitivity (deny vs. disappear) or by number (deny vs. denies).
More information11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation
tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each
More informationAccurate Unlexicalized Parsing for Modern Hebrew
Accurate Unlexicalized Parsing for Modern Hebrew Reut Tsarfaty and Khalil Sima an Institute for Logic, Language and Computation, University of Amsterdam Plantage Muidergracht 24, 1018TV Amsterdam, The
More informationAdapting Stochastic Output for Rule-Based Semantics
Adapting Stochastic Output for Rule-Based Semantics Wissenschaftliche Arbeit zur Erlangung des Grades eines Diplom-Handelslehrers im Fachbereich Wirtschaftswissenschaften der Universität Konstanz Februar
More informationSome Principles of Automated Natural Language Information Extraction
Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract
More informationTreebank mining with GrETEL. Liesbeth Augustinus Frank Van Eynde
Treebank mining with GrETEL Liesbeth Augustinus Frank Van Eynde GrETEL tutorial - 27 March, 2015 GrETEL Greedy Extraction of Trees for Empirical Linguistics Search engine for treebanks GrETEL Greedy Extraction
More informationWords come in categories
Nouns Words come in categories D: A grammatical category is a class of expressions which share a common set of grammatical properties (a.k.a. word class or part of speech). Words come in categories Open
More informationCS 598 Natural Language Processing
CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@
More information2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases
POS Tagging Problem Part-of-Speech Tagging L545 Spring 203 Given a sentence W Wn and a tagset of lexical categories, find the most likely tag T..Tn for each word in the sentence Example Secretariat/P is/vbz
More informationContext Free Grammars. Many slides from Michael Collins
Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures
More informationCase government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG
Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG Dr. Kakia Chatsiou, University of Essex achats at essex.ac.uk Explorations in Syntactic Government and Subcategorisation,
More informationThe building blocks of HPSG grammars. Head-Driven Phrase Structure Grammar (HPSG) HPSG grammars from a linguistic perspective
Te building blocks of HPSG grammars Head-Driven Prase Structure Grammar (HPSG) In HPSG, sentences, s, prases, and multisentence discourses are all represented as signs = complexes of ponological, syntactic/semantic,
More informationTowards a MWE-driven A* parsing with LTAGs [WG2,WG3]
Towards a MWE-driven A* parsing with LTAGs [WG2,WG3] Jakub Waszczuk, Agata Savary To cite this version: Jakub Waszczuk, Agata Savary. Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]. PARSEME 6th general
More informationSpecifying a shallow grammatical for parsing purposes
Specifying a shallow grammatical for parsing purposes representation Atro Voutilainen and Timo J~irvinen Research Unit for Multilingual Language Technology P.O. Box 4 FIN-0004 University of Helsinki Finland
More informationUniversal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses
Universal Grammar 1 evidence : 1. crosslinguistic investigation of properties of languages 2. evidence from language acquisition 3. general cognitive abilities 1. Properties can be reflected in a.) structural
More informationBANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS
Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.
More informationSom and Optimality Theory
Som and Optimality Theory This article argues that the difference between English and Norwegian with respect to the presence of a complementizer in embedded subject questions is attributable to a larger
More informationSyntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm syntax: from the Greek syntaxis, meaning setting out together
More informationFeature-Based Grammar
8 Feature-Based Grammar James P. Blevins 8.1 Introduction This chapter considers some of the basic ideas about language and linguistic analysis that define the family of feature-based grammars. Underlying
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationCitation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.
University of Groningen Formalizing the minimalist program Veenstra, Mettina Jolanda Arnoldina IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF if you wish to cite from
More informationBasic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.
Basic Syntax Doug Arnold doug@essex.ac.uk We review some basic grammatical ideas and terminology, and look at some common constructions in English. 1 Categories 1.1 Word level (lexical and functional)
More informationLTAG-spinal and the Treebank
LTAG-spinal and the Treebank a new resource for incremental, dependency and semantic parsing Libin Shen (lshen@bbn.com) BBN Technologies, 10 Moulton Street, Cambridge, MA 02138, USA Lucas Champollion (champoll@ling.upenn.edu)
More informationThe Interface between Phrasal and Functional Constraints
The Interface between Phrasal and Functional Constraints John T. Maxwell III* Xerox Palo Alto Research Center Ronald M. Kaplan t Xerox Palo Alto Research Center Many modern grammatical formalisms divide
More informationConstruction Grammar. University of Jena.
Construction Grammar Holger Diessel University of Jena holger.diessel@uni-jena.de http://www.holger-diessel.de/ Words seem to have a prototype structure; but language does not only consist of words. What
More information1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class
If we cancel class 1/20 idea We ll spend an extra hour on 1/21 I ll give you a brief writing problem for 1/21 based on assigned readings Jot down your thoughts based on your reading so you ll be ready
More informationTowards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la
Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing Grzegorz Chrupa la A dissertation submitted in fulfilment of the requirements for the award of Doctor of Philosophy (Ph.D.)
More informationImproving coverage and parsing quality of a large-scale LFG for German
Improving coverage and parsing quality of a large-scale LFG for German Christian Rohrer, Martin Forst Institute for Natural Language Processing (IMS) University of Stuttgart Azenbergstr. 12 70174 Stuttgart,
More informationInleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3
Inleiding Taalkunde Docent: Paola Monachesi Blok 4, 2001/2002 Contents 1 Syntax 2 2 Phrases and constituent structure 2 3 A minigrammar of Italian 3 4 Trees 3 5 Developing an Italian lexicon 4 6 S(emantic)-selection
More informationA Graph Based Authorship Identification Approach
A Graph Based Authorship Identification Approach Notebook for PAN at CLEF 2015 Helena Gómez-Adorno 1, Grigori Sidorov 1, David Pinto 2, and Ilia Markov 1 1 Center for Computing Research, Instituto Politécnico
More informationApproaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque
Approaches to control phenomena handout 6 5.4 Obligatory control and morphological case: Icelandic and Basque Icelandinc quirky case (displaying properties of both structural and inherent case: lexically
More informationLNGT0101 Introduction to Linguistics
LNGT0101 Introduction to Linguistics Lecture #11 Oct 15 th, 2014 Announcements HW3 is now posted. It s due Wed Oct 22 by 5pm. Today is a sociolinguistics talk by Toni Cook at 4:30 at Hillcrest 103. Extra
More informationHyperedge Replacement and Nonprojective Dependency Structures
Hyperedge Replacement and Nonprojective Dependency Structures Daniel Bauer and Owen Rambow Columbia University New York, NY 10027, USA {bauer,rambow}@cs.columbia.edu Abstract Synchronous Hyperedge Replacement
More informationDomain Adaptation for Parsing
Domain Adaptation for Parsing Barbara Plank CLCG The work presented here was carried out under the auspices of the Center for Language and Cognition Groningen (CLCG) at the Faculty of Arts of the University
More informationVERB MOVEMENT The Status of the Weak Pronouns in Dutch
VERB MOVEMENT 115 2 Clitics in Dutch In this section, and in the following sections, I will provide positive evidence in support of the hypothesis that the functional projections in Dutch are head initial.
More informationcmp-lg/ Jul 1995
A CONSTRAINT-BASED CASE FRAME LEXICON ARCHITECTURE 1 Introduction Kemal Oazer and Okan Ylmaz Department of Computer Engineering and Information Science Bilkent University Bilkent, Ankara 0, Turkey fko,okang@cs.bilkent.edu.tr
More informationTHE VERB ARGUMENT BROWSER
THE VERB ARGUMENT BROWSER Bálint Sass sass.balint@itk.ppke.hu Péter Pázmány Catholic University, Budapest, Hungary 11 th International Conference on Text, Speech and Dialog 8-12 September 2008, Brno PREVIEW
More informationENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist
Meeting 2 Chapter 7 (Morphology) and chapter 9 (Syntax) Today s agenda Repetition of meeting 1 Mini-lecture on morphology Seminar on chapter 7, worksheet Mini-lecture on syntax Seminar on chapter 9, worksheet
More information"f TOPIC =T COMP COMP... OBJ
TREATMENT OF LONG DISTANCE DEPENDENCIES IN LFG AND TAG: FUNCTIONAL UNCERTAINTY IN LFG IS A COROLLARY IN TAG" Aravind K. Joshi Dept. of Computer & Information Science University of Pennsylvania Philadelphia,
More informationHeuristic Sample Selection to Minimize Reference Standard Training Set for a Part-Of-Speech Tagger
Page 1 of 35 Heuristic Sample Selection to Minimize Reference Standard Training Set for a Part-Of-Speech Tagger Kaihong Liu, MD, MS, Wendy Chapman, PhD, Rebecca Hwa, PhD, and Rebecca S. Crowley, MD, MS
More informationAlpino: accurate, robust, wide coverage computational analysis of Dutch. Gertjan van Noord University of Groningen
Alpino: accurate, robust, wide coverage computational analysis of Dutch Gertjan van Noord University of Groningen Alpino: accurate, robust, wide coverage parsing of Dutch 1 Joint work with: Leonoor van
More informationParasitic participles and ellipsis in VP-focus pseudoclefts. Jan-Wouter Zwart
Parasitic participles and ellipsis in VP-focus pseudoclefts Jan-Wouter Zwart Paper presented at the 31st Comparative Germanic Syntax Workshop Stellenbosch, December 3, 2016 1. Introduction This paper discusses
More informationInformatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy
Informatics 2A: Language Complexity and the Chomsky Hierarchy September 28, 2010 Starter 1 Is there a finite state machine that recognises all those strings s from the alphabet {a, b} where the difference
More informationProof Theory for Syntacticians
Department of Linguistics Ohio State University Syntax 2 (Linguistics 602.02) January 5, 2012 Logics for Linguistics Many different kinds of logic are directly applicable to formalizing theories in syntax
More informationSpecifying Logic Programs in Controlled Natural Language
TECHNICAL REPORT 94.17, DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF ZURICH, NOVEMBER 1994 Specifying Logic Programs in Controlled Natural Language Norbert E. Fuchs, Hubert F. Hofmann, Rolf Schwitter
More informationRefining the Design of a Contracting Finite-State Dependency Parser
Refining the Design of a Contracting Finite-State Dependency Parser Anssi Yli-Jyrä and Jussi Piitulainen and Atro Voutilainen The Department of Modern Languages PO Box 3 00014 University of Helsinki {anssi.yli-jyra,jussi.piitulainen,atro.voutilainen}@helsinki.fi
More informationEdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar
EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,
More informationPrediction of Maximal Projection for Semantic Role Labeling
Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba
More informationUpdate on Soar-based language processing
Update on Soar-based language processing Deryle Lonsdale (and the rest of the BYU NL-Soar Research Group) BYU Linguistics lonz@byu.edu Soar 2006 1 NL-Soar Soar 2006 2 NL-Soar developments Discourse/robotic
More informationThe Discourse Anaphoric Properties of Connectives
The Discourse Anaphoric Properties of Connectives Cassandre Creswell, Kate Forbes, Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi Λ, Bonnie Webber y Λ University of Pennsylvania 3401 Walnut Street Philadelphia,
More informationA relational approach to translation
A relational approach to translation Rémi Zajac Project POLYGLOSS* University of Stuttgart IMS-CL /IfI-AIS, KeplerstraBe 17 7000 Stuttgart 1, West-Germany zajac@is.informatik.uni-stuttgart.dbp.de Abstract.
More informationDEVELOPMENT OF A MULTILINGUAL PARALLEL CORPUS AND A PART-OF-SPEECH TAGGER FOR AFRIKAANS
DEVELOPMENT OF A MULTILINGUAL PARALLEL CORPUS AND A PART-OF-SPEECH TAGGER FOR AFRIKAANS Julia Tmshkina Centre for Text Techitology, North-West University, 253 Potchefstroom, South Africa 2025770@puk.ac.za
More informationTHE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES
THE INTERNATIONAL JOURNAL OF HUMANITIES & SOCIAL STUDIES PRO and Control in Lexical Functional Grammar: Lexical or Theory Motivated? Evidence from Kikuyu Njuguna Githitu Bernard Ph.D. Student, University
More informationMODELING DEPENDENCY GRAMMAR WITH RESTRICTED CONSTRAINTS. Ingo Schröder Wolfgang Menzel Kilian Foth Michael Schulz * Résumé - Abstract
T.A.L., vol. 38, n o 1, pp. 1 30 MODELING DEPENDENCY GRAMMAR WITH RESTRICTED CONSTRAINTS Ingo Schröder Wolfgang Menzel Kilian Foth Michael Schulz * Résumé - Abstract Parsing of dependency grammar has been
More informationAnalysis of Probabilistic Parsing in NLP
Analysis of Probabilistic Parsing in NLP Krishna Karoo, Dr.Girish Katkar Research Scholar, Department of Electronics & Computer Science, R.T.M. Nagpur University, Nagpur, India Head of Department, Department
More informationWriting a composition
A good composition has three elements: Writing a composition an introduction: A topic sentence which contains the main idea of the paragraph. a body : Supporting sentences that develop the main idea. a
More informationUniversity of Groningen. Topics in Corpus-Based Dutch Syntax Beek, Leonoor Johanneke van der
University of Groningen Topics in Corpus-Based Dutch Syntax Beek, Leonoor Johanneke van der IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from
More informationCharacter Stream Parsing of Mixed-lingual Text
Character Stream Parsing of Mixed-lingual Text Harald Romsdorfer and Beat Pfister Speech Processing Group Computer Engineering and Networks Laboratory ETH Zurich {romsdorfer,pfister}@tik.ee.ethz.ch Abstract
More informationNatural Language Processing. George Konidaris
Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans
More informationConstraining X-Bar: Theta Theory
Constraining X-Bar: Theta Theory Carnie, 2013, chapter 8 Kofi K. Saah 1 Learning objectives Distinguish between thematic relation and theta role. Identify the thematic relations agent, theme, goal, source,
More informationChunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.
NLP Lab Session Week 8 October 15, 2014 Noun Phrase Chunking and WordNet in NLTK Getting Started In this lab session, we will work together through a series of small examples using the IDLE window and
More informationParsing natural language
Rochester Institute of Technology RIT Scholar Works Theses Thesis/Dissertation Collections 1983 Parsing natural language Leonard E. Wilcox Follow this and additional works at: http://scholarworks.rit.edu/theses
More informationarxiv:cmp-lg/ v1 7 Jun 1997 Abstract
Comparing a Linguistic and a Stochastic Tagger Christer Samuelsson Lucent Technologies Bell Laboratories 600 Mountain Ave, Room 2D-339 Murray Hill, NJ 07974, USA christer@research.bell-labs.com Atro Voutilainen
More informationSINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,
More informationAnnotation Projection for Discourse Connectives
SFB 833 / Univ. Tübingen Penn Discourse Treebank Workshop Annotation projection Basic idea: Given a bitext E/F and annotation for F, how would the annotation look for E? Examples: Word Sense Disambiguation
More informationAdvanced Topics in HPSG
Advanced Topics in HPSG Andreas Kathol Adam Przepiórkowski Jesse Tseng 1 Introduction This chapter presents a survey of some of the major topics that have received attention from an HPSG perspective since
More informationSemantic Inference at the Lexical-Syntactic Level for Textual Entailment Recognition
Semantic Inference at the Lexical-Syntactic Level for Textual Entailment Recognition Roy Bar-Haim,Ido Dagan, Iddo Greental, Idan Szpektor and Moshe Friedman Computer Science Department, Bar-Ilan University,
More informationTaught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,
First Grade Standards These are the standards for what is taught in first grade. It is the expectation that these skills will be reinforced after they have been taught. Taught Throughout the Year Foundational
More informationControl and Boundedness
Control and Boundedness Having eliminated rules, we would expect constructions to follow from the lexical categories (of heads and specifiers of syntactic constructions) alone. Combinatory syntax simply
More informationPre-Processing MRSes
Pre-Processing MRSes Tore Bruland Norwegian University of Science and Technology Department of Computer and Information Science torebrul@idi.ntnu.no Abstract We are in the process of creating a pipeline
More informationThe Smart/Empire TIPSTER IR System
The Smart/Empire TIPSTER IR System Chris Buckley, Janet Walz Sabir Research, Gaithersburg, MD chrisb,walz@sabir.com Claire Cardie, Scott Mardis, Mandar Mitra, David Pierce, Kiri Wagstaff Department of
More informationESSLLI 2010: Resource-light Morpho-syntactic Analysis of Highly
ESSLLI 2010: Resource-light Morpho-syntactic Analysis of Highly Inflected Languages Classical Approaches to Tagging The slides are posted on the web. The url is http://chss.montclair.edu/~feldmana/esslli10/.
More informationHans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZTI INF W. Germany. (2) [S' [NP who][s does he try to find [NP e]]s IS' $=~
The Treatment of Movement-Rules in a LFG-Parser Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZT NF W. Germany n this paper we propose a way of how to treat longdistance movement phenomena
More informationParsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank
Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank Dan Klein and Christopher D. Manning Computer Science Department Stanford University Stanford,
More informationThe Indiana Cooperative Remote Search Task (CReST) Corpus
The Indiana Cooperative Remote Search Task (CReST) Corpus Kathleen Eberhard, Hannele Nicholson, Sandra Kübler, Susan Gundersen, Matthias Scheutz University of Notre Dame Notre Dame, IN 46556, USA {eberhard.1,hnichol1,
More informationIntra-talker Variation: Audience Design Factors Affecting Lexical Selections
Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and
More informationAdjectives tell you more about a noun (for example: the red dress ).
Curriculum Jargon busters Grammar glossary Key: Words in bold are examples. Words underlined are terms you can look up in this glossary. Words in italics are important to the definition. Term Adjective
More informationA Framework for Customizable Generation of Hypertext Presentations
A Framework for Customizable Generation of Hypertext Presentations Benoit Lavoie and Owen Rambow CoGenTex, Inc. 840 Hanshaw Road, Ithaca, NY 14850, USA benoit, owen~cogentex, com Abstract In this paper,
More informationOn the Notion Determiner
On the Notion Determiner Frank Van Eynde University of Leuven Proceedings of the 10th International Conference on Head-Driven Phrase Structure Grammar Michigan State University Stefan Müller (Editor) 2003
More informationStructure-Preserving Extraction without Traces
Empirical Issues in Syntax and Semantics 5 O. Bonami & P. Cabredo Hofherr (eds.) 2004, pp. 27 44 http://www.cssp.cnrs.fr/eiss5 Structure-Preserving Extraction without Traces Wesley Davidson 1 Introduction
More informationUnderlying and Surface Grammatical Relations in Greek consider
0 Underlying and Surface Grammatical Relations in Greek consider Sentences Brian D. Joseph The Ohio State University Abbreviated Title Grammatical Relations in Greek consider Sentences Brian D. Joseph
More informationTarget Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data
Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se
More informationPOS tagging of Chinese Buddhist texts using Recurrent Neural Networks
POS tagging of Chinese Buddhist texts using Recurrent Neural Networks Longlu Qin Department of East Asian Languages and Cultures longlu@stanford.edu Abstract Chinese POS tagging, as one of the most important
More informationEAGLE: an Error-Annotated Corpus of Beginning Learner German
EAGLE: an Error-Annotated Corpus of Beginning Learner German Adriane Boyd Department of Linguistics The Ohio State University adriane@ling.osu.edu Abstract This paper describes the Error-Annotated German
More informationDerivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.
Final Exam (120 points) Click on the yellow balloons below to see the answers I. Short Answer (32pts) 1. (6) The sentence The kinder teachers made sure that the students comprehended the testable material
More informationCh VI- SENTENCE PATTERNS.
Ch VI- SENTENCE PATTERNS faizrisd@gmail.com www.pakfaizal.com It is a common fact that in the making of well-formed sentences we badly need several syntactic devices used to link together words by means
More informationHindi Aspectual Verb Complexes
Hindi Aspectual Verb Complexes HPSG-09 1 Introduction One of the goals of syntax is to termine how much languages do vary, in the hope to be able to make hypothesis about how much natural languages can
More informationEnsemble Technique Utilization for Indonesian Dependency Parser
Ensemble Technique Utilization for Indonesian Dependency Parser Arief Rahman Institut Teknologi Bandung Indonesia 23516008@std.stei.itb.ac.id Ayu Purwarianti Institut Teknologi Bandung Indonesia ayu@stei.itb.ac.id
More informationArgument structure and theta roles
Argument structure and theta roles Introduction to Syntax, EGG Summer School 2017 András Bárány ab155@soas.ac.uk 26 July 2017 Overview Where we left off Arguments and theta roles Some consequences of theta
More informationLearning Methods in Multilingual Speech Recognition
Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex
More informationA Computational Evaluation of Case-Assignment Algorithms
A Computational Evaluation of Case-Assignment Algorithms Miles Calabresi Advisors: Bob Frank and Jim Wood Submitted to the faculty of the Department of Linguistics in partial fulfillment of the requirements
More informationConstructions with Lexical Integrity *
Constructions with Lexical Integrity * Ash Asudeh, Mary Dalrymple, and Ida Toivonen Carleton University & Oxford University abstract Construction Grammar holds that unpredictable form-meaning combinations
More informationThe Role of the Head in the Interpretation of English Deverbal Compounds
The Role of the Head in the Interpretation of English Deverbal Compounds Gianina Iordăchioaia i, Lonneke van der Plas ii, Glorianna Jagfeld i (Universität Stuttgart i, University of Malta ii ) Wen wurmt
More informationSEMAFOR: Frame Argument Resolution with Log-Linear Models
SEMAFOR: Frame Argument Resolution with Log-Linear Models Desai Chen or, The Case of the Missing Arguments Nathan Schneider SemEval July 16, 2010 Dipanjan Das School of Computer Science Carnegie Mellon
More information