Analysis and Reconstruction of Dictionary Definition Units
|
|
- Edward Blankenship
- 5 years ago
- Views:
Transcription
1 Analysis and Reconstruction of Dictionary Definition Units Chung-Won Seo and Key-Sun Choi Department of Computer Science KAIST/AITRC/KORTERM KAIST Kusong-dong, Yusong-ku, Taejon, , Republic of Korea Abstract In this paper, we analyze the dictionary definitions of verbs for finding units of a definition, and study the reconstruction of the dictionary definitions without ambiguity. For finding units of the definition, we analyze the dependency structures of definitions and make a relation between main verb and other syntactic units. We build a hierarchical structure by using the relation between a headword and a main verb of its definition. From that result, we reduce the verbs in the order of frequency to basic verbs that define other verbs. In the dictionary definitions, the uses of conjunctive verb endings are limited in their meanings. We restrict the use of conjunctive verb endings because they contain many errors and ambiguities in syntax and semantics. From the restricted conjunctive verb endings, we reconstruct the dictionary definitions without ambiguity. 1 Introduction Dictionaries are resources for giving lexical information like morphology, semantics and so on. In NLP process, it is important to make a good dictionary for its size and quality. When we define a new lexical entry, it is needed to give semantic category, usage, relations between pre-defined lexical entries and so on. We also need tools for defining and describing a lexical entry. The dictionary definition should be simple and not ambiguous in meanings and contain enough information. It has properties that restricted sentence patterns and pre-defined words like sub-languages. Sub-language is used for improving performance of the machine translation by restricting its lexicon and grammar (Ananiadou, et al.1995; Claire, et al. 2000). In the dictionary definition, we can make a definition of a new lexical entry from pre-defined words and semantic relations between the new entry and existing lexical entries (Lee, et al. 1998; German 1998; Buitelaar 1997). Sub-language for MT just has a restricted lexicon and grammar, but that for dictionary definitions needs basic verbs that describe other verbs and description patterns for definitions. In this paper, we analyze the definition of the verbs in Urimal Korean Unabridged Dictionary (1997) and find units for the verb definitions. From the relations between the headword and the main verb of the definition, we find basic verbs for definitions. We restrict the uses and meanings of the conjunctive verb-endings for resolving ambiguities of automatic processing. 2 Related Works Sub-languages are studied for improving machine translation s performance. In Adriaens (1994), Simplified English Grammar restricts its sentence patterns, lexicons and uses of the modification for human writer and provides SEG, editing and checking programs. In the case of the dictionary definition, we need to find description patterns and basic elements of the definitions. Choi, et al. (2000) gives a method to get the basic verbs for the morpheme dictionary and information retrieval. Their basic verbs are defined by frequencies of corpus and weights by human experts. Conjunctive verb endings connect two or more verbs in one sentence. They are similar with conjunctions. Lee (2000) gives semantic relations of conjunctions based on RST. However, conjunctive verb endings are subdivided by their usages. Hovy (1993) gives 62 discourse relations that were used before and the numbers referred by other researchers.
2 3 Verb Definitions Urimal Korean Unabridged Dictionary (1997) consists of about 204,350 lexical entries and 274,480 meanings. There are about 12,800 verb entries and 21,700 meanings. There are basic rules for writing definitions as followings (The National Academy of Korean Language, 2000). 1. Avoiding cyclic definition. A. Some compound words, irreplaceable main concepts and derivatives by affix are exceptions. 2. Prohibiting one word definition 3. Definition contains one sentence of meaning and additional explanations A. Definition: difference + genus. < Additional explanation> B. Additional explanation: special features, structures and usages of headword. It is optional 4. Exceptional cases of 3 A. When verb stems are combined with some propositions or endings. B. Dependence nouns C. Grammatical formations D. Adverb, unconjugation adjective and pronouns. 5. Some entries can end with an abbreviation term, a term of respect, a court term, an original term, and so on. Because a lexical entry is defined by difference + genus, the last word of the definition is important to describe concepts. In the verb definition, the last word tends to be the main verb in the definition. We determine basic verbs such that they define other verbs. First, we select verbs by frequency, and then we make a hierarchical structure between headwords and main verbs of the definitions. From that result, we select basic verbs that are in the top level. We look the words of high frequency on the KAIST Corpus (2000) for determining basic verbs. From the KAIST Corpus, we select 1,578 verbs that appear in rank 10,000 words. Table 1: words of high frequency A headword and main verb of the definitions have similar meanings and they differ each meaning by the other units of definition nubda (lie down): alh-geona hayeo ileona-ji moshada. [[ subordinate alh-geona hayeo (be ill) ] sentence ileona-ji moshada. (cannot rise)] bulreoileokida (excite): eoddeon maeum, heangdong, sangtae-reul ileona-ge hada. [[[[[ modifier eoddeon (certain) ] parallel maeum (mind) ] parallel heangdong (behavior)] object sangtae-reul (state) ] sentence ileona-ge hada (cause)] seanggaknada (remind): mueot-eul hago sipeun maeum-i ileonada. [[[[ object mueot-eul (something) ] modifier ha-go sipeun (wants to do) ] subject maeum-i (mind) ] sentence ileona-da (occur) ] Those three words are defined by the same verb ý [ileonada]. It can be differed by the caseframe, mood, voice, and conjunct verbs. 3.1 Basic verbs
3 From the new 170 words, we remove non-verb entries and get 96 verbs. From the intersection of [1] and 96 verbs definitions we can get new 40 verbs. For 40 verbs definitions, we can precede similar method. Figure 1: the relations between head verbs and main verb. The units of verb definitions consist of a main verb, an auxiliary verb sequence, other verbs connected by conjunctive verb endings, and the caseframe of main verbs. Table 2: the units of the verb definitions Head Main verb Concepts word Auxiliary verb Meanings of verb (negative, passive, and so on.) Caseframe Arguments Conjunctive Additional meaning verb ending When we determine basic verbs used to explain other verbs, it needs to make hierarchical relations of the verbs from headwords and main verbs. It has 350 common words and 500 words do not appear at definitions in the 1,578 verbs and their definitions. New 170 words appear only at definitions. 500 verbs are leaf entries of the verb hierarchy. Table 3: the intersections of the headwords and main verbs Only head words [0] Common [1] Only definition [2] Figure 2: the contraction process ý[ggieonda] (pour) [2] : ý æ ý[naedeonjida] (throw) [1] When we process three times repeat, most verbs are returned to verbs of [1] like ý [ggieonda]. Nine verbs remain. Following two verbs are not returned to [1]. ý[golhda] (suffer): ý[eongeolmeokda]. ý[eongeolmeokda] (suffer a big loss) ý[gubuleojida] (be curved): [gubutha] (bent) ý. Table 4: the result of the third application. Common words Only in definition <[1]+[2]> [3] We can build the 439 basic verbs for the 1,578 verbs of high frequency in KAIST corpus. 3.2 Conjunctive verb endings
4 The conjunctive verb endings contain many errors in morphological analysis and syntactic analysis. There are three kinds of the conjunctive verb ending (equal, subordinate, and assistant conjunctive verb endings). Most endings are used with more than two kinds of POS and they cause errors in automatic analysis. If sentences contain conjunctive verb endings, syntactic analysis accuracy is lowered very much. In the 1,578 verb definitions, 396 sentences contain the equal or subordinate conjunctive verb endings. It covers one of fourth. If it contains assistant conjunctive verb endings, it covers 87% of the definitions. For accurate analysis of the definitions, we must restrict the use of the conjunctive verb endings. Only 15 conjunctive verb endings cover 95% in definitions. The high frequent words are used more than two POSs and have more ambiguities than the low frequent words. Table 5: the distributions of conjunctive verb endings in the verb definitions. Equal and Conjunctive Total sentence subordinate Verb endings endings 396 (25%) 1373 (87%) 1578 Many conjunctive verb endings are used more than two POSs and meanings. The others without restriction can replace some endings, some can be replaced with some restrictions and some cannot be replaced. If we can give one ending to one POS and meaning, we can resolve the ambiguities for conjunctive verb endings. There are 170 conjunctive verb endings in the morpheme dictionary. In corpus, there are 280 endings, because they contain the transformed morphologies. In the dictionary definitions, 106 conjunctive endings are appeared. In the case of the verb definitions, there are only 44 conjunctive verb endings. We can reduce the number by the restriction of usages and meanings. Table 6: the usages of conjunctive verb endings. Figure 3: the surface and deep structures of the dictionary definitions. The conjunctive verb endings make a semantic relation between two verbs. We can contract the numbers of conjunctive verb endings by restricting their relations. First, we find the usages and meanings of each conjunctive verb endings and bind them with similar meanings. Some conjunctive verb endings belong to more than two relations. For the relations that belong to two or more conjunctive verb endings, it can be chosen one conjunctive verb endings with one relation. However, some relations contain only one verb endings and not replaceable with other verb endings. Those verb endings are only used for one POS. In the morphology level, they contain no ambiguity. For frequent verb endings like ~ [go], ~ [eo], ~ [ge], they can be replaced with other verb endings. ~ [eo] is used as
5 subordinate and assistant conjunctive verb endings. When we need to use ~ [eo] as subordinate verb endings, we can replace ~ [eo] with [eoseo] of the same meaning. When it is used as assistant verb endings, we do not replace any other verb endings. We can restrict ~ [eo] as assistant conjunctive ending and [eoseo] as subordinate ending of the meaning of conditions. ~ [go] and ~ [ge] also can be restricted by using with the same relations and replaceable endings. In the verb definitions, we can find 16 relations (sequence, cause/reason, purpose, background and so on.). <Table 7> shows the mapping between relations and verb endings. Table 7: the relations of conjunctive verb endings Relations Conjunctive Restriction Verb endings Sequence [geona] [deunji] Purpose [goja] [ryeogo] Purpose Precedence Reason /Cause Background Reason ~û(~ û) [ni, euni] [meuro] Excuse [neura] Contrast [neunde] Cause ~ý [dago] Condition () [eoseo/aseo] ~ (~ ) [eo / a] [neunde] Include effort Opposite reason Following state or action. From the relations, we can restrict the numbers of the conjunctive verb endings to 20 and resolve POS ambiguities. 4 Evaluation From the result of restricted conjunctive verb endings, we can make new 200 verb definitions that contain conjunctive verb endings. Those works are done by hand. We evaluate the accuracy of dependency parsing (Seo, 2001). Because the uses of conjunctive verbs endings are restricted, it can improve the accuracy of the parsing result. The 200 sentences consist of 1453 word phrases. The inputs for the dependency parser are raw sentences. For modified sentences, we add post-processors for conjunctive verb endings. For the dependency parser, we add the rules to determine the dependent of the node that includes conjunctive verb endings. We evaluate the sentence accuracy. Table 8: The result of parsing Accuracy Original definitions 105/200 (52.5%) Modified definitions 149/200 (74.5%) The sentence accuracy is improved by 22%. The original definitions include tagging errors of the conjunctive verb endings and ambiguities of subordinate clauses boundary. For modified definitions, we add some heuristics for conjunctive verb endings (determining the dependent of nodes that include conjunctive verb endings and boundaries of subordinate clauses). Most errors occur at the parallel noun phrases and the adverbs in the subordinate clauses or the relative clauses. From the result, we can conclude that the restriction of the conjunctive verb endings resolve ambiguities of syntax. We need to evaluate the differences of meanings between original definitions and modified definitions for verifying that the restrictions do not lose information. 5 Conclusion In this paper, we analyze the verb definitions and find the units of definitions. We determine basic verbs that define other verbs. We set the relations about the conjunctive verb endings for resolving
6 ambiguities. From those results, we can reconstruct the verb definitions. The dictionary definitions consist of the terms that represent differences and genus term. We can find basic verbs from the genus term. The genus terms of verb definitions are converged to the basic verbs. From the 1,578 definitions, we get 439 basic verbs. The conjunctive verb endings represent the relations between main verbs of definitions and the other events. Discourse relations can define those relations. In the dictionary definitions, the use of relations is limited. We restrict the use of conjunctive verb endings with 16 discourse relations. The restriction can resolve ambiguities because they have many ambiguities in syntax and meaning. From the basic verbs and restricted verb endings, we can reconstruct verb definitions without ambiguity. In future work, we plan to analyze the ambiguities of restrictions of lexical entries, and generate the definitions automatically. The basic verbs and its relations can be compared with thesaurus, and we can get the definitions from corpus by constructing the semantic hierarchy. From the result of verb definitions, it can be extended to the adverbs or nouns. References In Artificial Intelligence 63, Special Issue on Natural Language Processing. German Rigau (1998). Automatic Acquisition of Lexical Knowledge from MRDs, PhD Thesis, Departament de Llenguatges i Sistemes Inform`atics, Universitat Polit`ecnica de Catalunya. Hanguel Society, Ed. (1997). Urimal Korean Unabridged Dictionary, Eomungag. Ivan A. Sag and Thomas Wasow (1999). A Syntax Theory, CSLI publications. Korterm (2000) KAIST Corpus, Lee, Chungmin, Seungho Nam and Beom-mo Kang (1998). A Generative Approach to the Lexical Semantics of Korean Predicates, Korean Journal of Cognitive Science 9.3, Korean Society for Cognitive Science. Lee, Yu-Ri (2000). Text Summarization using Rhetorical Structure, MS. thesis, Dept. Of Computer Science, KAIST Seo, Chung-Won (2001). Dependency Parsing of Simple Korean Sentence Usings Verb Caseframe, MS. Thsis, Dept. of Electirical Engineering & Computer Science, KAIST The National Academy of Korean Language, (2000). Standard Korean Big Dictionary Editing Guids II, the National Academy of Korean Language Adriaens G. (1994). The LRE SECC Project: Simplified English Grammar and Style Correction in an MT Framework, in Proceedings of Linguistic Engineering Convention, Paris Ananiadou, S., Radford, I. and Tsujii, J. (1995). Sublanguage knowledge acquisition for hypertext optimization, Proceedings of NLPRS, Seoul. Buitelaar (1997). A Lexicon for Underspecified Semantic Tagging, Proceedings of ANLP 97. Choi Young-suk, Woon-jae Lee, and Key-sun Choi (2000). A Study on Verbs Statistics in Corpus, Proceedings of KLIP2000 Claire Grover, et al (2000). Designing a controlled language for interactive model checking, Proceedings of the Third International Workshop on Controlled Language Application. Eduar H. Hovy (1993). Automatic Discourse Generation using Discourse Structure Relations,
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationIntroduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.
to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about
More informationCS 598 Natural Language Processing
CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@
More informationAn Interactive Intelligent Language Tutor Over The Internet
An Interactive Intelligent Language Tutor Over The Internet Trude Heift Linguistics Department and Language Learning Centre Simon Fraser University, B.C. Canada V5A1S6 E-mail: heift@sfu.ca Abstract: This
More informationA Syllable Based Word Recognition Model for Korean Noun Extraction
are used as the most important terms (features) that express the document in NLP applications such as information retrieval, document categorization, text summarization, information extraction, and etc.
More informationThe College Board Redesigned SAT Grade 12
A Correlation of, 2017 To the Redesigned SAT Introduction This document demonstrates how myperspectives English Language Arts meets the Reading, Writing and Language and Essay Domains of Redesigned SAT.
More informationIntra-talker Variation: Audience Design Factors Affecting Lexical Selections
Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and
More informationWriting a composition
A good composition has three elements: Writing a composition an introduction: A topic sentence which contains the main idea of the paragraph. a body : Supporting sentences that develop the main idea. a
More informationAdvanced Grammar in Use
Advanced Grammar in Use A self-study reference and practice book for advanced learners of English Third Edition with answers and CD-ROM cambridge university press cambridge, new york, melbourne, madrid,
More informationHeuristic Sample Selection to Minimize Reference Standard Training Set for a Part-Of-Speech Tagger
Page 1 of 35 Heuristic Sample Selection to Minimize Reference Standard Training Set for a Part-Of-Speech Tagger Kaihong Liu, MD, MS, Wendy Chapman, PhD, Rebecca Hwa, PhD, and Rebecca S. Crowley, MD, MS
More informationGERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017
GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017 Instructor: Dr. Claudia Schwabe Class hours: TR 9:00-10:15 p.m. claudia.schwabe@usu.edu Class room: Old Main 301 Office: Old Main 002D Office hours:
More informationChapter 9 Banked gap-filling
Chapter 9 Banked gap-filling This testing technique is known as banked gap-filling, because you have to choose the appropriate word from a bank of alternatives. In a banked gap-filling task, similarly
More informationFormulaic Language and Fluency: ESL Teaching Applications
Formulaic Language and Fluency: ESL Teaching Applications Formulaic Language Terminology Formulaic sequence One such item Formulaic language Non-count noun referring to these items Phraseology The study
More informationA First-Pass Approach for Evaluating Machine Translation Systems
[Proceedings of the Evaluators Forum, April 21st 24th, 1991, Les Rasses, Vaud, Switzerland; ed. Kirsten Falkedal (Geneva: ISSCO).] A First-Pass Approach for Evaluating Machine Translation Systems Pamela
More informationLING 329 : MORPHOLOGY
LING 329 : MORPHOLOGY TTh 10:30 11:50 AM, Physics 121 Course Syllabus Spring 2013 Matt Pearson Office: Vollum 313 Email: pearsonm@reed.edu Phone: 7618 (off campus: 503-517-7618) Office hrs: Mon 1:30 2:30,
More informationSpecification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments
Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,
More informationLoughton School s curriculum evening. 28 th February 2017
Loughton School s curriculum evening 28 th February 2017 Aims of this session Share our approach to teaching writing, reading, SPaG and maths. Share resources, ideas and strategies to support children's
More informationEnhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities
Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Yoav Goldberg Reut Tsarfaty Meni Adler Michael Elhadad Ben Gurion
More informationCalifornia Department of Education English Language Development Standards for Grade 8
Section 1: Goal, Critical Principles, and Overview Goal: English learners read, analyze, interpret, and create a variety of literary and informational text types. They develop an understanding of how language
More informationSome Principles of Automated Natural Language Information Extraction
Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract
More informationDeveloping True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability
Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability Shih-Bin Chen Dept. of Information and Computer Engineering, Chung-Yuan Christian University Chung-Li, Taiwan
More informationContext Free Grammars. Many slides from Michael Collins
Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures
More informationDeveloping a TT-MCTAG for German with an RCG-based Parser
Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert University of Tübingen, Germany CNRS-LORIA, France LREC 2008,
More informationLinking Task: Identifying authors and book titles in verbose queries
Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,
More information1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class
If we cancel class 1/20 idea We ll spend an extra hour on 1/21 I ll give you a brief writing problem for 1/21 based on assigned readings Jot down your thoughts based on your reading so you ll be ready
More informationEnsemble Technique Utilization for Indonesian Dependency Parser
Ensemble Technique Utilization for Indonesian Dependency Parser Arief Rahman Institut Teknologi Bandung Indonesia 23516008@std.stei.itb.ac.id Ayu Purwarianti Institut Teknologi Bandung Indonesia ayu@stei.itb.ac.id
More informationA Framework for Customizable Generation of Hypertext Presentations
A Framework for Customizable Generation of Hypertext Presentations Benoit Lavoie and Owen Rambow CoGenTex, Inc. 840 Hanshaw Road, Ithaca, NY 14850, USA benoit, owen~cogentex, com Abstract In this paper,
More informationBULATS A2 WORDLIST 2
BULATS A2 WORDLIST 2 INTRODUCTION TO THE BULATS A2 WORDLIST 2 The BULATS A2 WORDLIST 21 is a list of approximately 750 words to help candidates aiming at an A2 pass in the Cambridge BULATS exam. It is
More informationFOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80.
CONTENTS FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8 УРОК (Unit) 1 25 1.1. QUESTIONS WITH КТО AND ЧТО 27 1.2. GENDER OF NOUNS 29 1.3. PERSONAL PRONOUNS 31 УРОК (Unit) 2 38 2.1. PRESENT TENSE OF THE
More informationConstraining X-Bar: Theta Theory
Constraining X-Bar: Theta Theory Carnie, 2013, chapter 8 Kofi K. Saah 1 Learning objectives Distinguish between thematic relation and theta role. Identify the thematic relations agent, theme, goal, source,
More informationPseudo-Passives as Adjectival Passives
Pseudo-Passives as Adjectival Passives Kwang-sup Kim Hankuk University of Foreign Studies English Department 81 Oedae-lo Cheoin-Gu Yongin-City 449-791 Republic of Korea kwangsup@hufs.ac.kr Abstract The
More informationA Domain Ontology Development Environment Using a MRD and Text Corpus
A Domain Ontology Development Environment Using a MRD and Text Corpus Naomi Nakaya 1 and Masaki Kurematsu 2 and Takahira Yamaguchi 1 1 Faculty of Information, Shizuoka University 3-5-1 Johoku Hamamatsu
More informationWhat the National Curriculum requires in reading at Y5 and Y6
What the National Curriculum requires in reading at Y5 and Y6 Word reading apply their growing knowledge of root words, prefixes and suffixes (morphology and etymology), as listed in Appendix 1 of the
More informationCross Language Information Retrieval
Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................
More informationDerivational and Inflectional Morphemes in Pak-Pak Language
Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes
More informationCase government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG
Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG Dr. Kakia Chatsiou, University of Essex achats at essex.ac.uk Explorations in Syntactic Government and Subcategorisation,
More informationOpportunities for Writing Title Key Stage 1 Key Stage 2 Narrative
English Teaching Cycle The English curriculum at Wardley CE Primary is based upon the National Curriculum. Our English is taught through a text based curriculum as we believe this is the best way to develop
More informationLinguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis
International Journal of Arts Humanities and Social Sciences (IJAHSS) Volume 1 Issue 1 ǁ August 216. www.ijahss.com Linguistic Variation across Sports Category of Press Reportage from British Newspapers:
More informationEdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar
EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,
More informationUniversal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses
Universal Grammar 1 evidence : 1. crosslinguistic investigation of properties of languages 2. evidence from language acquisition 3. general cognitive abilities 1. Properties can be reflected in a.) structural
More informationImproved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form
Orthographic Form 1 Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form The development and testing of word-retrieval treatments for aphasia has generally focused
More informationAQUA: An Ontology-Driven Question Answering System
AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.
More informationArizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS
Arizona s English Language Arts Standards 11-12th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS 11 th -12 th Grade Overview Arizona s English Language Arts Standards work together
More informationThe presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.
Lecture 4: OT Syntax Sources: Kager 1999, Section 8; Legendre et al. 1998; Grimshaw 1997; Barbosa et al. 1998, Introduction; Bresnan 1998; Fanselow et al. 1999; Gibson & Broihier 1998. OT is not a theory
More informationTHE VERB ARGUMENT BROWSER
THE VERB ARGUMENT BROWSER Bálint Sass sass.balint@itk.ppke.hu Péter Pázmány Catholic University, Budapest, Hungary 11 th International Conference on Text, Speech and Dialog 8-12 September 2008, Brno PREVIEW
More informationThe Discourse Anaphoric Properties of Connectives
The Discourse Anaphoric Properties of Connectives Cassandre Creswell, Kate Forbes, Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi Λ, Bonnie Webber y Λ University of Pennsylvania 3401 Walnut Street Philadelphia,
More informationBasic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.
Basic Syntax Doug Arnold doug@essex.ac.uk We review some basic grammatical ideas and terminology, and look at some common constructions in English. 1 Categories 1.1 Word level (lexical and functional)
More informationWords come in categories
Nouns Words come in categories D: A grammatical category is a class of expressions which share a common set of grammatical properties (a.k.a. word class or part of speech). Words come in categories Open
More informationModeling full form lexica for Arabic
Modeling full form lexica for Arabic Susanne Alt Amine Akrout Atilf-CNRS Laurent Romary Loria-CNRS Objectives Presentation of the current standardization activity in the domain of lexical data modeling
More informationBANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS
Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.
More informationCh VI- SENTENCE PATTERNS.
Ch VI- SENTENCE PATTERNS faizrisd@gmail.com www.pakfaizal.com It is a common fact that in the making of well-formed sentences we badly need several syntactic devices used to link together words by means
More informationENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist
Meeting 2 Chapter 7 (Morphology) and chapter 9 (Syntax) Today s agenda Repetition of meeting 1 Mini-lecture on morphology Seminar on chapter 7, worksheet Mini-lecture on syntax Seminar on chapter 9, worksheet
More informationProcedia - Social and Behavioral Sciences 154 ( 2014 )
Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 154 ( 2014 ) 263 267 THE XXV ANNUAL INTERNATIONAL ACADEMIC CONFERENCE, LANGUAGE AND CULTURE, 20-22 October
More informationTarget Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data
Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se
More informationSemi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.
Semi-supervised methods of text processing, and an application to medical concept extraction Yacine Jernite Text-as-Data series September 17. 2015 What do we want from text? 1. Extract information 2. Link
More information5 th Grade Language Arts Curriculum Map
5 th Grade Language Arts Curriculum Map Quarter 1 Unit of Study: Launching Writer s Workshop 5.L.1 - Demonstrate command of the conventions of Standard English grammar and usage when writing or speaking.
More informationApplications of memory-based natural language processing
Applications of memory-based natural language processing Antal van den Bosch and Roser Morante ILK Research Group Tilburg University Prague, June 24, 2007 Current ILK members Principal investigator: Antal
More informationIntensive English Program Southwest College
Intensive English Program Southwest College ESOL 0352 Advanced Intermediate Grammar for Foreign Speakers CRN 55661-- Summer 2015 Gulfton Center Room 114 11:00 2:45 Mon. Fri. 3 hours lecture / 2 hours lab
More informationA Corpus-based Evaluation of a Domain-specific Text to Knowledge Mapping Prototype
A Corpus-based Evaluation of a Domain-specific Text to Knowledge Mapping Prototype Rushdi Shams Department of Computer Science and Engineering, Khulna University of Engineering & Technology (KUET), Bangladesh
More informationPrediction of Maximal Projection for Semantic Role Labeling
Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba
More informationReading Grammar Section and Lesson Writing Chapter and Lesson Identify a purpose for reading W1-LO; W2- LO; W3- LO; W4- LO; W5-
New York Grade 7 Core Performance Indicators Grades 7 8: common to all four ELA standards Throughout grades 7 and 8, students demonstrate the following core performance indicators in the key ideas of reading,
More informationToday we examine the distribution of infinitival clauses, which can be
Infinitival Clauses Today we examine the distribution of infinitival clauses, which can be a) the subject of a main clause (1) [to vote for oneself] is objectionable (2) It is objectionable to vote for
More informationTaught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,
First Grade Standards These are the standards for what is taught in first grade. It is the expectation that these skills will be reinforced after they have been taught. Taught Throughout the Year Foundational
More informationApproaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque
Approaches to control phenomena handout 6 5.4 Obligatory control and morphological case: Icelandic and Basque Icelandinc quirky case (displaying properties of both structural and inherent case: lexically
More informationChunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.
NLP Lab Session Week 8 October 15, 2014 Noun Phrase Chunking and WordNet in NLTK Getting Started In this lab session, we will work together through a series of small examples using the IDLE window and
More informationNatural Language Processing. George Konidaris
Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans
More informationA Case Study: News Classification Based on Term Frequency
A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center
More informationUniversiteit Leiden ICT in Business
Universiteit Leiden ICT in Business Ranking of Multi-Word Terms Name: Ricardo R.M. Blikman Student-no: s1184164 Internal report number: 2012-11 Date: 07/03/2013 1st supervisor: Prof. Dr. J.N. Kok 2nd supervisor:
More informationParticipate in expanded conversations and respond appropriately to a variety of conversational prompts
Students continue their study of German by further expanding their knowledge of key vocabulary topics and grammar concepts. Students not only begin to comprehend listening and reading passages more fully,
More informationLanguage Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus
Language Acquisition Fall 2010/Winter 2011 Lexical Categories Afra Alishahi, Heiner Drenhaus Computational Linguistics and Phonetics Saarland University Children s Sensitivity to Lexical Categories Look,
More informationImpact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment
Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment Takako Aikawa, Lee Schwartz, Ronit King Mo Corston-Oliver Carmen Lozano Microsoft
More informationGrammars & Parsing, Part 1:
Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review
More informationUnderlying and Surface Grammatical Relations in Greek consider
0 Underlying and Surface Grammatical Relations in Greek consider Sentences Brian D. Joseph The Ohio State University Abbreviated Title Grammatical Relations in Greek consider Sentences Brian D. Joseph
More informationA Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many
Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.
More informationDeveloping Grammar in Context
Developing Grammar in Context intermediate with answers Mark Nettle and Diana Hopkins PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United
More informationControlled vocabulary
Indexing languages 6.2.2. Controlled vocabulary Overview Anyone who has struggled to find the exact search term to retrieve information about a certain subject can benefit from controlled vocabulary. Controlled
More informationCharacter Stream Parsing of Mixed-lingual Text
Character Stream Parsing of Mixed-lingual Text Harald Romsdorfer and Beat Pfister Speech Processing Group Computer Engineering and Networks Laboratory ETH Zurich {romsdorfer,pfister}@tik.ee.ethz.ch Abstract
More informationModeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures
Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures Ulrike Baldewein (ulrike@coli.uni-sb.de) Computational Psycholinguistics, Saarland University D-66041 Saarbrücken,
More informationDerivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.
Final Exam (120 points) Click on the yellow balloons below to see the answers I. Short Answer (32pts) 1. (6) The sentence The kinder teachers made sure that the students comprehended the testable material
More informationChapter 4: Valence & Agreement CSLI Publications
Chapter 4: Valence & Agreement Reminder: Where We Are Simple CFG doesn t allow us to cross-classify categories, e.g., verbs can be grouped by transitivity (deny vs. disappear) or by number (deny vs. denies).
More informationAn Introduction to the Minimalist Program
An Introduction to the Minimalist Program Luke Smith University of Arizona Summer 2016 Some findings of traditional syntax Human languages vary greatly, but digging deeper, they all have distinct commonalities:
More informationSpecifying a shallow grammatical for parsing purposes
Specifying a shallow grammatical for parsing purposes representation Atro Voutilainen and Timo J~irvinen Research Unit for Multilingual Language Technology P.O. Box 4 FIN-0004 University of Helsinki Finland
More informationCommon Core State Standards for English Language Arts
Reading Standards for Literature 6-12 Grade 9-10 Students: 1. Cite strong and thorough textual evidence to support analysis of what the text says explicitly as well as inferences drawn from the text. 2.
More informationCEFR Overall Illustrative English Proficiency Scales
CEFR Overall Illustrative English Proficiency s CEFR CEFR OVERALL ORAL PRODUCTION Has a good command of idiomatic expressions and colloquialisms with awareness of connotative levels of meaning. Can convey
More informationAdjectives tell you more about a noun (for example: the red dress ).
Curriculum Jargon busters Grammar glossary Key: Words in bold are examples. Words underlined are terms you can look up in this glossary. Words in italics are important to the definition. Term Adjective
More informationSenior Stenographer / Senior Typist Series (including equivalent Secretary titles)
New York State Department of Civil Service Committed to Innovation, Quality, and Excellence A Guide to the Written Test for the Senior Stenographer / Senior Typist Series (including equivalent Secretary
More informationWritten by: YULI AMRIA (RRA1B210085) ABSTRACT. Key words: ability, possessive pronouns, and possessive adjectives INTRODUCTION
STUDYING GRAMMAR OF ENGLISH AS A FOREIGN LANGUAGE: STUDENTS ABILITY IN USING POSSESSIVE PRONOUNS AND POSSESSIVE ADJECTIVES IN ONE JUNIOR HIGH SCHOOL IN JAMBI CITY Written by: YULI AMRIA (RRA1B210085) ABSTRACT
More informationMultiple case assignment and the English pseudo-passive *
Multiple case assignment and the English pseudo-passive * Norvin Richards Massachusetts Institute of Technology Previous literature on pseudo-passives (see van Riemsdijk 1978, Chomsky 1981, Hornstein &
More informationTHE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING
SISOM & ACOUSTICS 2015, Bucharest 21-22 May THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING MarilenaăLAZ R 1, Diana MILITARU 2 1 Military Equipment and Technologies Research Agency, Bucharest,
More informationLEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)
More informationCOMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR
COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR ROLAND HAUSSER Institut für Deutsche Philologie Ludwig-Maximilians Universität München München, West Germany 1. CHOICE OF A PRIMITIVE OPERATION The
More informationSyntactic and Lexical Simplification: The Impact on EFL Listening Comprehension at Low and High Language Proficiency Levels
ISSN 1798-4769 Journal of Language Teaching and Research, Vol. 5, No. 3, pp. 566-571, May 2014 Manufactured in Finland. doi:10.4304/jltr.5.3.566-571 Syntactic and Lexical Simplification: The Impact on
More informationScienceDirect. Malayalam question answering system
Available online at www.sciencedirect.com ScienceDirect Procedia Technology 24 (2016 ) 1388 1392 International Conference on Emerging Trends in Engineering, Science and Technology (ICETEST - 2015) Malayalam
More informationPrentice Hall Literature Common Core Edition Grade 10, 2012
A Correlation of Prentice Hall Literature Common Core Edition, 2012 To the New Jersey Model Curriculum A Correlation of Prentice Hall Literature Common Core Edition, 2012 Introduction This document demonstrates
More informationELD CELDT 5 EDGE Level C Curriculum Guide LANGUAGE DEVELOPMENT VOCABULARY COMMON WRITING PROJECT. ToolKit
Unit 1 Language Development Express Ideas and Opinions Ask for and Give Information Engage in Discussion ELD CELDT 5 EDGE Level C Curriculum Guide 20132014 Sentences Reflective Essay August 12 th September
More informationL1 and L2 acquisition. Holger Diessel
L1 and L2 acquisition Holger Diessel Schedule Comparing L1 and L2 acquisition The role of the native language in L2 acquisition The critical period hypothesis [student presentation] Non-linguistic factors
More informationEmmaus Lutheran School English Language Arts Curriculum
Emmaus Lutheran School English Language Arts Curriculum Rationale based on Scripture God is the Creator of all things, including English Language Arts. Our school is committed to providing students with
More informationA Comparison of Two Text Representations for Sentiment Analysis
010 International Conference on Computer Application and System Modeling (ICCASM 010) A Comparison of Two Text Representations for Sentiment Analysis Jianxiong Wang School of Computer Science & Educational
More information11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation
tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each
More information