Natural Languages Analysis in Machine Translation (MT) based on the STCG (STRING-TREE CORRESPONDENCE GRAMMAR)
|
|
- Hugo Daniel
- 6 years ago
- Views:
Transcription
1 Natural Laguages Aalysis i Machie Traslatio (MT) based o the STCG (STRING-TREE CORRESPONDENCE GRAMMAR) Tag Eya Kog, Zahari Yusoff Uit Terjemaha Melalui Komputer Pusat Pegajia Sais Komputer Uiversiti Sais Malaysia Mide, Pulau Piag, Malaysia. [ eyakog@cs.usm.my ad zari@cs.usm.my] 0. Abstract The Strig-Tree Correspodece Grammar (STCG) [1] is a grammar formalism for defiig: a set of strigs (a laguage), a set of trees (valid represetatio/iterpretatio structures), the mappig betwee the two (to be iterpreted for aalysis & geeratio). The formalism is argued to be a totally declarative grammar formalism that ca associate, to strigs i a laguage, arbitrary tree structures as desired by the grammar writer to be the liguistic represetatio structures of the strigs. More importatly is the facility to specify the correspodece betwee the strig ad the associated tree i a very atural maer. These features are very much desired i grammar writig, i particular for the treatmet of certai liguistic pheomea which are 'o-stadard', amely featurisatio, lexicalisatio ad crossed depedecies [2,3]. Furthermore, a grammar writte i this way aturally iherits the desired property of bi-directioality (i fact o-directioality [4]) such that the same grammar ca be iterpreted for both aalysis ad geeratio. I this paper, we ivestigate the properties of the STCG for iterpretatio towards aalysis (as is uderstood withi the cotext of Machie Traslatio (MT)). Other tha usig STCG grammars as specificatios for the automatic geeratio of aalysis programs i the Specialised Laguages for Liguistic Programmig (SLLPs) of MT systems (a study reported i [5,6]), the work also cetres aroud the specificatio of a geeral aalyser/parser for the STCG. The proposed STCG aalyser is capable of mimickig some very useful features i various cotextfree parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the compact way of represetig possible parse trees for ambiguous seteces, such as the oe see i [8]. Though ot reported i this paper, we ote that the proposed aalyser also provide a atural way for hadlig the kid of awkward pheomea metioed above (amely lexicalisatio, featurisatio, ad worst of all, crossed depedecies) while at the same time retaiig much of the efficiecy of stadard cotext-free parsig algorithms (a study reported i [2,3]). 1. The STCG Formalism The Strig-Tree Correspodece Grammar is a declarative grammar formalism that ca be used to describe the correspodece betwee strigs of terms ad trees. I particular, liguistic rules are writte with utteraces as the strig of terms (heceforth STRING) ad the correspodig represetative liguistic structures as the tree (heceforth TREE). Figure 1 gives a idicatio of a full STCG rule. The structure of the TREE is totally specified by the liguist ad is ot costraied by ay applicatio of rules (as i the case for the parse tree i the classical cotext free grammar). I a rule, the mai correspodece is first declared: i the example, the STRING #1.v.#2.part (with #1 ad #2 beig strig variables, ie. variables which are istatiable to strigs of terms) is set to correspod to the TREE with root ode S (where ad are forest variables, ie. variables that ca be istatiated to lists of subtrees). The mai-corr(espodece) is followed by a declaratio of subcorrespodeces (o the right had 261
2 side) betwee substrigs of the STRING ad subtrees of the TREE, each of which possibly havig a list of refereces (rule ames). For example, the sub-corr(espodece) betwee the substrig #1 ad the subtree rooted at the ode 1 refers to the rules R..., the latter beig other rules i the grammar. This referece is a mechaism by meas of which the strig ad forest variables metioed earlier are fully istatiated via a operatio called iificatio [9,10] resultig i a correspodece betwee explicit strigs of terms ad ad trees, both without variables. I actual fact, the mai-corr as well as the sub-co specified i the rule are formally recorded i terms of a Structured Strig Tree Correspodece (SSTC) trasparet to the liguist [11] as illustrated i figure 2, where a give correspodece may be oprojective (eg. with discotiuous costituets) as is the case for the odev(part) i the example. Note also that the particle is chose (by the liguist) to be represeted as a collectio of features i the ode v - a case of featurisatio. Mai-Corr. 0"'"..ThP 1,/./.11) v(part) #1.v.#2.part with : R1 I very simple terms, a strig to tree correspodece i the STCG ca be viewed as aalogous to the mathematical defiitio of a relatio betwee iteger umbers as i the example give o the right. Here, a relatio (i this case a fuctio) f is defied i terms of fier subrelatios accordig to the subdomais. Sub-Corr. #1 with : R v(part) v.part = pick, etc. paa = up, etc. Figure 1. (0/a_d) Nff..11*.VP (- /a_b) (0/b e), kpart) (b_c+d_e/ (- /c_d) b_c+d_e) $13 #1. v. #2 part a bb cc dd e Figure 2. 1 #2 with : R -3 x<3 f(x)= x +5 3<x<5 x 55_X A set of STCG rules form a grammar, some of which are axiom rules (ie. start rules or rules cotaiig axiom trees, as i the axiom or the start symbol S i the classical cotext free grammar). With the sematics of the rules beig as idicated above, a grammar thus defies a laguage of strigs, a laguage of represetatio trees, ad the correspodece betwee elemets of the two laguages/sets. It is this set of strig-tree correspodeces that ca be iterpreted for both aalysis ad geeratio. 2. Natural Laguages Aalysis i MT Based o the STCG Iitially, the STCG was desiged to serve as a specificatio laguage for writig grammars i MT such that the specificatios writte i the STCG grammar formalism ca the be coded (maually) ito the liguistic programs for aalysis ad geeratio writte i the SLLPs of itegrated MT systems. Some substatial work have also bee carried out to automate this process, amely towards the automatic geeratio of aalysis programs i the MT systems ARIANE [12] ad JEMAH [13] from grammars writte i the STCG formalism (see for example [5,6]). However, due to certai limitatios i the existig SLLPs for the realisatio of a proper implemetatio of a STCG aalyser (as discussed i [2]), we have decided istead to look ito the desig of a aalyser which ca directly iterpret the STCG grammar The Fudametal Desig of the STCG Aalyser As we have see above, a STCG grammar actually defies a set of SSTCs i a way quite similar to the defiitio of a mathematical fuctio. I evaluatig a mathematical fuctio, if the fuctio is defied i terms of other sub-fuctios the it ca oly be completely evaluated after all its sub-fuctios have bee evaluated ad retur with the appropriate values. We ca view the STCG aalysis process i the same maer where, by takig the iput strig/setece as their STRING, the set of explicit SSTCs defied by the axiom rules of a grammar are costructed based o the resultat sub-sstcs defied by the referece rules of these axiom rules. Sice the 262
3 referece rules of the axiom rules may i tur refer to other rules, they may also retur the completed SSTCs oly after their respective referece rules have bee completed. This referece process will termiate whe all remaiig sub-sstcs evaluated are defied by subcorrespodeces which do ot refer to ay other rule, amely the 'lexical-sstcs', which must match with the iput words (the o-lexical SSTCs are called 'phrasal- SSTCs'). We illustrate this i the followig aalysis of the iput strig "He picks the ball up" with respect to a grammar cosistig of rule R1 give i figure 1 ad rules R1, R3 give i figure 3. The rule R1 is give as a axiom rule. The aalysis process begis with the evaluatio of the geeral SSTC defied by the axiom rule R1, which i tur leads to the evaluatio of two other sub-sstcs defied by the referece rules R1, R3 as illustrated i figure 4. mai-corr with : R1 mai-corr d/\. with : R3 Figure 3. sub-corr 1 = Joh, ball, he,..., etc. sub-corr the,etc. = ball,etc. VP vpar (l/t; #A. v. #B. Pa (1aa_bbcc5 with : RI - Apply rule R I - Apply rule R3 - Apply rule RI VP (0/1_5) (0/0_1) v(pai (1 2+4_5/ (0/2 4) 1=2+4_5)...".1111"" (2_3/2_3 ) ko_4/ ((1_1 /0_ I) j-le. picks. the. ball. LID 0_ _4 4_5 with : R I r #11 b_c with : R b d dc with : R3 Phrasal-SSTC (0/0_1) I (0_1/0_1) kit 01 with R I (o12_4) ded...""1"17 t (2_3/2_th3e).(3b_a411/3_4) 2_3 3_4 with : R... (0/2 4) v(part) ti Ski b_d Lexical-SSTC v (part) icks. u 1_2 4_5 1/10. hail _3 3_4 He 01 a b picks 1 2 b_d d_c c_5 the ball up 2_3 3_4 4_5 Figure 4. a /b picks 1 2 t d d_c c_5 he ball up _3 3_4 4_5 I the diagram above (o the left), the aalysis process expads the SSTC defied by the axiom rule ito a strig of sub-sstcs, which is further expaded ito aother strig of sub-sstcs util it caot be expaded ay further, which is whe the strig of sub-sstcs cosists oly of lexical-sstcs. The strig of lexical-sstcs is the matched with the words i the iput strig. Note that the matchig eed ot be i a projective maer, as ca be see i this particular example, where the lexical-sstcs are matched to the words i the iput strig i a crossed serial maer - a case of crossed depedecies. I order to keep track of such o- 263
4 projective correspodeces, we itroduce the use of idex variables to record the iterval correspodig to each symbol appearig i the STRING (as illustrated o the right). I [2], we proposed a desig of the STCG aalysis algorithm which is capable of mimickig some very useful features i various cotext-free parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the represetatio of shared forest i term of a STCG grammar rules which is i fact followig the approach adopted i [8] as illustrated i the ext sectio. 2.2 Multiple Results of aalysis for ambiguous iput setece The example setece give above is uambiguous, ad thus correspods to oly a sigle represetatio tree. However, atural laguage grammars are kow to be i the class of highly ambiguous grammars, ad as such, there may be umerous represetatio trees geerated for a sigle setece i the laguage described. Istead of storig each represetatio tree separately i the set of SSTCs defiig the correspodeces betwee the give setece ad all its possible represetatio trees, we should try to represet all these i a space-efficiet maer. I the figure give below, we preset a compact way of represetig a set of SSTCs correspods to a ambiguous setece by meas of a AND-OR graph of rules - similar to the techique used by [8]. For example, the two SSTCs: VP (0/ 13) (0/0_1 ) I V FP (013_6) ( 1_2/1_2) (0/2_3) p (0/4_6) ( 0_ 1/()_ I ) I (3_4/3_4) /4 (2_3/2 3) de ' (4_5/4_5) (5_6/S 6) ruh'14.?1 (0_1) (0/0_6) _6) (0_ ) ( 1_2/1_2) (OPT 6) PP (2_3/2_3) (0/3_6) (3_4/34) de t (4_5/4_57(56:6/5-6) e,.rs.4te rat() with : RTC Figure 5:Two liguistic represetatios of the setece Joh saw Mary i the boat. ca be factorised ito a AND-OR graph of rules R2, R3, R5, RPP (give below) ad rules R1, R3 (give i figure 3) i the followig maer: RIP I (Joh) (saw) (Mary) P ) R3 (i " De t R I (the) (boat) Figure 6 : A AND-OR Graph of STCG grammar rules. Mai-Corr. Sup-corr. Ne1)& V NIP #A. v. #B with : R2 with : R 1,RIR5 with : R I.R3,R5 Mai-Corr. Sub-Corr. S PP 1 EA #A.#B with : R2,R3 with : RPP with : R3 pa Mai-Corr. Sub-Corr. p $ lip *IA itg la with : with : R5 R I,R3 with : RPP Mai-Corr. p with : RPP Sub-Corr. with : R I,R3,R5 264
5 3. Cocludig Remarks Recetly, efficiet cotext-free parsig methods such as the LR parser ad Earley's Algorithm have bee referred to extesively i implemetig parsers for most of the formalisms used i the field of NLP. I a effort to retai the efficiecy of stadard cotext-free parsig algorithms, most recet declarative formalisms are typically restricted by the costrait of strig cocateatio i cotext-free grammars which allows a setece to be systematically decomposed so that the parsig process ca be idexed by the subparts of that decompositio (the substrigs). However, it has also bee widely recogised that the cocateatio restrictio of CFG ca be problematic i hadlig pheomea such as lexicalisatio, featurisatio, ad especially crossed depedecies. As a alterative, we propose the STCG formalism which allows for a more 'atural' way of specifiyig the strigs of the laguage beig described, their correspodig liguistically motivated represetatio trees, ad the correspodece betwee the two, where the correspodece eed ot be projective ad hece appropriate for the said pheomea. Eve though the stadard CF parsig methods caot be adopted directly i the aalysis of a iput setece with respect of a STCG grammar, due to the STRING patters of the STCG which eed ot submit to the cocateatio restrictio of CFG, i this paper we preset the geeral layout (due to the space costrait, however iterested readers may get more ails i [2]) of a aalyser for the STCG which is capable of mimickig some very useful features i various cotext-free parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the compact way of represetig possible parse trees for ambiguous seteces, such as the oe see i [8]. Furthermore, we have also provided a atural way for hadlig the kid of awkward pheomea such as lexicalisatio, featurisatio, ad worst of all, crossed depedecies, while at the same time retaiig much of the efficiecy of stadard cotext-free parsig algoritms [2,3]. REFERENCES [ 1 ] Zahari Y., Strig-Tree Correspodece Grammar: a declarative grammar formalism for defiig the correspodece betwee strigs of terms ad tree structures, proceedigs of the 3rd Coferece of the Europea Chapter of the ACL, Copehage, April [2] Tag Eya Kog, Natural laguages Aalysis i machie traslatio (MT) based o the STCG, PhD thesis, Uiversiti Sais Malaysia, Peag, March [3] Tag Eya Kog, Zahari Y., Hadlig Crossed Depedecies with the STCG, proceedigs of Natural Laguage Processig Pacific Rim Symposium (NLPRS'95), Sofitel Ambassador Hotel, Seoul, Korea, Dec. 4-6, [4] Yves Lepage, Parsig ad Geeratig Cotext-Sesitive Laguages with Correspodece Iificatio Grammars, proceedigs of the Natural Laguage Processig Pacific Rim Symposium (NLPRS'91), Sigapore, Nov [5] Zahari Yusoff, Tag Eya Kog, Geeratio of aalysis programs i ROBRA (ARIANE) From Strig-Tree Correspodece Grammars (or a Strategy for Aalysis i machie traaslatio), Proceedigs of the 3rd Machie Traslatio Summit, Washigto, D.C., July,1991. [6] Zahari Y., Tag Eya Kog, Strig-Tree Correspodece Grammars as a base for the automatic geeratio of aalysis programs i machie traaslatio, proceedigs of the Iteratioal Coferece o Curret Issues i Computatioal Liguistics, Peag, Jue [7] J. Earley, A efficiet catext-free parsig algorithm, Commuicatios of the ACM, Vol. 13, Num. 2, Feb 1970, pp [8] Lag, B., Towards a Uiform Formal Framework for Parsig, I : Curret Issues i Parsig Techology, M. Tomita (ed.), Kluwer Academic Publishers, 1991, pp [9] Zahari Y., Strategies ad heuristics i the aalysis of atural laguages i machie traslatio, PhD thesis, Uiversiti Sais Malaysia, Peag, March [10] Y.Lepage, U systeme de grammaires correspodacielles d'iificatio, these de Docteur, IMAG, Uiversite Joseph Fourier, Greoble, Jue [11] Zahari Yusoff, Christia Boitet, Represetatio trees ad strig-tree correspodeces, proceedigs of the 12th Iteratioal Coferece o Computatioal Liguistics, COLING-88, Budapest, August 1988, pp [12] Ch.Boitet, P.Guillaume, M.Quezel-Ambruaz, Le poit sur ARIANE-78, debut 1982 (DSE-I ), vol.], part.] : le logiciel, GETA, avril [13] Tog Loog Cheog, The JEMAH System : Referece Maual, UTMK documet, USM, Peag,
6 266
Natural language processing implementation on Romanian ChatBot
Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION Natural laguage processig implemetatio o Romaia ChatBot RALF FABIAN, MARCU ALEXANDRU-NICOLAE Departmet for Iformatics
More information'Norwegian University of Science and Technology, Department of Computer and Information Science
The helpful Patiet Record System: Problem Orieted Ad Kowledge Based Elisabeth Bayega, MS' ad Samso Tu, MS2 'Norwegia Uiversity of Sciece ad Techology, Departmet of Computer ad Iformatio Sciece ad Departmet
More informationE-LEARNING USABILITY: A LEARNER-ADAPTED APPROACH BASED ON THE EVALUATION OF LEANER S PREFERENCES. Valentina Terzieva, Yuri Pavlov, Rumen Andreev
Titre du documet / Documet title E-learig usability : A learer-adapted approach based o the evaluatio of leaer's prefereces Auteur(s) / Author(s) TERZIEVA Valetia ; PAVLOV Yuri (1) ; ANDREEV Rume (2) ;
More informationarxiv: v1 [cs.dl] 22 Dec 2016
ScieceWISE: Topic Modelig over Scietific Literature Networks arxiv:1612.07636v1 [cs.dl] 22 Dec 2016 A. Magalich, V. Gemmetto, D. Garlaschelli, A. Boyarsky Uiversity of Leide, The Netherlads {magalich,
More informationManagement Science Letters
Maagemet Sciece Letters 4 (24) 2 26 Cotets lists available at GrowigSciece Maagemet Sciece Letters homepage: www.growigsciece.com/msl A applicatio of data evelopmet aalysis for measurig the relative efficiecy
More informationFuzzy Reference Gain-Scheduling Approach as Intelligent Agents: FRGS Agent
Fuzzy Referece Gai-Schedulig Approach as Itelliget Agets: FRGS Aget J. E. ARAUJO * eresto@lit.ipe.br K. H. KIENITZ # kieitz@ita.br S. A. SANDRI sadra@lac.ipe.br J. D. S. da SILVA demisio@lac.ipe.br * Itegratio
More informationConsortium: North Carolina Community Colleges
Associatio of Research Libraries / Texas A&M Uiversity www.libqual.org Cotributors Collee Cook Texas A&M Uiversity Fred Heath Uiversity of Texas BruceThompso Texas A&M Uiversity Martha Kyrillidou Associatio
More informationpart2 Participatory Processes
part part2 Participatory Processes Participatory Learig Approaches Whose Learig? Participatory learig is based o the priciple of ope expressio where all sectios of the commuity ad exteral stakeholders
More informationCONSTITUENT VOICE TECHNICAL NOTE 1 INTRODUCING Version 1.1, September 2014
preview begis oct 2014 lauches ja 2015 INTRODUCING WWW.FEEDBACKCOMMONS.ORG A serviced cloud platform to share ad compare feedback data ad collaboratively develop feedback ad learig practice CONSTITUENT
More informationApplication for Admission
Applicatio for Admissio Admissio Office PO Box 2900 Illiois Wesleya Uiversity Bloomig, Illiois 61702-2900 Apply o-lie at: www.iwu.edu Applicatio Iformatio I am applyig: Early Actio Regular Decisio Early
More informationCOMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR
COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR ROLAND HAUSSER Institut für Deutsche Philologie Ludwig-Maximilians Universität München München, West Germany 1. CHOICE OF A PRIMITIVE OPERATION The
More informationSyntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm syntax: from the Greek syntaxis, meaning setting out together
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationVISION, MISSION, VALUES, AND GOALS
6 VISION, MISSION, VALUES, AND GOALS 2010-2015 VISION STATEMENT Ohloe College will be kow throughout Califoria for our iclusiveess, iovatio, ad superior rates of studet success. MISSION STATEMENT The Missio
More informationChinese Language Parsing with Maximum-Entropy-Inspired Parser
Chinese Language Parsing with Maximum-Entropy-Inspired Parser Heng Lian Brown University Abstract The Chinese language has many special characteristics that make parsing difficult. The performance of state-of-the-art
More informationBasic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1
Basic Parsing with Context-Free Grammars Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Announcements HW 2 to go out today. Next Tuesday most important for background to assignment Sign up
More informationTowards a MWE-driven A* parsing with LTAGs [WG2,WG3]
Towards a MWE-driven A* parsing with LTAGs [WG2,WG3] Jakub Waszczuk, Agata Savary To cite this version: Jakub Waszczuk, Agata Savary. Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]. PARSEME 6th general
More informationHANDBOOK. Career Center Handbook. Tools & Tips for Career Search Success CALIFORNIA STATE UNIVERSITY, SACR AMENTO
HANDBOOK Career Ceter Hadbook CALIFORNIA STATE UNIVERSITY, SACR AMENTO Tools & Tips for Career Search Success Academic Advisig ad Career Ceter 6000 J Street Lasse Hall 1013 Sacrameto, CA 95819-6064 916-278-6231
More informationThe Interface between Phrasal and Functional Constraints
The Interface between Phrasal and Functional Constraints John T. Maxwell III* Xerox Palo Alto Research Center Ronald M. Kaplan t Xerox Palo Alto Research Center Many modern grammatical formalisms divide
More information11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation
tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each
More informationInformatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy
Informatics 2A: Language Complexity and the Chomsky Hierarchy September 28, 2010 Starter 1 Is there a finite state machine that recognises all those strings s from the alphabet {a, b} where the difference
More informationContext Free Grammars. Many slides from Michael Collins
Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures
More informationParsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank
Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank Dan Klein and Christopher D. Manning Computer Science Department Stanford University Stanford,
More informationNatural Language Processing. George Konidaris
Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans
More informationRANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S
N S ER E P S I M TA S UN A I S I T VER RANKING AND UNRANKING LEFT SZILARD LANGUAGES Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A-1997-2 UNIVERSITY OF TAMPERE DEPARTMENT OF
More informationalso inside Continuing Education Alumni Authors College Events
SUMMER 2016 JAMESTOWN COMMUNITY COLLEGE ALUMNI MAGAZINE create a etrepreeur creatig a busiess a artist creatig beauty a citize creatig the future also iside Cotiuig Educatio Alumi Authors College Evets
More informationAn Interactive Intelligent Language Tutor Over The Internet
An Interactive Intelligent Language Tutor Over The Internet Trude Heift Linguistics Department and Language Learning Centre Simon Fraser University, B.C. Canada V5A1S6 E-mail: heift@sfu.ca Abstract: This
More information2014 Gold Award Winner SpecialParent
Award Wier SpecialParet Dedicated to all families of childre with special eeds 6 th Editio/Fall/Witer 2014 Desig ad Editorial Awards Competitio MISSION Our goal is to provide parets of childre with special
More informationDERMATOLOGY. Sponsored by the NYU Post-Graduate Medical School. 129 Years of Continuing Medical Education
Advaces i DERMATOLOGY THURSDAY - FRIDAY JUNE 7-8, 2012 New York, NY Sposored by the NYU Post-Graduate Medical School 129 Years of Cotiuig Medical Educatio THE RONALD O. PERELMAN DEPARTMENT OF DERMATOLOGY
More informationDeveloping a TT-MCTAG for German with an RCG-based Parser
Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert University of Tübingen, Germany CNRS-LORIA, France LREC 2008,
More informationPredicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks
Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com
More informationEfficient Normal-Form Parsing for Combinatory Categorial Grammar
Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, June 1996, pp. 79-86. Efficient Normal-Form Parsing for Combinatory Categorial Grammar Jason Eisner Dept. of Computer and Information Science
More informationChunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.
NLP Lab Session Week 8 October 15, 2014 Noun Phrase Chunking and WordNet in NLTK Getting Started In this lab session, we will work together through a series of small examples using the IDLE window and
More information1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class
If we cancel class 1/20 idea We ll spend an extra hour on 1/21 I ll give you a brief writing problem for 1/21 based on assigned readings Jot down your thoughts based on your reading so you ll be ready
More informationA General Class of Noncontext Free Grammars Generating Context Free Languages
INFORMATION AND CONTROL 43, 187-194 (1979) A General Class of Noncontext Free Grammars Generating Context Free Languages SARWAN K. AGGARWAL Boeing Wichita Company, Wichita, Kansas 67210 AND JAMES A. HEINEN
More informationSome Principles of Automated Natural Language Information Extraction
Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract
More informationMultimedia Courseware of Road Safety Education for Secondary School Students
Multimedia Courseware of Road Safety Education for Secondary School Students Hanis Salwani, O 1 and Sobihatun ur, A.S 2 1 Universiti Utara Malaysia, Malaysia, hanisalwani89@hotmail.com 2 Universiti Utara
More informationModule 12. Machine Learning. Version 2 CSE IIT, Kharagpur
Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should
More informationEnhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities
Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Yoav Goldberg Reut Tsarfaty Meni Adler Michael Elhadad Ben Gurion
More informationAnalysis of Probabilistic Parsing in NLP
Analysis of Probabilistic Parsing in NLP Krishna Karoo, Dr.Girish Katkar Research Scholar, Department of Electronics & Computer Science, R.T.M. Nagpur University, Nagpur, India Head of Department, Department
More informationDisambiguation of Thai Personal Name from Online News Articles
Disambiguation of Thai Personal Name from Online News Articles Phaisarn Sutheebanjard Graduate School of Information Technology Siam University Bangkok, Thailand mr.phaisarn@gmail.com Abstract Since online
More informationCS 598 Natural Language Processing
CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@
More informationAn Efficient Implementation of a New POP Model
An Efficient Implementation of a New POP Model Rens Bod ILLC, University of Amsterdam School of Computing, University of Leeds Nieuwe Achtergracht 166, NL-1018 WV Amsterdam rens@science.uva.n1 Abstract
More information"f TOPIC =T COMP COMP... OBJ
TREATMENT OF LONG DISTANCE DEPENDENCIES IN LFG AND TAG: FUNCTIONAL UNCERTAINTY IN LFG IS A COROLLARY IN TAG" Aravind K. Joshi Dept. of Computer & Information Science University of Pennsylvania Philadelphia,
More informationDistant Supervised Relation Extraction with Wikipedia and Freebase
Distant Supervised Relation Extraction with Wikipedia and Freebase Marcel Ackermann TU Darmstadt ackermann@tk.informatik.tu-darmstadt.de Abstract In this paper we discuss a new approach to extract relational
More informationBANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS
Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.
More informationUNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL
UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL A thesis submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in COMPUTER SCIENCE
More informationObjectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition
Chapter 2: The Representation of Knowledge Expert Systems: Principles and Programming, Fourth Edition Objectives Introduce the study of logic Learn the difference between formal logic and informal logic
More informationLTAG-spinal and the Treebank
LTAG-spinal and the Treebank a new resource for incremental, dependency and semantic parsing Libin Shen (lshen@bbn.com) BBN Technologies, 10 Moulton Street, Cambridge, MA 02138, USA Lucas Champollion (champoll@ling.upenn.edu)
More informationSeminar - Organic Computing
Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts
More informationhave to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,
A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994
More informationTop US Tech Talent for the Top China Tech Company
THE FALL 2017 US RECRUITING TOUR Top US Tech Talent for the Top China Tech Company INTERVIEWS IN 7 CITIES Tour Schedule CITY Boston, MA New York, NY Pittsburgh, PA Urbana-Champaign, IL Ann Arbor, MI Los
More informationGrammars & Parsing, Part 1:
Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review
More informationThe CYK -Approach to Serial and Parallel Parsing
The CYK -Approach to Serial and Parallel Parsing Anton Nijholt Traditional parsing methods for general context-free grammars have been re-investigated in order to see whether they can be adapted to a parallel
More informationMultimedia Application Effective Support of Education
Multimedia Application Effective Support of Education Eva Milková Faculty of Science, University od Hradec Králové, Hradec Králové, Czech Republic eva.mikova@uhk.cz Abstract Multimedia applications have
More informationA relational approach to translation
A relational approach to translation Rémi Zajac Project POLYGLOSS* University of Stuttgart IMS-CL /IfI-AIS, KeplerstraBe 17 7000 Stuttgart 1, West-Germany zajac@is.informatik.uni-stuttgart.dbp.de Abstract.
More informationApproaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque
Approaches to control phenomena handout 6 5.4 Obligatory control and morphological case: Icelandic and Basque Icelandinc quirky case (displaying properties of both structural and inherent case: lexically
More informationPrediction of Maximal Projection for Semantic Role Labeling
Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba
More informationHyperedge Replacement and Nonprojective Dependency Structures
Hyperedge Replacement and Nonprojective Dependency Structures Daniel Bauer and Owen Rambow Columbia University New York, NY 10027, USA {bauer,rambow}@cs.columbia.edu Abstract Synchronous Hyperedge Replacement
More informationA Version Space Approach to Learning Context-free Grammars
Machine Learning 2: 39~74, 1987 1987 Kluwer Academic Publishers, Boston - Manufactured in The Netherlands A Version Space Approach to Learning Context-free Grammars KURT VANLEHN (VANLEHN@A.PSY.CMU.EDU)
More informationAQUA: An Ontology-Driven Question Answering System
AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.
More informationTarget Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data
Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se
More informationSpecifying Logic Programs in Controlled Natural Language
TECHNICAL REPORT 94.17, DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF ZURICH, NOVEMBER 1994 Specifying Logic Programs in Controlled Natural Language Norbert E. Fuchs, Hubert F. Hofmann, Rolf Schwitter
More informationHans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZTI INF W. Germany. (2) [S' [NP who][s does he try to find [NP e]]s IS' $=~
The Treatment of Movement-Rules in a LFG-Parser Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZT NF W. Germany n this paper we propose a way of how to treat longdistance movement phenomena
More informationNCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches
NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches Yu-Chun Wang Chun-Kai Wu Richard Tzong-Han Tsai Department of Computer Science
More informationThe presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.
Lecture 4: OT Syntax Sources: Kager 1999, Section 8; Legendre et al. 1998; Grimshaw 1997; Barbosa et al. 1998, Introduction; Bresnan 1998; Fanselow et al. 1999; Gibson & Broihier 1998. OT is not a theory
More informationEECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;
EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10 Instructor: Kang G. Shin, 4605 CSE, 763-0391; kgshin@umich.edu Number of credit hours: 4 Class meeting time and room: Regular classes: MW 10:30am noon
More informationThe Singapore Copyright Act applies to the use of this document.
Title Mathematical problem solving in Singapore schools Author(s) Berinderjeet Kaur Source Teaching and Learning, 19(1), 67-78 Published by Institute of Education (Singapore) This document may be used
More informationGraduate Program in Education
SPECIAL EDUCATION THESIS/PROJECT AND SEMINAR (EDME 531-01) SPRING / 2015 Professor: Janet DeRosa, D.Ed. Course Dates: January 11 to May 9, 2015 Phone: 717-258-5389 (home) Office hours: Tuesday evenings
More informationThe Discourse Anaphoric Properties of Connectives
The Discourse Anaphoric Properties of Connectives Cassandre Creswell, Kate Forbes, Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi Λ, Bonnie Webber y Λ University of Pennsylvania 3401 Walnut Street Philadelphia,
More informationEdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar
EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,
More informationAbstractions and the Brain
Abstractions and the Brain Brian D. Josephson Department of Physics, University of Cambridge Cavendish Lab. Madingley Road Cambridge, UK. CB3 OHE bdj10@cam.ac.uk http://www.tcm.phy.cam.ac.uk/~bdj10 ABSTRACT
More informationSpecification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments
Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,
More informationHuman-like Natural Language Generation Using Monte Carlo Tree Search
Human-like Natural Language Generation Using Monte Carlo Tree Search Kaori Kumagai Ichiro Kobayashi Daichi Mochihashi Ochanomizu University The Institute of Statistical Mathematics {kaori.kumagai,koba}@is.ocha.ac.jp
More informationAdapting Stochastic Output for Rule-Based Semantics
Adapting Stochastic Output for Rule-Based Semantics Wissenschaftliche Arbeit zur Erlangung des Grades eines Diplom-Handelslehrers im Fachbereich Wirtschaftswissenschaften der Universität Konstanz Februar
More informationIntroduction, Organization Overview of NLP, Main Issues
HG2051 Language and the Computer Computational Linguistics with Python Introduction, Organization Overview of NLP, Main Issues Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/
More informationIntroduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.
to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about
More informationRefining the Design of a Contracting Finite-State Dependency Parser
Refining the Design of a Contracting Finite-State Dependency Parser Anssi Yli-Jyrä and Jussi Piitulainen and Atro Voutilainen The Department of Modern Languages PO Box 3 00014 University of Helsinki {anssi.yli-jyra,jussi.piitulainen,atro.voutilainen}@helsinki.fi
More informationA Graph Based Authorship Identification Approach
A Graph Based Authorship Identification Approach Notebook for PAN at CLEF 2015 Helena Gómez-Adorno 1, Grigori Sidorov 1, David Pinto 2, and Ilia Markov 1 1 Center for Computing Research, Instituto Politécnico
More informationLEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)
More informationMath-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade
Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade The third grade standards primarily address multiplication and division, which are covered in Math-U-See
More informationMeasuring the relative compositionality of verb-noun (V-N) collocations by integrating features
Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features Sriram Venkatapathy Language Technologies Research Centre, International Institute of Information Technology
More informationINTERMEDIATE ALGEBRA Course Syllabus
INTERMEDIATE ALGEBRA Course Syllabus This syllabus gives a detailed explanation of the course procedures and policies. You are responsible for this information - ask your instructor if anything is unclear.
More informationGrade 4. Common Core Adoption Process. (Unpacked Standards)
Grade 4 Common Core Adoption Process (Unpacked Standards) Grade 4 Reading: Literature RL.4.1 Refer to details and examples in a text when explaining what the text says explicitly and when drawing inferences
More informationMassachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139
Hariharan Narayanan Massachusetts Institute of Technology Tel: 773.428.3115 LIDS har@mit.edu 77 Massachusetts Avenue http://www.mit.edu/~har Room 32-D558 MA 02139 EMPLOYMENT Massachusetts Institute of
More informationMachine Learning from Garden Path Sentences: The Application of Computational Linguistics
Machine Learning from Garden Path Sentences: The Application of Computational Linguistics http://dx.doi.org/10.3991/ijet.v9i6.4109 J.L. Du 1, P.F. Yu 1 and M.L. Li 2 1 Guangdong University of Foreign Studies,
More informationSELF-STUDY QUESTIONNAIRE FOR REVIEW of the COMPUTER SCIENCE PROGRAM
Disclaimer: This Self Study was developed to meet the goals of the CAC Session at the 2006 Summit. It should not be considered as a model or a template. ABET Computing Accreditation Commission SELF-STUDY
More informationA. True B. False INVENTORY OF PROCESSES IN COLLEGE COMPOSITION
INVENTORY OF PROCESSES IN COLLEGE COMPOSITION This questionnaire describes the different ways that college students go about writing essays and papers. There are no right or wrong answers because there
More informationWriting Research Articles
Marek J. Druzdzel with minor additions from Peter Brusilovsky University of Pittsburgh School of Information Sciences and Intelligent Systems Program marek@sis.pitt.edu http://www.pitt.edu/~druzdzel Overview
More informationAccurate Unlexicalized Parsing for Modern Hebrew
Accurate Unlexicalized Parsing for Modern Hebrew Reut Tsarfaty and Khalil Sima an Institute for Logic, Language and Computation, University of Amsterdam Plantage Muidergracht 24, 1018TV Amsterdam, The
More informationSpoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers
Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers Chad Langley, Alon Lavie, Lori Levin, Dorcas Wallace, Donna Gates, and Kay Peterson Language Technologies Institute Carnegie
More informationCompositional Semantics
Compositional Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Words, bag of words Sequences Trees Meaning Representing Meaning An important goal of NLP/AI: convert natural language
More informationCLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction
CLASSIFICATION OF PROGRAM Critical Elements Analysis 1 Program Name: Macmillan/McGraw Hill Reading 2003 Date of Publication: 2003 Publisher: Macmillan/McGraw Hill Reviewer Code: 1. X The program meets
More informationarxiv:cmp-lg/ v1 7 Jun 1997 Abstract
Comparing a Linguistic and a Stochastic Tagger Christer Samuelsson Lucent Technologies Bell Laboratories 600 Mountain Ave, Room 2D-339 Murray Hill, NJ 07974, USA christer@research.bell-labs.com Atro Voutilainen
More informationOn March 15, 2016, Governor Rick Snyder. Continuing Medical Education Becomes Mandatory in Michigan. in this issue... 3 Great Lakes Veterinary
michiga veteriary medical associatio i this issue... 3 Great Lakes Veteriary Coferece 4 What You Need to Kow Whe Issuig a Iterstate Certificate of Ispectio 6 Low Pathogeic Avia Iflueza H5 Virus Detectios
More informationWhat Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models
What Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models Michael A. Sao Pedro Worcester Polytechnic Institute 100 Institute Rd. Worcester, MA 01609
More informationUniversity of Groningen. Systemen, planning, netwerken Bosman, Aart
University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document
More informationSouth Carolina English Language Arts
South Carolina English Language Arts A S O F J U N E 2 0, 2 0 1 0, T H I S S TAT E H A D A D O P T E D T H E CO M M O N CO R E S TAT E S TA N DA R D S. DOCUMENTS REVIEWED South Carolina Academic Content
More informationCLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH
ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department
More informationA Pumpkin Grows. Written by Linda D. Bullock and illustrated by Debby Fisher
GUIDED READING REPORT A Pumpkin Grows Written by Linda D. Bullock and illustrated by Debby Fisher KEY IDEA This nonfiction text traces the stages a pumpkin goes through as it grows from a seed to become
More information