Natural Languages Analysis in Machine Translation (MT) based on the STCG (STRING-TREE CORRESPONDENCE GRAMMAR)

Size: px
Start display at page:

Download "Natural Languages Analysis in Machine Translation (MT) based on the STCG (STRING-TREE CORRESPONDENCE GRAMMAR)"

Transcription

1 Natural Laguages Aalysis i Machie Traslatio (MT) based o the STCG (STRING-TREE CORRESPONDENCE GRAMMAR) Tag Eya Kog, Zahari Yusoff Uit Terjemaha Melalui Komputer Pusat Pegajia Sais Komputer Uiversiti Sais Malaysia Mide, Pulau Piag, Malaysia. [ eyakog@cs.usm.my ad zari@cs.usm.my] 0. Abstract The Strig-Tree Correspodece Grammar (STCG) [1] is a grammar formalism for defiig: a set of strigs (a laguage), a set of trees (valid represetatio/iterpretatio structures), the mappig betwee the two (to be iterpreted for aalysis & geeratio). The formalism is argued to be a totally declarative grammar formalism that ca associate, to strigs i a laguage, arbitrary tree structures as desired by the grammar writer to be the liguistic represetatio structures of the strigs. More importatly is the facility to specify the correspodece betwee the strig ad the associated tree i a very atural maer. These features are very much desired i grammar writig, i particular for the treatmet of certai liguistic pheomea which are 'o-stadard', amely featurisatio, lexicalisatio ad crossed depedecies [2,3]. Furthermore, a grammar writte i this way aturally iherits the desired property of bi-directioality (i fact o-directioality [4]) such that the same grammar ca be iterpreted for both aalysis ad geeratio. I this paper, we ivestigate the properties of the STCG for iterpretatio towards aalysis (as is uderstood withi the cotext of Machie Traslatio (MT)). Other tha usig STCG grammars as specificatios for the automatic geeratio of aalysis programs i the Specialised Laguages for Liguistic Programmig (SLLPs) of MT systems (a study reported i [5,6]), the work also cetres aroud the specificatio of a geeral aalyser/parser for the STCG. The proposed STCG aalyser is capable of mimickig some very useful features i various cotextfree parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the compact way of represetig possible parse trees for ambiguous seteces, such as the oe see i [8]. Though ot reported i this paper, we ote that the proposed aalyser also provide a atural way for hadlig the kid of awkward pheomea metioed above (amely lexicalisatio, featurisatio, ad worst of all, crossed depedecies) while at the same time retaiig much of the efficiecy of stadard cotext-free parsig algorithms (a study reported i [2,3]). 1. The STCG Formalism The Strig-Tree Correspodece Grammar is a declarative grammar formalism that ca be used to describe the correspodece betwee strigs of terms ad trees. I particular, liguistic rules are writte with utteraces as the strig of terms (heceforth STRING) ad the correspodig represetative liguistic structures as the tree (heceforth TREE). Figure 1 gives a idicatio of a full STCG rule. The structure of the TREE is totally specified by the liguist ad is ot costraied by ay applicatio of rules (as i the case for the parse tree i the classical cotext free grammar). I a rule, the mai correspodece is first declared: i the example, the STRING #1.v.#2.part (with #1 ad #2 beig strig variables, ie. variables which are istatiable to strigs of terms) is set to correspod to the TREE with root ode S (where ad are forest variables, ie. variables that ca be istatiated to lists of subtrees). The mai-corr(espodece) is followed by a declaratio of subcorrespodeces (o the right had 261

2 side) betwee substrigs of the STRING ad subtrees of the TREE, each of which possibly havig a list of refereces (rule ames). For example, the sub-corr(espodece) betwee the substrig #1 ad the subtree rooted at the ode 1 refers to the rules R..., the latter beig other rules i the grammar. This referece is a mechaism by meas of which the strig ad forest variables metioed earlier are fully istatiated via a operatio called iificatio [9,10] resultig i a correspodece betwee explicit strigs of terms ad ad trees, both without variables. I actual fact, the mai-corr as well as the sub-co specified i the rule are formally recorded i terms of a Structured Strig Tree Correspodece (SSTC) trasparet to the liguist [11] as illustrated i figure 2, where a give correspodece may be oprojective (eg. with discotiuous costituets) as is the case for the odev(part) i the example. Note also that the particle is chose (by the liguist) to be represeted as a collectio of features i the ode v - a case of featurisatio. Mai-Corr. 0"'"..ThP 1,/./.11) v(part) #1.v.#2.part with : R1 I very simple terms, a strig to tree correspodece i the STCG ca be viewed as aalogous to the mathematical defiitio of a relatio betwee iteger umbers as i the example give o the right. Here, a relatio (i this case a fuctio) f is defied i terms of fier subrelatios accordig to the subdomais. Sub-Corr. #1 with : R v(part) v.part = pick, etc. paa = up, etc. Figure 1. (0/a_d) Nff..11*.VP (- /a_b) (0/b e), kpart) (b_c+d_e/ (- /c_d) b_c+d_e) $13 #1. v. #2 part a bb cc dd e Figure 2. 1 #2 with : R -3 x<3 f(x)= x +5 3<x<5 x 55_X A set of STCG rules form a grammar, some of which are axiom rules (ie. start rules or rules cotaiig axiom trees, as i the axiom or the start symbol S i the classical cotext free grammar). With the sematics of the rules beig as idicated above, a grammar thus defies a laguage of strigs, a laguage of represetatio trees, ad the correspodece betwee elemets of the two laguages/sets. It is this set of strig-tree correspodeces that ca be iterpreted for both aalysis ad geeratio. 2. Natural Laguages Aalysis i MT Based o the STCG Iitially, the STCG was desiged to serve as a specificatio laguage for writig grammars i MT such that the specificatios writte i the STCG grammar formalism ca the be coded (maually) ito the liguistic programs for aalysis ad geeratio writte i the SLLPs of itegrated MT systems. Some substatial work have also bee carried out to automate this process, amely towards the automatic geeratio of aalysis programs i the MT systems ARIANE [12] ad JEMAH [13] from grammars writte i the STCG formalism (see for example [5,6]). However, due to certai limitatios i the existig SLLPs for the realisatio of a proper implemetatio of a STCG aalyser (as discussed i [2]), we have decided istead to look ito the desig of a aalyser which ca directly iterpret the STCG grammar The Fudametal Desig of the STCG Aalyser As we have see above, a STCG grammar actually defies a set of SSTCs i a way quite similar to the defiitio of a mathematical fuctio. I evaluatig a mathematical fuctio, if the fuctio is defied i terms of other sub-fuctios the it ca oly be completely evaluated after all its sub-fuctios have bee evaluated ad retur with the appropriate values. We ca view the STCG aalysis process i the same maer where, by takig the iput strig/setece as their STRING, the set of explicit SSTCs defied by the axiom rules of a grammar are costructed based o the resultat sub-sstcs defied by the referece rules of these axiom rules. Sice the 262

3 referece rules of the axiom rules may i tur refer to other rules, they may also retur the completed SSTCs oly after their respective referece rules have bee completed. This referece process will termiate whe all remaiig sub-sstcs evaluated are defied by subcorrespodeces which do ot refer to ay other rule, amely the 'lexical-sstcs', which must match with the iput words (the o-lexical SSTCs are called 'phrasal- SSTCs'). We illustrate this i the followig aalysis of the iput strig "He picks the ball up" with respect to a grammar cosistig of rule R1 give i figure 1 ad rules R1, R3 give i figure 3. The rule R1 is give as a axiom rule. The aalysis process begis with the evaluatio of the geeral SSTC defied by the axiom rule R1, which i tur leads to the evaluatio of two other sub-sstcs defied by the referece rules R1, R3 as illustrated i figure 4. mai-corr with : R1 mai-corr d/\. with : R3 Figure 3. sub-corr 1 = Joh, ball, he,..., etc. sub-corr the,etc. = ball,etc. VP vpar (l/t; #A. v. #B. Pa (1aa_bbcc5 with : RI - Apply rule R I - Apply rule R3 - Apply rule RI VP (0/1_5) (0/0_1) v(pai (1 2+4_5/ (0/2 4) 1=2+4_5)...".1111"" (2_3/2_3 ) ko_4/ ((1_1 /0_ I) j-le. picks. the. ball. LID 0_ _4 4_5 with : R I r #11 b_c with : R b d dc with : R3 Phrasal-SSTC (0/0_1) I (0_1/0_1) kit 01 with R I (o12_4) ded...""1"17 t (2_3/2_th3e).(3b_a411/3_4) 2_3 3_4 with : R... (0/2 4) v(part) ti Ski b_d Lexical-SSTC v (part) icks. u 1_2 4_5 1/10. hail _3 3_4 He 01 a b picks 1 2 b_d d_c c_5 the ball up 2_3 3_4 4_5 Figure 4. a /b picks 1 2 t d d_c c_5 he ball up _3 3_4 4_5 I the diagram above (o the left), the aalysis process expads the SSTC defied by the axiom rule ito a strig of sub-sstcs, which is further expaded ito aother strig of sub-sstcs util it caot be expaded ay further, which is whe the strig of sub-sstcs cosists oly of lexical-sstcs. The strig of lexical-sstcs is the matched with the words i the iput strig. Note that the matchig eed ot be i a projective maer, as ca be see i this particular example, where the lexical-sstcs are matched to the words i the iput strig i a crossed serial maer - a case of crossed depedecies. I order to keep track of such o- 263

4 projective correspodeces, we itroduce the use of idex variables to record the iterval correspodig to each symbol appearig i the STRING (as illustrated o the right). I [2], we proposed a desig of the STCG aalysis algorithm which is capable of mimickig some very useful features i various cotext-free parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the represetatio of shared forest i term of a STCG grammar rules which is i fact followig the approach adopted i [8] as illustrated i the ext sectio. 2.2 Multiple Results of aalysis for ambiguous iput setece The example setece give above is uambiguous, ad thus correspods to oly a sigle represetatio tree. However, atural laguage grammars are kow to be i the class of highly ambiguous grammars, ad as such, there may be umerous represetatio trees geerated for a sigle setece i the laguage described. Istead of storig each represetatio tree separately i the set of SSTCs defiig the correspodeces betwee the give setece ad all its possible represetatio trees, we should try to represet all these i a space-efficiet maer. I the figure give below, we preset a compact way of represetig a set of SSTCs correspods to a ambiguous setece by meas of a AND-OR graph of rules - similar to the techique used by [8]. For example, the two SSTCs: VP (0/ 13) (0/0_1 ) I V FP (013_6) ( 1_2/1_2) (0/2_3) p (0/4_6) ( 0_ 1/()_ I ) I (3_4/3_4) /4 (2_3/2 3) de ' (4_5/4_5) (5_6/S 6) ruh'14.?1 (0_1) (0/0_6) _6) (0_ ) ( 1_2/1_2) (OPT 6) PP (2_3/2_3) (0/3_6) (3_4/34) de t (4_5/4_57(56:6/5-6) e,.rs.4te rat() with : RTC Figure 5:Two liguistic represetatios of the setece Joh saw Mary i the boat. ca be factorised ito a AND-OR graph of rules R2, R3, R5, RPP (give below) ad rules R1, R3 (give i figure 3) i the followig maer: RIP I (Joh) (saw) (Mary) P ) R3 (i " De t R I (the) (boat) Figure 6 : A AND-OR Graph of STCG grammar rules. Mai-Corr. Sup-corr. Ne1)& V NIP #A. v. #B with : R2 with : R 1,RIR5 with : R I.R3,R5 Mai-Corr. Sub-Corr. S PP 1 EA #A.#B with : R2,R3 with : RPP with : R3 pa Mai-Corr. Sub-Corr. p $ lip *IA itg la with : with : R5 R I,R3 with : RPP Mai-Corr. p with : RPP Sub-Corr. with : R I,R3,R5 264

5 3. Cocludig Remarks Recetly, efficiet cotext-free parsig methods such as the LR parser ad Earley's Algorithm have bee referred to extesively i implemetig parsers for most of the formalisms used i the field of NLP. I a effort to retai the efficiecy of stadard cotext-free parsig algorithms, most recet declarative formalisms are typically restricted by the costrait of strig cocateatio i cotext-free grammars which allows a setece to be systematically decomposed so that the parsig process ca be idexed by the subparts of that decompositio (the substrigs). However, it has also bee widely recogised that the cocateatio restrictio of CFG ca be problematic i hadlig pheomea such as lexicalisatio, featurisatio, ad especially crossed depedecies. As a alterative, we propose the STCG formalism which allows for a more 'atural' way of specifiyig the strigs of the laguage beig described, their correspodig liguistically motivated represetatio trees, ad the correspodece betwee the two, where the correspodece eed ot be projective ad hece appropriate for the said pheomea. Eve though the stadard CF parsig methods caot be adopted directly i the aalysis of a iput setece with respect of a STCG grammar, due to the STRING patters of the STCG which eed ot submit to the cocateatio restrictio of CFG, i this paper we preset the geeral layout (due to the space costrait, however iterested readers may get more ails i [2]) of a aalyser for the STCG which is capable of mimickig some very useful features i various cotext-free parsig techiques. Oe such feature is the use of charts i tabular parsig algorithms, as exemplified i Earley's Algorithm [7], which is very helpful i avoidig redudacies that may otherwise result i a combiatorial explosio. Aother is the compact way of represetig possible parse trees for ambiguous seteces, such as the oe see i [8]. Furthermore, we have also provided a atural way for hadlig the kid of awkward pheomea such as lexicalisatio, featurisatio, ad worst of all, crossed depedecies, while at the same time retaiig much of the efficiecy of stadard cotext-free parsig algoritms [2,3]. REFERENCES [ 1 ] Zahari Y., Strig-Tree Correspodece Grammar: a declarative grammar formalism for defiig the correspodece betwee strigs of terms ad tree structures, proceedigs of the 3rd Coferece of the Europea Chapter of the ACL, Copehage, April [2] Tag Eya Kog, Natural laguages Aalysis i machie traslatio (MT) based o the STCG, PhD thesis, Uiversiti Sais Malaysia, Peag, March [3] Tag Eya Kog, Zahari Y., Hadlig Crossed Depedecies with the STCG, proceedigs of Natural Laguage Processig Pacific Rim Symposium (NLPRS'95), Sofitel Ambassador Hotel, Seoul, Korea, Dec. 4-6, [4] Yves Lepage, Parsig ad Geeratig Cotext-Sesitive Laguages with Correspodece Iificatio Grammars, proceedigs of the Natural Laguage Processig Pacific Rim Symposium (NLPRS'91), Sigapore, Nov [5] Zahari Yusoff, Tag Eya Kog, Geeratio of aalysis programs i ROBRA (ARIANE) From Strig-Tree Correspodece Grammars (or a Strategy for Aalysis i machie traaslatio), Proceedigs of the 3rd Machie Traslatio Summit, Washigto, D.C., July,1991. [6] Zahari Y., Tag Eya Kog, Strig-Tree Correspodece Grammars as a base for the automatic geeratio of aalysis programs i machie traaslatio, proceedigs of the Iteratioal Coferece o Curret Issues i Computatioal Liguistics, Peag, Jue [7] J. Earley, A efficiet catext-free parsig algorithm, Commuicatios of the ACM, Vol. 13, Num. 2, Feb 1970, pp [8] Lag, B., Towards a Uiform Formal Framework for Parsig, I : Curret Issues i Parsig Techology, M. Tomita (ed.), Kluwer Academic Publishers, 1991, pp [9] Zahari Y., Strategies ad heuristics i the aalysis of atural laguages i machie traslatio, PhD thesis, Uiversiti Sais Malaysia, Peag, March [10] Y.Lepage, U systeme de grammaires correspodacielles d'iificatio, these de Docteur, IMAG, Uiversite Joseph Fourier, Greoble, Jue [11] Zahari Yusoff, Christia Boitet, Represetatio trees ad strig-tree correspodeces, proceedigs of the 12th Iteratioal Coferece o Computatioal Liguistics, COLING-88, Budapest, August 1988, pp [12] Ch.Boitet, P.Guillaume, M.Quezel-Ambruaz, Le poit sur ARIANE-78, debut 1982 (DSE-I ), vol.], part.] : le logiciel, GETA, avril [13] Tog Loog Cheog, The JEMAH System : Referece Maual, UTMK documet, USM, Peag,

6 266

Natural language processing implementation on Romanian ChatBot

Natural language processing implementation on Romanian ChatBot Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION Natural laguage processig implemetatio o Romaia ChatBot RALF FABIAN, MARCU ALEXANDRU-NICOLAE Departmet for Iformatics

More information

'Norwegian University of Science and Technology, Department of Computer and Information Science

'Norwegian University of Science and Technology, Department of Computer and Information Science The helpful Patiet Record System: Problem Orieted Ad Kowledge Based Elisabeth Bayega, MS' ad Samso Tu, MS2 'Norwegia Uiversity of Sciece ad Techology, Departmet of Computer ad Iformatio Sciece ad Departmet

More information

E-LEARNING USABILITY: A LEARNER-ADAPTED APPROACH BASED ON THE EVALUATION OF LEANER S PREFERENCES. Valentina Terzieva, Yuri Pavlov, Rumen Andreev

E-LEARNING USABILITY: A LEARNER-ADAPTED APPROACH BASED ON THE EVALUATION OF LEANER S PREFERENCES. Valentina Terzieva, Yuri Pavlov, Rumen Andreev Titre du documet / Documet title E-learig usability : A learer-adapted approach based o the evaluatio of leaer's prefereces Auteur(s) / Author(s) TERZIEVA Valetia ; PAVLOV Yuri (1) ; ANDREEV Rume (2) ;

More information

arxiv: v1 [cs.dl] 22 Dec 2016

arxiv: v1 [cs.dl] 22 Dec 2016 ScieceWISE: Topic Modelig over Scietific Literature Networks arxiv:1612.07636v1 [cs.dl] 22 Dec 2016 A. Magalich, V. Gemmetto, D. Garlaschelli, A. Boyarsky Uiversity of Leide, The Netherlads {magalich,

More information

Management Science Letters

Management Science Letters Maagemet Sciece Letters 4 (24) 2 26 Cotets lists available at GrowigSciece Maagemet Sciece Letters homepage: www.growigsciece.com/msl A applicatio of data evelopmet aalysis for measurig the relative efficiecy

More information

Fuzzy Reference Gain-Scheduling Approach as Intelligent Agents: FRGS Agent

Fuzzy Reference Gain-Scheduling Approach as Intelligent Agents: FRGS Agent Fuzzy Referece Gai-Schedulig Approach as Itelliget Agets: FRGS Aget J. E. ARAUJO * eresto@lit.ipe.br K. H. KIENITZ # kieitz@ita.br S. A. SANDRI sadra@lac.ipe.br J. D. S. da SILVA demisio@lac.ipe.br * Itegratio

More information

Consortium: North Carolina Community Colleges

Consortium: North Carolina Community Colleges Associatio of Research Libraries / Texas A&M Uiversity www.libqual.org Cotributors Collee Cook Texas A&M Uiversity Fred Heath Uiversity of Texas BruceThompso Texas A&M Uiversity Martha Kyrillidou Associatio

More information

part2 Participatory Processes

part2 Participatory Processes part part2 Participatory Processes Participatory Learig Approaches Whose Learig? Participatory learig is based o the priciple of ope expressio where all sectios of the commuity ad exteral stakeholders

More information

CONSTITUENT VOICE TECHNICAL NOTE 1 INTRODUCING Version 1.1, September 2014

CONSTITUENT VOICE TECHNICAL NOTE 1 INTRODUCING  Version 1.1, September 2014 preview begis oct 2014 lauches ja 2015 INTRODUCING WWW.FEEDBACKCOMMONS.ORG A serviced cloud platform to share ad compare feedback data ad collaboratively develop feedback ad learig practice CONSTITUENT

More information

Application for Admission

Application for Admission Applicatio for Admissio Admissio Office PO Box 2900 Illiois Wesleya Uiversity Bloomig, Illiois 61702-2900 Apply o-lie at: www.iwu.edu Applicatio Iformatio I am applyig: Early Actio Regular Decisio Early

More information

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR ROLAND HAUSSER Institut für Deutsche Philologie Ludwig-Maximilians Universität München München, West Germany 1. CHOICE OF A PRIMITIVE OPERATION The

More information

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm syntax: from the Greek syntaxis, meaning setting out together

More information

Parsing of part-of-speech tagged Assamese Texts

Parsing of part-of-speech tagged Assamese Texts IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal

More information

VISION, MISSION, VALUES, AND GOALS

VISION, MISSION, VALUES, AND GOALS 6 VISION, MISSION, VALUES, AND GOALS 2010-2015 VISION STATEMENT Ohloe College will be kow throughout Califoria for our iclusiveess, iovatio, ad superior rates of studet success. MISSION STATEMENT The Missio

More information

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Chinese Language Parsing with Maximum-Entropy-Inspired Parser Chinese Language Parsing with Maximum-Entropy-Inspired Parser Heng Lian Brown University Abstract The Chinese language has many special characteristics that make parsing difficult. The performance of state-of-the-art

More information

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Basic Parsing with Context-Free Grammars Some slides adapted from Julia Hirschberg and Dan Jurafsky 1 Announcements HW 2 to go out today. Next Tuesday most important for background to assignment Sign up

More information

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3] Towards a MWE-driven A* parsing with LTAGs [WG2,WG3] Jakub Waszczuk, Agata Savary To cite this version: Jakub Waszczuk, Agata Savary. Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]. PARSEME 6th general

More information

HANDBOOK. Career Center Handbook. Tools & Tips for Career Search Success CALIFORNIA STATE UNIVERSITY, SACR AMENTO

HANDBOOK. Career Center Handbook. Tools & Tips for Career Search Success CALIFORNIA STATE UNIVERSITY, SACR AMENTO HANDBOOK Career Ceter Hadbook CALIFORNIA STATE UNIVERSITY, SACR AMENTO Tools & Tips for Career Search Success Academic Advisig ad Career Ceter 6000 J Street Lasse Hall 1013 Sacrameto, CA 95819-6064 916-278-6231

More information

The Interface between Phrasal and Functional Constraints

The Interface between Phrasal and Functional Constraints The Interface between Phrasal and Functional Constraints John T. Maxwell III* Xerox Palo Alto Research Center Ronald M. Kaplan t Xerox Palo Alto Research Center Many modern grammatical formalisms divide

More information

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each

More information

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy Informatics 2A: Language Complexity and the Chomsky Hierarchy September 28, 2010 Starter 1 Is there a finite state machine that recognises all those strings s from the alphabet {a, b} where the difference

More information

Context Free Grammars. Many slides from Michael Collins

Context Free Grammars. Many slides from Michael Collins Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures

More information

Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank

Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank Parsing with Treebank Grammars: Empirical Bounds, Theoretical Models, and the Structure of the Penn Treebank Dan Klein and Christopher D. Manning Computer Science Department Stanford University Stanford,

More information

Natural Language Processing. George Konidaris

Natural Language Processing. George Konidaris Natural Language Processing George Konidaris gdk@cs.brown.edu Fall 2017 Natural Language Processing Understanding spoken/written sentences in a natural language. Major area of research in AI. Why? Humans

More information

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S N S ER E P S I M TA S UN A I S I T VER RANKING AND UNRANKING LEFT SZILARD LANGUAGES Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A-1997-2 UNIVERSITY OF TAMPERE DEPARTMENT OF

More information

also inside Continuing Education Alumni Authors College Events

also inside Continuing Education Alumni Authors College Events SUMMER 2016 JAMESTOWN COMMUNITY COLLEGE ALUMNI MAGAZINE create a etrepreeur creatig a busiess a artist creatig beauty a citize creatig the future also iside Cotiuig Educatio Alumi Authors College Evets

More information

An Interactive Intelligent Language Tutor Over The Internet

An Interactive Intelligent Language Tutor Over The Internet An Interactive Intelligent Language Tutor Over The Internet Trude Heift Linguistics Department and Language Learning Centre Simon Fraser University, B.C. Canada V5A1S6 E-mail: heift@sfu.ca Abstract: This

More information

2014 Gold Award Winner SpecialParent

2014 Gold Award Winner SpecialParent Award Wier SpecialParet Dedicated to all families of childre with special eeds 6 th Editio/Fall/Witer 2014 Desig ad Editorial Awards Competitio MISSION Our goal is to provide parets of childre with special

More information

DERMATOLOGY. Sponsored by the NYU Post-Graduate Medical School. 129 Years of Continuing Medical Education

DERMATOLOGY. Sponsored by the NYU Post-Graduate Medical School. 129 Years of Continuing Medical Education Advaces i DERMATOLOGY THURSDAY - FRIDAY JUNE 7-8, 2012 New York, NY Sposored by the NYU Post-Graduate Medical School 129 Years of Cotiuig Medical Educatio THE RONALD O. PERELMAN DEPARTMENT OF DERMATOLOGY

More information

Developing a TT-MCTAG for German with an RCG-based Parser

Developing a TT-MCTAG for German with an RCG-based Parser Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert University of Tübingen, Germany CNRS-LORIA, France LREC 2008,

More information

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com

More information

Efficient Normal-Form Parsing for Combinatory Categorial Grammar

Efficient Normal-Form Parsing for Combinatory Categorial Grammar Proceedings of the 34th Annual Meeting of the ACL, Santa Cruz, June 1996, pp. 79-86. Efficient Normal-Form Parsing for Combinatory Categorial Grammar Jason Eisner Dept. of Computer and Information Science

More information

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence. NLP Lab Session Week 8 October 15, 2014 Noun Phrase Chunking and WordNet in NLTK Getting Started In this lab session, we will work together through a series of small examples using the IDLE window and

More information

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class If we cancel class 1/20 idea We ll spend an extra hour on 1/21 I ll give you a brief writing problem for 1/21 based on assigned readings Jot down your thoughts based on your reading so you ll be ready

More information

A General Class of Noncontext Free Grammars Generating Context Free Languages

A General Class of Noncontext Free Grammars Generating Context Free Languages INFORMATION AND CONTROL 43, 187-194 (1979) A General Class of Noncontext Free Grammars Generating Context Free Languages SARWAN K. AGGARWAL Boeing Wichita Company, Wichita, Kansas 67210 AND JAMES A. HEINEN

More information

Some Principles of Automated Natural Language Information Extraction

Some Principles of Automated Natural Language Information Extraction Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract

More information

Multimedia Courseware of Road Safety Education for Secondary School Students

Multimedia Courseware of Road Safety Education for Secondary School Students Multimedia Courseware of Road Safety Education for Secondary School Students Hanis Salwani, O 1 and Sobihatun ur, A.S 2 1 Universiti Utara Malaysia, Malaysia, hanisalwani89@hotmail.com 2 Universiti Utara

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities Yoav Goldberg Reut Tsarfaty Meni Adler Michael Elhadad Ben Gurion

More information

Analysis of Probabilistic Parsing in NLP

Analysis of Probabilistic Parsing in NLP Analysis of Probabilistic Parsing in NLP Krishna Karoo, Dr.Girish Katkar Research Scholar, Department of Electronics & Computer Science, R.T.M. Nagpur University, Nagpur, India Head of Department, Department

More information

Disambiguation of Thai Personal Name from Online News Articles

Disambiguation of Thai Personal Name from Online News Articles Disambiguation of Thai Personal Name from Online News Articles Phaisarn Sutheebanjard Graduate School of Information Technology Siam University Bangkok, Thailand mr.phaisarn@gmail.com Abstract Since online

More information

CS 598 Natural Language Processing

CS 598 Natural Language Processing CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@

More information

An Efficient Implementation of a New POP Model

An Efficient Implementation of a New POP Model An Efficient Implementation of a New POP Model Rens Bod ILLC, University of Amsterdam School of Computing, University of Leeds Nieuwe Achtergracht 166, NL-1018 WV Amsterdam rens@science.uva.n1 Abstract

More information

"f TOPIC =T COMP COMP... OBJ

f TOPIC =T COMP COMP... OBJ TREATMENT OF LONG DISTANCE DEPENDENCIES IN LFG AND TAG: FUNCTIONAL UNCERTAINTY IN LFG IS A COROLLARY IN TAG" Aravind K. Joshi Dept. of Computer & Information Science University of Pennsylvania Philadelphia,

More information

Distant Supervised Relation Extraction with Wikipedia and Freebase

Distant Supervised Relation Extraction with Wikipedia and Freebase Distant Supervised Relation Extraction with Wikipedia and Freebase Marcel Ackermann TU Darmstadt ackermann@tk.informatik.tu-darmstadt.de Abstract In this paper we discuss a new approach to extract relational

More information

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.

More information

UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL

UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL A thesis submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in COMPUTER SCIENCE

More information

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition Chapter 2: The Representation of Knowledge Expert Systems: Principles and Programming, Fourth Edition Objectives Introduce the study of logic Learn the difference between formal logic and informal logic

More information

LTAG-spinal and the Treebank

LTAG-spinal and the Treebank LTAG-spinal and the Treebank a new resource for incremental, dependency and semantic parsing Libin Shen (lshen@bbn.com) BBN Technologies, 10 Moulton Street, Cambridge, MA 02138, USA Lucas Champollion (champoll@ling.upenn.edu)

More information

Seminar - Organic Computing

Seminar - Organic Computing Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

Top US Tech Talent for the Top China Tech Company

Top US Tech Talent for the Top China Tech Company THE FALL 2017 US RECRUITING TOUR Top US Tech Talent for the Top China Tech Company INTERVIEWS IN 7 CITIES Tour Schedule CITY Boston, MA New York, NY Pittsburgh, PA Urbana-Champaign, IL Ann Arbor, MI Los

More information

Grammars & Parsing, Part 1:

Grammars & Parsing, Part 1: Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review

More information

The CYK -Approach to Serial and Parallel Parsing

The CYK -Approach to Serial and Parallel Parsing The CYK -Approach to Serial and Parallel Parsing Anton Nijholt Traditional parsing methods for general context-free grammars have been re-investigated in order to see whether they can be adapted to a parallel

More information

Multimedia Application Effective Support of Education

Multimedia Application Effective Support of Education Multimedia Application Effective Support of Education Eva Milková Faculty of Science, University od Hradec Králové, Hradec Králové, Czech Republic eva.mikova@uhk.cz Abstract Multimedia applications have

More information

A relational approach to translation

A relational approach to translation A relational approach to translation Rémi Zajac Project POLYGLOSS* University of Stuttgart IMS-CL /IfI-AIS, KeplerstraBe 17 7000 Stuttgart 1, West-Germany zajac@is.informatik.uni-stuttgart.dbp.de Abstract.

More information

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque Approaches to control phenomena handout 6 5.4 Obligatory control and morphological case: Icelandic and Basque Icelandinc quirky case (displaying properties of both structural and inherent case: lexically

More information

Prediction of Maximal Projection for Semantic Role Labeling

Prediction of Maximal Projection for Semantic Role Labeling Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba

More information

Hyperedge Replacement and Nonprojective Dependency Structures

Hyperedge Replacement and Nonprojective Dependency Structures Hyperedge Replacement and Nonprojective Dependency Structures Daniel Bauer and Owen Rambow Columbia University New York, NY 10027, USA {bauer,rambow}@cs.columbia.edu Abstract Synchronous Hyperedge Replacement

More information

A Version Space Approach to Learning Context-free Grammars

A Version Space Approach to Learning Context-free Grammars Machine Learning 2: 39~74, 1987 1987 Kluwer Academic Publishers, Boston - Manufactured in The Netherlands A Version Space Approach to Learning Context-free Grammars KURT VANLEHN (VANLEHN@A.PSY.CMU.EDU)

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se

More information

Specifying Logic Programs in Controlled Natural Language

Specifying Logic Programs in Controlled Natural Language TECHNICAL REPORT 94.17, DEPARTMENT OF COMPUTER SCIENCE, UNIVERSITY OF ZURICH, NOVEMBER 1994 Specifying Logic Programs in Controlled Natural Language Norbert E. Fuchs, Hubert F. Hofmann, Rolf Schwitter

More information

Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZTI INF W. Germany. (2) [S' [NP who][s does he try to find [NP e]]s IS' $=~

Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZTI INF W. Germany. (2) [S' [NP who][s does he try to find [NP e]]s IS' $=~ The Treatment of Movement-Rules in a LFG-Parser Hans-Ulrich Block, Hans Haugeneder Siemens AG, MOnchen ZT ZT NF W. Germany n this paper we propose a way of how to treat longdistance movement phenomena

More information

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches Yu-Chun Wang Chun-Kai Wu Richard Tzong-Han Tsai Department of Computer Science

More information

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing. Lecture 4: OT Syntax Sources: Kager 1999, Section 8; Legendre et al. 1998; Grimshaw 1997; Barbosa et al. 1998, Introduction; Bresnan 1998; Fanselow et al. 1999; Gibson & Broihier 1998. OT is not a theory

More information

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ; EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10 Instructor: Kang G. Shin, 4605 CSE, 763-0391; kgshin@umich.edu Number of credit hours: 4 Class meeting time and room: Regular classes: MW 10:30am noon

More information

The Singapore Copyright Act applies to the use of this document.

The Singapore Copyright Act applies to the use of this document. Title Mathematical problem solving in Singapore schools Author(s) Berinderjeet Kaur Source Teaching and Learning, 19(1), 67-78 Published by Institute of Education (Singapore) This document may be used

More information

Graduate Program in Education

Graduate Program in Education SPECIAL EDUCATION THESIS/PROJECT AND SEMINAR (EDME 531-01) SPRING / 2015 Professor: Janet DeRosa, D.Ed. Course Dates: January 11 to May 9, 2015 Phone: 717-258-5389 (home) Office hours: Tuesday evenings

More information

The Discourse Anaphoric Properties of Connectives

The Discourse Anaphoric Properties of Connectives The Discourse Anaphoric Properties of Connectives Cassandre Creswell, Kate Forbes, Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi Λ, Bonnie Webber y Λ University of Pennsylvania 3401 Walnut Street Philadelphia,

More information

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,

More information

Abstractions and the Brain

Abstractions and the Brain Abstractions and the Brain Brian D. Josephson Department of Physics, University of Cambridge Cavendish Lab. Madingley Road Cambridge, UK. CB3 OHE bdj10@cam.ac.uk http://www.tcm.phy.cam.ac.uk/~bdj10 ABSTRACT

More information

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,

More information

Human-like Natural Language Generation Using Monte Carlo Tree Search

Human-like Natural Language Generation Using Monte Carlo Tree Search Human-like Natural Language Generation Using Monte Carlo Tree Search Kaori Kumagai Ichiro Kobayashi Daichi Mochihashi Ochanomizu University The Institute of Statistical Mathematics {kaori.kumagai,koba}@is.ocha.ac.jp

More information

Adapting Stochastic Output for Rule-Based Semantics

Adapting Stochastic Output for Rule-Based Semantics Adapting Stochastic Output for Rule-Based Semantics Wissenschaftliche Arbeit zur Erlangung des Grades eines Diplom-Handelslehrers im Fachbereich Wirtschaftswissenschaften der Universität Konstanz Februar

More information

Introduction, Organization Overview of NLP, Main Issues

Introduction, Organization Overview of NLP, Main Issues HG2051 Language and the Computer Computational Linguistics with Python Introduction, Organization Overview of NLP, Main Issues Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/

More information

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions. to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about

More information

Refining the Design of a Contracting Finite-State Dependency Parser

Refining the Design of a Contracting Finite-State Dependency Parser Refining the Design of a Contracting Finite-State Dependency Parser Anssi Yli-Jyrä and Jussi Piitulainen and Atro Voutilainen The Department of Modern Languages PO Box 3 00014 University of Helsinki {anssi.yli-jyra,jussi.piitulainen,atro.voutilainen}@helsinki.fi

More information

A Graph Based Authorship Identification Approach

A Graph Based Authorship Identification Approach A Graph Based Authorship Identification Approach Notebook for PAN at CLEF 2015 Helena Gómez-Adorno 1, Grigori Sidorov 1, David Pinto 2, and Ilia Markov 1 1 Center for Computing Research, Instituto Politécnico

More information

LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE

LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)

More information

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade

Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade Math-U-See Correlation with the Common Core State Standards for Mathematical Content for Third Grade The third grade standards primarily address multiplication and division, which are covered in Math-U-See

More information

Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features

Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features Sriram Venkatapathy Language Technologies Research Centre, International Institute of Information Technology

More information

INTERMEDIATE ALGEBRA Course Syllabus

INTERMEDIATE ALGEBRA Course Syllabus INTERMEDIATE ALGEBRA Course Syllabus This syllabus gives a detailed explanation of the course procedures and policies. You are responsible for this information - ask your instructor if anything is unclear.

More information

Grade 4. Common Core Adoption Process. (Unpacked Standards)

Grade 4. Common Core Adoption Process. (Unpacked Standards) Grade 4 Common Core Adoption Process (Unpacked Standards) Grade 4 Reading: Literature RL.4.1 Refer to details and examples in a text when explaining what the text says explicitly and when drawing inferences

More information

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

Massachusetts Institute of Technology Tel: Massachusetts Avenue  Room 32-D558 MA 02139 Hariharan Narayanan Massachusetts Institute of Technology Tel: 773.428.3115 LIDS har@mit.edu 77 Massachusetts Avenue http://www.mit.edu/~har Room 32-D558 MA 02139 EMPLOYMENT Massachusetts Institute of

More information

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics Machine Learning from Garden Path Sentences: The Application of Computational Linguistics http://dx.doi.org/10.3991/ijet.v9i6.4109 J.L. Du 1, P.F. Yu 1 and M.L. Li 2 1 Guangdong University of Foreign Studies,

More information

SELF-STUDY QUESTIONNAIRE FOR REVIEW of the COMPUTER SCIENCE PROGRAM

SELF-STUDY QUESTIONNAIRE FOR REVIEW of the COMPUTER SCIENCE PROGRAM Disclaimer: This Self Study was developed to meet the goals of the CAC Session at the 2006 Summit. It should not be considered as a model or a template. ABET Computing Accreditation Commission SELF-STUDY

More information

A. True B. False INVENTORY OF PROCESSES IN COLLEGE COMPOSITION

A. True B. False INVENTORY OF PROCESSES IN COLLEGE COMPOSITION INVENTORY OF PROCESSES IN COLLEGE COMPOSITION This questionnaire describes the different ways that college students go about writing essays and papers. There are no right or wrong answers because there

More information

Writing Research Articles

Writing Research Articles Marek J. Druzdzel with minor additions from Peter Brusilovsky University of Pittsburgh School of Information Sciences and Intelligent Systems Program marek@sis.pitt.edu http://www.pitt.edu/~druzdzel Overview

More information

Accurate Unlexicalized Parsing for Modern Hebrew

Accurate Unlexicalized Parsing for Modern Hebrew Accurate Unlexicalized Parsing for Modern Hebrew Reut Tsarfaty and Khalil Sima an Institute for Logic, Language and Computation, University of Amsterdam Plantage Muidergracht 24, 1018TV Amsterdam, The

More information

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers Chad Langley, Alon Lavie, Lori Levin, Dorcas Wallace, Donna Gates, and Kay Peterson Language Technologies Institute Carnegie

More information

Compositional Semantics

Compositional Semantics Compositional Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Words, bag of words Sequences Trees Meaning Representing Meaning An important goal of NLP/AI: convert natural language

More information

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction CLASSIFICATION OF PROGRAM Critical Elements Analysis 1 Program Name: Macmillan/McGraw Hill Reading 2003 Date of Publication: 2003 Publisher: Macmillan/McGraw Hill Reviewer Code: 1. X The program meets

More information

arxiv:cmp-lg/ v1 7 Jun 1997 Abstract

arxiv:cmp-lg/ v1 7 Jun 1997 Abstract Comparing a Linguistic and a Stochastic Tagger Christer Samuelsson Lucent Technologies Bell Laboratories 600 Mountain Ave, Room 2D-339 Murray Hill, NJ 07974, USA christer@research.bell-labs.com Atro Voutilainen

More information

On March 15, 2016, Governor Rick Snyder. Continuing Medical Education Becomes Mandatory in Michigan. in this issue... 3 Great Lakes Veterinary

On March 15, 2016, Governor Rick Snyder. Continuing Medical Education Becomes Mandatory in Michigan. in this issue... 3 Great Lakes Veterinary michiga veteriary medical associatio i this issue... 3 Great Lakes Veteriary Coferece 4 What You Need to Kow Whe Issuig a Iterstate Certificate of Ispectio 6 Low Pathogeic Avia Iflueza H5 Virus Detectios

More information

What Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models

What Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models What Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models Michael A. Sao Pedro Worcester Polytechnic Institute 100 Institute Rd. Worcester, MA 01609

More information

University of Groningen. Systemen, planning, netwerken Bosman, Aart

University of Groningen. Systemen, planning, netwerken Bosman, Aart University of Groningen Systemen, planning, netwerken Bosman, Aart IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document

More information

South Carolina English Language Arts

South Carolina English Language Arts South Carolina English Language Arts A S O F J U N E 2 0, 2 0 1 0, T H I S S TAT E H A D A D O P T E D T H E CO M M O N CO R E S TAT E S TA N DA R D S. DOCUMENTS REVIEWED South Carolina Academic Content

More information

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department

More information

A Pumpkin Grows. Written by Linda D. Bullock and illustrated by Debby Fisher

A Pumpkin Grows. Written by Linda D. Bullock and illustrated by Debby Fisher GUIDED READING REPORT A Pumpkin Grows Written by Linda D. Bullock and illustrated by Debby Fisher KEY IDEA This nonfiction text traces the stages a pumpkin goes through as it grows from a seed to become

More information