Natural language processing implementation on Romanian ChatBot
|
|
- Rudolf Bishop
- 6 years ago
- Views:
Transcription
1 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION Natural laguage processig implemetatio o Romaia ChatBot RALF FABIAN, MARCU ALEXANDRU-NICOLAE Departmet for Iformatics Lucia Blaga Uiversity of Sibiu Io Raţiu Street, o. 5-7, Sibiu ROMANIA ralf.fabia@ulbsibiu.ro, marcu.alex@yahoo.com Abstract: - If i writte ore spoke form, laguage is a essece of huma behavior. It is used for kowledge represetatio ad trasfer from oe geeratio to aother. Without laguage we would't be able to ay kid of commuicatio. Today, we are all cofroted by a uprecedeted volume of iformatio ad most of it i text form. Usig computer system to maage ad access large volumes of iformatio becomes a ecessary evil for today, ad eve more for upcomig geeratios. System capable of uderstadig huma laguage would sigificatly improve huma-computer iteractio. This paper presets a applicatio framework for atural laguage processig i form of a ChatBot for the Romaia laguage. Key-Words: - atural laguage processig, formal laguages, parsig, atural laguage iterface, computatioal model 1 Itroductio Sice for may people, a large ad growig fractio of work ad leisure time is spet avigatig ad accessig the uiverse of iformatio, classical computer laguages ad iformatio queryig methods are ot a attractive ad realistic optio. Thus, the study of laguage has become a primary area of iterest for sciece. Besides the vast amog of iformatio that a system capable of uderstadig huma laguage would have to access, it would first of all improve humacomputer iteractio. Some years ago (i 2003), it was estimated that the aual productio of books reached about 8 Terabytes. It would take a huma beig at least five years to read the ew scietific material that is produced every 24 hours. Although these estimates where based o prited materials ad dose ot iclude the icreasigly amout of iformatio produced electroically o the Web. Noe of the curret expert system ca match the flexibility ad accurate of a huma coversatio. Ad cosiderig the level of ambiguity i some laguages is amazig how huma psychology has adapted. Natural laguage processig (NLP) is defied [4] as a field of computer sciece ad liguistics cocered with the iteractios betwee computers ad huma (atural) laguages. Computer system, uderstadig atural laguage, deal with machie readig comprehesio, ad represet a subtopic of NLP. The applicatio of atural laguage uderstadig described i these paper addresses text based processig. Researches have show that this kid of processig applies successfully to: queryig documet from databases with desired topics, iformatio extractio from documets ore messages, text traslatios from oe laguage to aother, questio-aswerig system [1][3]. The system eeds to participate actively i order to maitai a atural dialogue. Furthermore it requires verifyig that thigs are uderstood ad if ot, a ability to geerate clarificatio subdialogues. Parsig iput is more complex tha the reverse process of output costructio i atural laguage geeratio because of the potetial occurrece of ukow ad uexpected features i the iput ad the eed to determie the appropriate sytactic ad sematic schemes to apply to it. The first popular program that uses atural laguage commuicatio was ELIZA, developed by MIT begiig from Other commo Chaterbox ore Chatbot applicatios are Dr. Romulo based o the ALICE artificial itelligece chat platform ad MathBot for aswerig simple umber problems. A milestoe i this applicatio filed is the award-wiig free atural laguage artificial itelligece chat robot A.L.I.C.E. (Artificial Liguistic Iteret Computer Etity) usig AIML (Artificial Itelligece Markup Laguage) [8]. A huge of chatterbots, chat bot, coversatioal agets ad virtual agets from all over the World the Chatbots Directory [7]. ISSN: ISBN:
2 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION There are two mai motivatio for developig computatioal models: the scietific motivatio ad the practical/techological motivatio [1][2]. The former oe deals with obtaiig a better uderstadig about how laguage works ad tries to traspose complex theories as computer programs ad the test them by observig how well they perform. The later, deals whit the assumptio that atural laguage processig capabilities may chage the way computers are used today. Computers aware of atural laguage uderstadig could access ad maage stored iformatio i text form ad i additio provide a user iterface accessible to everyoe. I this paper we preset the implemetatio of a ChatBot based o commo cocepts from formal laguage theory ad atural laguage uderstadig. The program i implemeted i Pytho programmig laguage ad may be obtaied ad accessed through the iteret. At the ed we outlie a set of extesios to the formal model used. These are based o our previous research o formal models for modelig ad simulatig dyamic systems. Eve if the model used dose ot faithfully match the way humas process laguages, it is importat oly to produce the desired results. 2 Notios ad termiology The first step i makig a computer capable of processig atural laguage is to defie a set of rules that yield a exact commuicatio eeded for the computer, as cotrary to a more ambiguous oe accepted amog humas. It is possible for a setece to have ay umber of meaigs eve for a particular cotext. This raises a very particular problem for algorithms meat to uderstad huma laguage, because computers programs are traditioally used i a very precise ad exact way. From formal laguage theory we kow that a Chomsky geerative grammar (shortly grammar) [2], [6], [7], is a quadruple G= ( VN, VT, S, P), where V N ad V T are alphabets of otermial respectively termial symbols; S VN is the startig symbol or axiom ad P is a fiite set of pairs of words from ( VN VT), P= {( ui, vi) 1 i m}, so that ay word u i cotais at least oe otermial symbol. The pairs ( ui, v i) are called derivatio rules or productio rules or simple productios ad will be deoted by ui vi. If the left had side of a productio rule cosists of oly a sigle otermial symbol, the we have a cotext-free grammar [2][1]. The set of all seteces build i respect to the cosidered rules, is called laguage geerated by the grammar, ad formally defied as: LG ( ) = { p VT S p}. From the viewpoit of atural laguage processig, cotext free grammars are iterestig for two reasos: - the model is powerful eough to represet structures of atural laguages; - the model is simple eough to build efficiet parsers to aalyze seteces. Havig a model of a give laguage, the ext step is to create a algorithm which tests a give setece to see if it's well formed. Such type of algorithm is called parser. Techically, a parser or, more formally, sytactic aalysis, is the process of aalyzig a sequece of tokes to determie their grammatical structure with respect to a give formal grammar. Every programmig laguage has a parser ad at least a iterpreter to brig the code ito a program. Where as for programmig laguages this task is cosiderably easy, the huma laguages are seemigly edless i complexity. A setece i atural laguage may have several iterpretatios ad choosig betwee them is related to the beliefs ad kowledge of the perso who commuicates. The followig productio rules are a simplified except from the grammar of Romaia laguage used i our applicatio. S PS PS PREDICAT SUBIECT PREDICAT PRONUME_INTEROGATIV VERB_COPULATIV SUBIECT ARTICOL_NEHOTARAT SUBSTANTIV PRONUME_INTEROGATIV 'ce' VERB_COPULATIV 'este' ARTICOL_NEHOTARAT 'o' SUBTANTIV 'masia' Accordig to these rules we ca build seteces i a tree structure called derivatio tree (e.g. fig. 1.). Figure 1. Derivatio tree of a setece. Cosiderig that a setece ca come i ay size, they are two strategies for aalyzig a setece from the grammar poit of view: (a) top-dow - starts ISSN: ISBN:
3 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION with the S symbol ad attempts to populate the iteral structure of the derivatio tree to obtai a sequece of termial symbols that matches the classes of the words i the iput setece. (b) bottom-up - attempts to populate the iteral structure of the derivatio tree startig from the setece ad searchig for matchig productio rules that ca stepwise lead fially to the top symbol S. Simulatig a system maily supposes a model for experimet creatio that matches best to the evolvig of the real system ad also supposes a set of processig procedures for this experimets that may idicate the optimal decisios for a further cotrol of the system. The system simulatio starts with iitializig the system with data describig the iitial state. The dyamics of the system cosist i choosig the ext activity, i.e. the ext procedure to be executed. 3 Applicatio descriptio The applicatio represets a implemetatio of a Chatbot for Romaia laguage ad is basically guided by the work a results from [1][2][5][6]. Although, it is a attempt to build a framework for easig up computer-huma commuicatio trough atural laguage usage. The user iterface is simple ad ca easily be itegrated i ay web page. Figure 2 depicts the elemet of the iterface, cotaiig a text field for the user iput, a optioal butto to sed a setece to the system ad a list of coversatio history. Figure 2. User iterface. Iput Discussio history The developmet is etirely i Phyte [17] as programmig laguage ad Djago [18] for data models ad web itegratio. Fuctioally, the applicatio supports two operatio modes: (a) traiig mode this mode is used to trai the system by addig ad classifyig ew kowledge; (b) coversatio mode this is the ormal operatio mode i which the system respods to the seteces passed by user. Traiig data as well as testig data are collected from the Romaia laguage, due the uderlyig grammar. I this first versio the two operatio modes are idepedet. We pa to exted this type of operatio Exteral (Web iterface) Iteral Iput Words Parsig Sytactic structure Cotext iterpretatio Discourse cotext Grammar, Vocabulary Applicatio reasoig Output Words Realizatio Sytactic structure Cotext geeratio Figure 3. Respose geeratio process with iteral stages. ISSN: ISBN:
4 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION such that user iput seteces may cotribute to the kowledge of the system. A proposal for this is described i a later paragraph. 3.1 Iteral structure ad fuctioality With respect to the cocepts of grammars ad parsers, we ca ow describe the compositio ad iteral behavior of our system. For a give iput, i form of a setece, we have to cosider the followig steps, i order to get a uderstadig from it. First of all the setece has to be checked to see if it's well formed. Havig passed this, it is the divided ito three parts: the elemet that does the actio, the actio ad the rest of the setece cotaiig extra parameters used to uderstad how ad i which circumstaces the actio has bee performed. Next a class is costructed which geerates the actio o the word performig it. A respose accordig to the system's behavior is the provided. Commuicatio is the process of trasferrig iformatio from oe source to aother. I learig mode, all of the iformatio commuicated to the system, will eed i the begiig a few pre-coded actios, ecessary for creatig the iitial liks betwee parts of kowledge. All words are, i essece, labels for somethig i real life ad thus, they are the most basic elemets of huma commuicatio. I commo for all elemets are the represetatio of there specifics/particularities ad what are the actios that they ca do. A Word class cotais two distict elemets: the parts ad the actio. With this represetatio it is ow possible to create the setece. Importat here is to uderstad the way i witch a words meaig is affected i a certai setece, ad further more, the way i witch the meaig of the setece is affected by the phrase, ad i the ed, the geeral meaig affected by the cotext ad the geeral Figure 4. Iteral data represetatio ad relatios. ISSN: ISBN:
5 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION topic of the discussio. The other basic elemet of this system is the actio which i this case is actual source code which will be ivoked with the appropriate words cotaied i the setece. The way i which this particular fuctio is costructed will represet the way i which the setece will be uderstood. For better performace a distictio is made betwee existig verbs. A verb that will expect to get a adverb will perform oly a certai type of actios as opposite to a verb that expect a adjective. A eve more deep separatio ca be made usig a classificatio system. For example a verb ca perform certai operatios to certai types of words. 3.2 Itegratio ad extesio For a future release of this applicatio we cosider two alteratives as extesios for the formal model of the grammar treatig system. Oe based o fuzzy reasoig ad the other o a stochastic approach. We preset i the followig some basic cocepts for the stochastic versio. A fuzzy modelig system is treated i a earlier paper [16]. Defiitio 3.1. A stochastic geerative grammar (or shortly stochastic grammar) is a pair ( G, f p ) where: - G = ( VN, VT, S, P) is a Chomsky geerative grammar ad - fp : P [0,1] is a probability fuctio havig the property Σ( f p( α αi)) = 1, where α αi are all α -productios from P. Extedig the probability fuctios from productios to derivatios we obtai the followig results: f p( S q r) = fp( S q) fp( q r), where f p( q r) = fp( p), p beig the set of productios applied. Defiitio 3.2. The laguages geerated by the stochastic grammar ( G, f p ) is p T p L( G, f ) = { p p V ad S p, f ( S p) > 0}. Defiitio 3.3. A stochastic geerative grammar of type i = 0,1, 2,3 is a pair ( G, f p ) where G= ( VN, VT, S, P) is a Chomsky geerative grammar of type i ad fp : P [0,1] is a probability fuctio with the property Σ( f p( α αi)) = 1, where α αi are all α -productios from P. Defiitio 3.4. A stimulatio fuctios is a mappig f :[0,1] {1,2,..., } [0,1], f ( x, x,..., x, i) = ( f ( x, x,..., x, i), f2( x1, x2,..., x, i),..., f( x1, x2,..., x, i )) verifyig the properties: - fk( x1, x2,..., x, i) = xk k= 1 k= 1 - fi( x1, x2,..., x, i) xi - fl( x1, x2,..., x, i) xl (1) > (2) <, ( ) l i. (3) We otice that if α αi, i= 1,..., are all the α - productios from P the we ca associate to them a fiite probability field α α1 α α2... α α A ( α) = p1 p2... p (4) O the set of these probability fields A we defie ow a operator called stimulatio operator. A stimulatio operator is a mappig E: A {1,2,..., } A defied as α α1 α α2... α α E( A ( α), i) =, (5) q1 q2... q where ql = fl( x1, x2,..., x, i), l {1,2,..., }.. 4 Coclusio ad further work At this early stage of developmet, our approach looks promisig especially by the fact that it s workig. The modular developmet of the applicatio framework eables us to experimet with further extesio, as stated i paragraph 3.2. Oe major cocer for every system that has to iteract with a huma commuicator is to deal with error validatio. Furthermore, facig costatly such issues, it is ecessarily to exted the dyamic system behavior with a learig strategy. A iput validatio would the use the parser for a prelimiary check followed by the logical test. The later ca oly be doe by uderstadig a setece meaig. A drawback that eeds to be hadled is the fact that the system is curretly susceptible to assimilatig bad iformatio i the istructio mode. Aother issue we would like to ivestigate is the itegratio ito a ecommerce web portal, where a potetially ew cliet ca ask questios about the products available, ad this he way bypass tedious avigatig through cofusig meus. Refereces: [1] James Alle, Natural Laguage Uderstadig, 2d Editio, Addiso-Wesley, 1995, ISBN-10: , ISBN-13: ISSN: ISBN:
6 Proceedigs of the 9th WSEAS Iteratioal Coferece o SIMULATION, MODELLING AND OPTIMIZATION [2] Alfred V. Aho, Ravi Sethi, Jeffrey D. Ullma, Compilers: Priciples, Techiques, ad Tools, Addiso Wesley, [3] Lluís Marquez, Machie Learig ad Natural Laguage Processig, Techical Report LSI R, Departamet de Lleguatges i Sistemes Iform`atics (LSI), Uiversitat Politecica de Cataluya (UPC), Barceloa, Spai [4] Wikipedia, The free ecyclopedia, processig [5] Bird, Steve; Ewa Klei; Edward Loper, Natural Laguage Processig with Pytho. O'Reilly Media, [6] Bird, Steve; Ewa Klei; Edward Loper; Jaso Baldridge, Multidiscipliary istructio with the Natural Laguage Toolkit. Proceedigs of the Third Workshop o Issues i Teachig Computatioal Liguistics, ACL, [7] Chatbots directory, [8] A.L.I.C.E. Artificial Itelligece Foudatio [9] Multi Level Recursive Specificatios for Cotext Free Grammars - Vasile Crăciuea, Cristia Elea Aro, Ralf Fabia, Ioa-Daiel Huyadi, Proceedigs of the 11th WSEAS Iteratioal Coferece o COMPUTERS, Agios Nikolaos, Crete Islad, Greece, 2007, pag , ISSN , ISBN [10] Traslatio for itermediate code, Ioa-Daiel Huyadi, Emil M. Popa, Ralf Fabia, Ioela Moca Proceedigs of the 8th WSEAS Iteratioal Coferece, Mathematical Methods ad Computatioal Techiques i Electrical Egieerig (MMACTEE), Bucharest, Romaia 2006, pag , ISSN , ISBN [11] Fix poit iteral hierarchy specificatio for cotext free grammars - Vasile Crăciuea, Ralf Fabia, Ioa-Daiel Huyadi, Emil M. Popa, Proceedigs of the 11th WSEAS Iteratioal Coferece o COMPUTERS, Agios Nikolaos, Crete Islad, Greece, 2007, pag , ISSN , ISBN [12] Zadeh, L. A. (1992). Kowledge represetatio i fuzzy logic. I A itroductio to fuzzy logic applicatios i itelliget systems. Kluwer Academic. [13] William Siler, James J. Buckley - Fuzzy Expert Systems ad Fuzzy Reasoig, Published by Joh Wiley & Sos, Ic., Caada, [14] Jose Galido, Agelica Urrutia, Mario Piattii - Fuzzy Databases: Modelig, Desig ad Implemetatio, Idea Group Publishig, USA, [15] Wag, Z., X. Shao, G. Zhag, H. Zhu. Itegratio of Variable Precisio Rough Set ad Fuzzy Clusterig: A Applicatio to Kowledge Acquisitio for Maufacturig Process Plaig. Rough Sets, Fuzzy Sets, Data Miig, ad Graular Computig. Lecture Notes i Computer Sciece, , Spriger Berli / Heidelberg, [16] Fabia, R., V. Crăciuea, E M. Popa. Itelliget system modellig with total fuzzy grammars. Proc. of the 8th WSEAS Iteratioal Coferece, Mathematical Methods ad Computatioal Techiques i Electrical Egieerig (MMACTEE), 82-87, Bucharest, [17] Pytho Software Foudatio, [18] The Djago framework, ISSN: ISBN:
E-LEARNING USABILITY: A LEARNER-ADAPTED APPROACH BASED ON THE EVALUATION OF LEANER S PREFERENCES. Valentina Terzieva, Yuri Pavlov, Rumen Andreev
Titre du documet / Documet title E-learig usability : A learer-adapted approach based o the evaluatio of leaer's prefereces Auteur(s) / Author(s) TERZIEVA Valetia ; PAVLOV Yuri (1) ; ANDREEV Rume (2) ;
More informationFuzzy Reference Gain-Scheduling Approach as Intelligent Agents: FRGS Agent
Fuzzy Referece Gai-Schedulig Approach as Itelliget Agets: FRGS Aget J. E. ARAUJO * eresto@lit.ipe.br K. H. KIENITZ # kieitz@ita.br S. A. SANDRI sadra@lac.ipe.br J. D. S. da SILVA demisio@lac.ipe.br * Itegratio
More information'Norwegian University of Science and Technology, Department of Computer and Information Science
The helpful Patiet Record System: Problem Orieted Ad Kowledge Based Elisabeth Bayega, MS' ad Samso Tu, MS2 'Norwegia Uiversity of Sciece ad Techology, Departmet of Computer ad Iformatio Sciece ad Departmet
More informationarxiv: v1 [cs.dl] 22 Dec 2016
ScieceWISE: Topic Modelig over Scietific Literature Networks arxiv:1612.07636v1 [cs.dl] 22 Dec 2016 A. Magalich, V. Gemmetto, D. Garlaschelli, A. Boyarsky Uiversity of Leide, The Netherlads {magalich,
More informationManagement Science Letters
Maagemet Sciece Letters 4 (24) 2 26 Cotets lists available at GrowigSciece Maagemet Sciece Letters homepage: www.growigsciece.com/msl A applicatio of data evelopmet aalysis for measurig the relative efficiecy
More informationCONSTITUENT VOICE TECHNICAL NOTE 1 INTRODUCING Version 1.1, September 2014
preview begis oct 2014 lauches ja 2015 INTRODUCING WWW.FEEDBACKCOMMONS.ORG A serviced cloud platform to share ad compare feedback data ad collaboratively develop feedback ad learig practice CONSTITUENT
More informationConsortium: North Carolina Community Colleges
Associatio of Research Libraries / Texas A&M Uiversity www.libqual.org Cotributors Collee Cook Texas A&M Uiversity Fred Heath Uiversity of Texas BruceThompso Texas A&M Uiversity Martha Kyrillidou Associatio
More informationHANDBOOK. Career Center Handbook. Tools & Tips for Career Search Success CALIFORNIA STATE UNIVERSITY, SACR AMENTO
HANDBOOK Career Ceter Hadbook CALIFORNIA STATE UNIVERSITY, SACR AMENTO Tools & Tips for Career Search Success Academic Advisig ad Career Ceter 6000 J Street Lasse Hall 1013 Sacrameto, CA 95819-6064 916-278-6231
More informationApplication for Admission
Applicatio for Admissio Admissio Office PO Box 2900 Illiois Wesleya Uiversity Bloomig, Illiois 61702-2900 Apply o-lie at: www.iwu.edu Applicatio Iformatio I am applyig: Early Actio Regular Decisio Early
More informationpart2 Participatory Processes
part part2 Participatory Processes Participatory Learig Approaches Whose Learig? Participatory learig is based o the priciple of ope expressio where all sectios of the commuity ad exteral stakeholders
More informationVISION, MISSION, VALUES, AND GOALS
6 VISION, MISSION, VALUES, AND GOALS 2010-2015 VISION STATEMENT Ohloe College will be kow throughout Califoria for our iclusiveess, iovatio, ad superior rates of studet success. MISSION STATEMENT The Missio
More information2014 Gold Award Winner SpecialParent
Award Wier SpecialParet Dedicated to all families of childre with special eeds 6 th Editio/Fall/Witer 2014 Desig ad Editorial Awards Competitio MISSION Our goal is to provide parets of childre with special
More informationParsing of part-of-speech tagged Assamese Texts
IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal
More informationalso inside Continuing Education Alumni Authors College Events
SUMMER 2016 JAMESTOWN COMMUNITY COLLEGE ALUMNI MAGAZINE create a etrepreeur creatig a busiess a artist creatig beauty a citize creatig the future also iside Cotiuig Educatio Alumi Authors College Evets
More informationOn March 15, 2016, Governor Rick Snyder. Continuing Medical Education Becomes Mandatory in Michigan. in this issue... 3 Great Lakes Veterinary
michiga veteriary medical associatio i this issue... 3 Great Lakes Veteriary Coferece 4 What You Need to Kow Whe Issuig a Iterstate Certificate of Ispectio 6 Low Pathogeic Avia Iflueza H5 Virus Detectios
More informationErkki Mäkinen State change languages as homomorphic images of Szilard languages
Erkki Mäkinen State change languages as homomorphic images of Szilard languages UNIVERSITY OF TAMPERE SCHOOL OF INFORMATION SCIENCES REPORTS IN INFORMATION SCIENCES 48 TAMPERE 2016 UNIVERSITY OF TAMPERE
More informationSyntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm syntax: from the Greek syntaxis, meaning setting out together
More informationSINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,
More informationCOMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR
COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR ROLAND HAUSSER Institut für Deutsche Philologie Ludwig-Maximilians Universität München München, West Germany 1. CHOICE OF A PRIMITIVE OPERATION The
More informationRANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S
N S ER E P S I M TA S UN A I S I T VER RANKING AND UNRANKING LEFT SZILARD LANGUAGES Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A-1997-2 UNIVERSITY OF TAMPERE DEPARTMENT OF
More informationRule discovery in Web-based educational systems using Grammar-Based Genetic Programming
Data Mining VI 205 Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming C. Romero, S. Ventura, C. Hervás & P. González Universidad de Córdoba, Campus Universitario de
More informationGrammars & Parsing, Part 1:
Grammars & Parsing, Part 1: Rules, representations, and transformations- oh my! Sentence VP The teacher Verb gave the lecture 2015-02-12 CS 562/662: Natural Language Processing Game plan for today: Review
More informationSeminar - Organic Computing
Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts
More informationEvolutive Neural Net Fuzzy Filtering: Basic Description
Journal of Intelligent Learning Systems and Applications, 2010, 2: 12-18 doi:10.4236/jilsa.2010.21002 Published Online February 2010 (http://www.scirp.org/journal/jilsa) Evolutive Neural Net Fuzzy Filtering:
More informationPH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)
PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.) OVERVIEW ADMISSION REQUIREMENTS PROGRAM REQUIREMENTS OVERVIEW FOR THE PH.D. IN COMPUTER SCIENCE Overview The doctoral program is designed for those students
More informationSpecification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments
Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,
More informationAn Introduction to the Minimalist Program
An Introduction to the Minimalist Program Luke Smith University of Arizona Summer 2016 Some findings of traditional syntax Human languages vary greatly, but digging deeper, they all have distinct commonalities:
More informationLanguage properties and Grammar of Parallel and Series Parallel Languages
arxiv:1711.01799v1 [cs.fl] 6 Nov 2017 Language properties and Grammar of Parallel and Series Parallel Languages Mohana.N 1, Kalyani Desikan 2 and V.Rajkumar Dare 3 1 Division of Mathematics, School of
More informationSome Principles of Automated Natural Language Information Extraction
Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract
More informationA General Class of Noncontext Free Grammars Generating Context Free Languages
INFORMATION AND CONTROL 43, 187-194 (1979) A General Class of Noncontext Free Grammars Generating Context Free Languages SARWAN K. AGGARWAL Boeing Wichita Company, Wichita, Kansas 67210 AND JAMES A. HEINEN
More informationIntroduction, Organization Overview of NLP, Main Issues
HG2051 Language and the Computer Computational Linguistics with Python Introduction, Organization Overview of NLP, Main Issues Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/
More informationAxiom 2013 Team Description Paper
Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association
More informationInternational Conference KNOWLEDGE-BASED ORGANIZATION Vol. XXIII No SIMULATION AND GAMIFICATION IN E-LEARNING TECHNICAL COURSES
International Conference KNOWLEDGE-BASED ORGANIZATION Vol. XXIII No 3 2017 SIMULATION AND GAMIFICATION IN E-LEARNING TECHNICAL COURSES Ghiţă BÂRSAN*, Vasile NĂSTĂSESCU**, Vlad-Andrei BÂRSAN*** * "Nicolae
More informationA student diagnosing and evaluation system for laboratory-based academic exercises
A student diagnosing and evaluation system for laboratory-based academic exercises Maria Samarakou, Emmanouil Fylladitakis and Pantelis Prentakis Technological Educational Institute (T.E.I.) of Athens
More informationLearning Methods for Fuzzy Systems
Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8
More informationDerivational and Inflectional Morphemes in Pak-Pak Language
Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes
More informationWriting a composition
A good composition has three elements: Writing a composition an introduction: A topic sentence which contains the main idea of the paragraph. a body : Supporting sentences that develop the main idea. a
More informationATENEA UPC AND THE NEW "Activity Stream" or "WALL" FEATURE Jesus Alcober 1, Oriol Sánchez 2, Javier Otero 3, Ramon Martí 4
ATENEA UPC AND THE NEW "Activity Stream" or "WALL" FEATURE Jesus Alcober 1, Oriol Sánchez 2, Javier Otero 3, Ramon Martí 4 1 Universitat Politècnica de Catalunya (Spain) 2 UPCnet (Spain) 3 UPCnet (Spain)
More informationLearning Methods in Multilingual Speech Recognition
Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex
More informationUsing dialogue context to improve parsing performance in dialogue systems
Using dialogue context to improve parsing performance in dialogue systems Ivan Meza-Ruiz and Oliver Lemon School of Informatics, Edinburgh University 2 Buccleuch Place, Edinburgh I.V.Meza-Ruiz@sms.ed.ac.uk,
More informationReinforcement Learning by Comparing Immediate Reward
Reinforcement Learning by Comparing Immediate Reward Punit Pandey DeepshikhaPandey Dr. Shishir Kumar Abstract This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate
More informationCS 598 Natural Language Processing
CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@
More informationLEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE
LEXICAL COHESION ANALYSIS OF THE ARTICLE WHAT IS A GOOD RESEARCH PROJECT? BY BRIAN PALTRIDGE A JOURNAL ARTICLE Submitted in partial fulfillment of the requirements for the degree of Sarjana Sastra (S.S.)
More informationProof Theory for Syntacticians
Department of Linguistics Ohio State University Syntax 2 (Linguistics 602.02) January 5, 2012 Logics for Linguistics Many different kinds of logic are directly applicable to formalizing theories in syntax
More informationAQUA: An Ontology-Driven Question Answering System
AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.
More informationOn the Combined Behavior of Autonomous Resource Management Agents
On the Combined Behavior of Autonomous Resource Management Agents Siri Fagernes 1 and Alva L. Couch 2 1 Faculty of Engineering Oslo University College Oslo, Norway siri.fagernes@iu.hio.no 2 Computer Science
More informationInformatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy
Informatics 2A: Language Complexity and the Chomsky Hierarchy September 28, 2010 Starter 1 Is there a finite state machine that recognises all those strings s from the alphabet {a, b} where the difference
More informationA Grammar for Battle Management Language
Bastian Haarmann 1 Dr. Ulrich Schade 1 Dr. Michael R. Hieb 2 1 Fraunhofer Institute for Communication, Information Processing and Ergonomics 2 George Mason University bastian.haarmann@fkie.fraunhofer.de
More informationDeveloping Grammar in Context
Developing Grammar in Context intermediate with answers Mark Nettle and Diana Hopkins PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United
More informationAbstractions and the Brain
Abstractions and the Brain Brian D. Josephson Department of Physics, University of Cambridge Cavendish Lab. Madingley Road Cambridge, UK. CB3 OHE bdj10@cam.ac.uk http://www.tcm.phy.cam.ac.uk/~bdj10 ABSTRACT
More informationMyths, Legends, Fairytales and Novels (Writing a Letter)
Assessment Focus This task focuses on Communication through the mode of Writing at Levels 3, 4 and 5. Two linked tasks (Hot Seating and Character Study) that use the same context are available to assess
More informationMultimedia Application Effective Support of Education
Multimedia Application Effective Support of Education Eva Milková Faculty of Science, University od Hradec Králové, Hradec Králové, Czech Republic eva.mikova@uhk.cz Abstract Multimedia applications have
More informationAutomating the E-learning Personalization
Automating the E-learning Personalization Fathi Essalmi 1, Leila Jemni Ben Ayed 1, Mohamed Jemni 1, Kinshuk 2, and Sabine Graf 2 1 The Research Laboratory of Technologies of Information and Communication
More informationLoughton School s curriculum evening. 28 th February 2017
Loughton School s curriculum evening 28 th February 2017 Aims of this session Share our approach to teaching writing, reading, SPaG and maths. Share resources, ideas and strategies to support children's
More informationIntroduction to Simulation
Introduction to Simulation Spring 2010 Dr. Louis Luangkesorn University of Pittsburgh January 19, 2010 Dr. Louis Luangkesorn ( University of Pittsburgh ) Introduction to Simulation January 19, 2010 1 /
More informationContent Language Objectives (CLOs) August 2012, H. Butts & G. De Anda
Content Language Objectives (CLOs) Outcomes Identify the evolution of the CLO Identify the components of the CLO Understand how the CLO helps provide all students the opportunity to access the rigor of
More informationThe Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma
International Journal of Computer Applications (975 8887) The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma Gilbert M.
More information1.2 Interpretive Communication: Students will demonstrate comprehension of content from authentic audio and visual resources.
Course French I Grade 9-12 Unit of Study Unit 1 - Bonjour tout le monde! & les Passe-temps Unit Type(s) x Topical Skills-based Thematic Pacing 20 weeks Overarching Standards: 1.1 Interpersonal Communication:
More informationSoftware Maintenance
1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories
More informationPrediction of Maximal Projection for Semantic Role Labeling
Prediction of Maximal Projection for Semantic Role Labeling Weiwei Sun, Zhifang Sui Institute of Computational Linguistics Peking University Beijing, 100871, China {ws, szf}@pku.edu.cn Haifeng Wang Toshiba
More informationThe College Board Redesigned SAT Grade 12
A Correlation of, 2017 To the Redesigned SAT Introduction This document demonstrates how myperspectives English Language Arts meets the Reading, Writing and Language and Essay Domains of Redesigned SAT.
More informationA Comparison of Two Text Representations for Sentiment Analysis
010 International Conference on Computer Application and System Modeling (ICCASM 010) A Comparison of Two Text Representations for Sentiment Analysis Jianxiong Wang School of Computer Science & Educational
More informationLearning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for
Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email Marilyn A. Walker Jeanne C. Fromer Shrikanth Narayanan walker@research.att.com jeannie@ai.mit.edu shri@research.att.com
More informationTarget Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data
Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se
More informationContext Free Grammars. Many slides from Michael Collins
Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures
More informationGreeley-Evans School District 6 French 1, French 1A Curriculum Guide
Theme: Salut, les copains! - Greetings, friends! Inquiry Questions: How has the French language and culture influenced our lives, our language and the world? Vocabulary: Greetings, introductions, leave-taking,
More informationSyntactic systematicity in sentence processing with a recurrent self-organizing network
Syntactic systematicity in sentence processing with a recurrent self-organizing network Igor Farkaš,1 Department of Applied Informatics, Comenius University Mlynská dolina, 842 48 Bratislava, Slovak Republic
More informationChamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform
Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform doi:10.3991/ijac.v3i3.1364 Jean-Marie Maes University College Ghent, Ghent, Belgium Abstract Dokeos used to be one of
More informationThe 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X
The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,
More informationWriting Research Articles
Marek J. Druzdzel with minor additions from Peter Brusilovsky University of Pittsburgh School of Information Sciences and Intelligent Systems Program marek@sis.pitt.edu http://www.pitt.edu/~druzdzel Overview
More informationModule 12. Machine Learning. Version 2 CSE IIT, Kharagpur
Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should
More informationCharacteristics of the Text Genre Realistic fi ction Text Structure
LESSON 14 TEACHER S GUIDE by Oscar Hagen Fountas-Pinnell Level A Realistic Fiction Selection Summary A boy and his mom visit a pond and see and count a bird, fish, turtles, and frogs. Number of Words:
More informationExploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data
Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data Maja Popović and Hermann Ney Lehrstuhl für Informatik VI, Computer
More informationA Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many
Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.
More informationMaximizing Learning Through Course Alignment and Experience with Different Types of Knowledge
Innov High Educ (2009) 34:93 103 DOI 10.1007/s10755-009-9095-2 Maximizing Learning Through Course Alignment and Experience with Different Types of Knowledge Phyllis Blumberg Published online: 3 February
More informationDesigning a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses
Designing a Rubric to Assess the Modelling Phase of Student Design Projects in Upper Year Engineering Courses Thomas F.C. Woodhall Masters Candidate in Civil Engineering Queen s University at Kingston,
More informationLanguage Acquisition Fall 2010/Winter Lexical Categories. Afra Alishahi, Heiner Drenhaus
Language Acquisition Fall 2010/Winter 2011 Lexical Categories Afra Alishahi, Heiner Drenhaus Computational Linguistics and Phonetics Saarland University Children s Sensitivity to Lexical Categories Look,
More informationBANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS
Daffodil International University Institutional Repository DIU Journal of Science and Technology Volume 8, Issue 1, January 2013 2013-01 BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS Uddin, Sk.
More informationTop US Tech Talent for the Top China Tech Company
THE FALL 2017 US RECRUITING TOUR Top US Tech Talent for the Top China Tech Company INTERVIEWS IN 7 CITIES Tour Schedule CITY Boston, MA New York, NY Pittsburgh, PA Urbana-Champaign, IL Ann Arbor, MI Los
More informationPronunciation: Student self-assessment: Based on the Standards, Topics and Key Concepts and Structures listed here, students should ask themselves...
BVSD World Languages Course Outline Course Description: furthers the study of grammar, vocabulary and an understanding of the culture though movies, videos and magazines. Students improve listening, speaking,
More informationLecture 10: Reinforcement Learning
Lecture 1: Reinforcement Learning Cognitive Systems II - Machine Learning SS 25 Part III: Learning Programs and Strategies Q Learning, Dynamic Programming Lecture 1: Reinforcement Learning p. Motivation
More informationDERMATOLOGY. Sponsored by the NYU Post-Graduate Medical School. 129 Years of Continuing Medical Education
Advaces i DERMATOLOGY THURSDAY - FRIDAY JUNE 7-8, 2012 New York, NY Sposored by the NYU Post-Graduate Medical School 129 Years of Cotiuig Medical Educatio THE RONALD O. PERELMAN DEPARTMENT OF DERMATOLOGY
More informationChinese Language Parsing with Maximum-Entropy-Inspired Parser
Chinese Language Parsing with Maximum-Entropy-Inspired Parser Heng Lian Brown University Abstract The Chinese language has many special characteristics that make parsing difficult. The performance of state-of-the-art
More informationPart I. Figuring out how English works
9 Part I Figuring out how English works 10 Chapter One Interaction and grammar Grammar focus. Tag questions Introduction. How closely do you pay attention to how English is used around you? For example,
More informationCandidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.
The Test of Interactive English, C2 Level Qualification Structure The Test of Interactive English consists of two units: Unit Name English English Each Unit is assessed via a separate examination, set,
More informationNotes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1
Notes on The Sciences of the Artificial Adapted from a shorter document written for course 17-652 (Deciding What to Design) 1 Ali Almossawi December 29, 2005 1 Introduction The Sciences of the Artificial
More informationEnsemble Technique Utilization for Indonesian Dependency Parser
Ensemble Technique Utilization for Indonesian Dependency Parser Arief Rahman Institut Teknologi Bandung Indonesia 23516008@std.stei.itb.ac.id Ayu Purwarianti Institut Teknologi Bandung Indonesia ayu@stei.itb.ac.id
More informationMultisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems)
Multisensor Data Fusion: From Algorithms And Architectural Design To Applications (Devices, Circuits, And Systems) If searching for the ebook Multisensor Data Fusion: From Algorithms and Architectural
More informationFUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria
FUZZY EXPERT SYSTEMS 16-18 18 February 2002 University of Damascus-Syria Dr. Kasim M. Al-Aubidy Computer Eng. Dept. Philadelphia University What is Expert Systems? ES are computer programs that emulate
More information1. Introduction. 2. The OMBI database editor
OMBI bilingual lexical resources: Arabic-Dutch / Dutch-Arabic Carole Tiberius, Anna Aalstein, Instituut voor Nederlandse Lexicologie Jan Hoogland, Nederlands Instituut in Marokko (NIMAR) In this paper
More informationAn Interactive Intelligent Language Tutor Over The Internet
An Interactive Intelligent Language Tutor Over The Internet Trude Heift Linguistics Department and Language Learning Centre Simon Fraser University, B.C. Canada V5A1S6 E-mail: heift@sfu.ca Abstract: This
More informationA Reinforcement Learning Variant for Control Scheduling
A Reinforcement Learning Variant for Control Scheduling Aloke Guha Honeywell Sensor and System Development Center 3660 Technology Drive Minneapolis MN 55417 Abstract We present an algorithm based on reinforcement
More informationSchool of Innovative Technologies and Engineering
School of Innovative Technologies and Engineering Department of Applied Mathematical Sciences Proficiency Course in MATLAB COURSE DOCUMENT VERSION 1.0 PCMv1.0 July 2012 University of Technology, Mauritius
More informationSection 3.4. Logframe Module. This module will help you understand and use the logical framework in project design and proposal writing.
Section 3.4 Logframe Module This module will help you understand and use the logical framework in project design and proposal writing. THIS MODULE INCLUDES: Contents (Direct links clickable belo[abstract]w)
More informationVersion Space. Term 2012/2013 LSI - FIB. Javier Béjar cbea (LSI - FIB) Version Space Term 2012/ / 18
Version Space Javier Béjar cbea LSI - FIB Term 2012/2013 Javier Béjar cbea (LSI - FIB) Version Space Term 2012/2013 1 / 18 Outline 1 Learning logical formulas 2 Version space Introduction Search strategy
More informationTitle:A Flexible Simulation Platform to Quantify and Manage Emergency Department Crowding
Author's response to reviews Title:A Flexible Simulation Platform to Quantify and Manage Emergency Department Crowding Authors: Joshua E Hurwitz (jehurwitz@ufl.edu) Jo Ann Lee (joann5@ufl.edu) Kenneth
More informationCourse Outline for Honors Spanish II Mrs. Sharon Koller
Course Outline for Honors Spanish II Mrs. Sharon Koller Overview: Spanish 2 is designed to prepare students to function at beginning levels of proficiency in a variety of authentic situations. Emphasis
More informationProbabilistic Latent Semantic Analysis
Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview
More information11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation
tatistical Parsing (Following slides are modified from Prof. Raymond Mooney s slides.) tatistical Parsing tatistical parsing uses a probabilistic model of syntax in order to assign probabilities to each
More informationCreating Travel Advice
Creating Travel Advice Classroom at a Glance Teacher: Language: Grade: 11 School: Fran Pettigrew Spanish III Lesson Date: March 20 Class Size: 30 Schedule: McLean High School, McLean, Virginia Block schedule,
More information