A REINFORCEMENT LEARNING ALGORITHM WITH EVOLVING FUZZY NEURAL NETWORKS

Size: px
Start display at page:

Download "A REINFORCEMENT LEARNING ALGORITHM WITH EVOLVING FUZZY NEURAL NETWORKS"

Transcription

1 Proceedings of he 23 Inernaional Conference on Sysems, Conrol and Informaics A REINFORCEMEN LEARNING ALGORIHM WIH EVOLVING FUZZY NEURAL NEWORKS Hiesh Shah Professor, Deparmen of Elecronics & Communicaion G H Pael College of Engineering & echnology Vallabh Vidyanagar, Gujara (INDIA) iid.hiesh@gmail.com Absrac he synergy of he wo paradigms, neural nework and fuzzy inference sysem, has given rise o rapidly emerging filed, neuro-fuzzy sysems. Evolving neuro-fuzzy sysems are inended o use online learning o exrac knowledge from daa and perform a high-level adapaion of he nework srucure. We explore he poenial of evolving neuro-fuzzy sysems in reinforcemen learning (RL) applicaions. In his paper, a novel on-line sequenial learning evolving neuro-fuzzy model design for RL is proposed. We develop a dynamic evolving fuzzy neural nework (DENFIS) funcion approximaion approach o RL sysems. Poenial of his approach is demonsraed hrough a case sudy wo-link robo manipulaor. Simulaion resuls have demonsraed ha he proposed approach performs well in reinforcemen learning problems. Keywords Reinforcemen learning, Neuro-fuzzy sysem I. INRODUCION Reinforcemen learning (RL) paradigm is a compuaionally simple and direc approach o he adapive opimal conrol of nonlinear sysems []. In RL, he learning agen (conroller) ineracs wih an iniially unknown environmen (sysem) by measuring saes and applying acions according o is policy o maximize is cumulaive rewards. hus, RL provides a general mehodology o solve complex uncerain sequenial decision problems, which are very challenging in many real-world applicaions. Ofen he environmen of RL is ypically formulaed as a Markov Decision Process (MDP), consising of a se of all saes S, a se of all possible acions A, a sae ransiion probabiliy disribuion P :S A S [,], and a reward funcion R : S A. When all componens of he MDP are known, an opimal policy can be deermined, e.g., using dynamic programming. here has been a grea deal of progress in he machine learning communiy on value-funcion based reinforcemen learning mehods [2]. In value-funcion based reinforcemen learning, raher han learning a direc mapping from saes o acions, he agen learns an inermediae daa srucure known as a value funcion ha maps saes (or sae-acion pairs) o he expeced long erm reward. Value-funcion based learning mehods are appealing because he value funcion has welldefined semanics ha enable a sraighforward represenaion of he opimal policy, and heoreical resuls guaraneeing he convergence of cerain mehods [3]. Q-learning is a common model-free value funcion sraegy for RL [4]. Q-learning sysem maps every sae-acion pair o a M.Gopal Direcor, School of Engineering Shiv Nadar Universiy Noida, Uar Pradesh (INDIA) mgopal@snu.edu.in real number, he Q-value, which ells how opimal ha acion is in ha sae. For small domains, his mapping can be represened explicily by able of Q-values. For large domains, his approach is simply infeasible. If, one deals wih large discree or coninuous sae and acion spaces, i is ineviable o resor o funcion approximaion, for wo reasons: firs o overcome he sorage problem (curse of dimensionaliy), second o achieve daa efficiency (i.e., requiring only a few observaions o derive a near-opimal policy) by generalizing o unobserved saes-acion pairs. here is a large lieraure on RL algorihms using various value-funcion esimaion echniques. Funcionally, a fuzzy sysem or a neural nework can be described as a funcion approximaor. heoreical invesigaions have revealed ha neural neworks and fuzzy inference sysems are universal approximaors [5, 6]. Neural neworks are used o generalize he value funcion peraining o specific siuaions. However, hese works sill assume discree acions and canno handle coninuous-valued acions. In realisic applicaions, i is imperaive o deal wih coninuous saes and acions. Fuzzy Inference Sysem (FIS) can be used o faciliae generalizaion in he sae space and o generae coninuous acions, in paricular in conjuncion wih Q-learning widely known as fuzzy Q-learning (FQL). Glorennec [7] and he exension proposed by Jouffe [8] provided a fundamenal conribuion in he definiion of FQL, his is he basis for many of he exising implemenaions. In FQL, he consequen pars of a FIS are seleced by Q-learning. However, srucure and premise parameers are sill deermined by a priori knowledge. o circumven his problem, Er and Deng [9] proposed a dynamic fuzzy Q- learning (DFQL) approach o consruc self-uning FIS based on reinforcemen signals and deal wih coninuous sae and acion spaces. Recenly, he synergy of he wo paradigms, neural nework and fuzzy inference sysem, has given rise o rapidly emerging field, neuro-fuzzy sysems. he neuro-fuzzy erm means a ype of sysem characerized for a similar srucure of a fuzzy conroller, where he fuzzy ses and rules are adjused using neural nework uning echniques in an ieraive way wih he inpu-oupu daa vecors. A Neuro-fuzzy sysem is widely ermed as fuzzy neural nework (FuNN) [, ] in he lieraure. Fuzzy neural nework sysems are inended o capure he advanages of boh fuzzy logic (approximae reasoning) and neural neworks (learning) i.e. acquire fuzzy rules based on he learning abiliy of neural neworks [2]. 38

2 Proceedings of he 23 Inernaional Conference on Sysems, Conrol and Informaics Many researchers have developed such a neuro-fuzzy sysem for solving real-world problem effecively. he evolving fuzzy neural nework (EFuNN) was proposed by Kasabov in [3], one of he hybrid neuro-fuzzy archiecure. Dynamic evolving neural fuzzy inference sysem (dmefunn/denfis) [4] is a modified version of he EFuNN wih he idea ha, depending on he posiion of he inpu vecor in he inpu space, a FIS for calculaing he oupu is formed dynamically bases on m fuzzy rules ha had been creaed during he pas learning process. he applicaion of hese neworks has been in he areas of classificaion and regression using supervised learning mehods. DENFIS when used especially for online learning adapive sysems [4][5]. Use of neuro-fuzzy sysems for value funcion approximaion for RL seup has no ye been explored. In his paper, we explore he poenial of an alernaive dynamic evolving fuzzy-neural nework (dmefunn) for reinforcemen learning algorihms. We compare he learning performances of dmefunn and Dynamic FNN (here, dynamic fuzzy Q- learning) in reinforcemen learning framework, using simulaion experimen on wo-link robo manipulaor racking conrol problem. Furher, we examine he robusness performance of he proposed approach for handling he uncerainy in erms of parameer variaions and exernal disurbances. he paper is organized as follows. Secion II presens he heoreical background of fuzzy inference sysem wih reinforcemen learning approach and recen rends of neurofuzzy sysems. Secion III proposes archiecure and learning framework of dmefunn funcion approximaor for RL sysems. Secion IV exhibis he empirical performance based on he experimenal resuls of he sysem-wo-link robo manipulaor simulaions. Secion V, conclusions are drawn in he las secion. II. HEOREICAL BACKGROUND A neuro-fuzzy sysem is widely ermed as fuzzy neural nework (FuNN) [, ] in he lieraure. Fuzzy neural nework sysems are inended o capure he advanages of boh learning and compuaional power of neural nework and he high-level human-like hinking and reasoning of fuzzy sysem. Evolving fuzzy neural nework and dynamic evolving fuzzy neural nework are he hybrid neuro-fuzzy archiecure. A. Evolving Fuzzy Neural Nework (FEuNN) EFuNN implemens five layers Mamdani ype FIS. he firs layer passes crisp inpu variable o he second layer ha calculaes he degrees of compaibiliy in relaion o he predefined membership funcions. he hird layer is he rule layer and each node in his layer represens eiher an exising rule, or a rule anicipaed afer raining. he rule nodes represen prooypes of inpu-oupu daa as an associaion of hyperspheres from he fuzzy inpu and he fuzzy oupu spaces. Each rule node is defined by wo vecors of connecion weighs, which are adjused hrough a hybrid learning echnique. he fourh layer represens a fuzzy quanizaion of each oupu variable and calculaes he degree o which oupu membership funcions are mached he inpu daa. he fifh layer carries ou defuzzificaion and calculaes he crisp value for he oupu variable. In EFuNN, all he rule nodes are creaed during he learning phase. We used EFuNN as an funcion approximaor in RL framework, where inpu o he EFuNN is he sae or sae-acion pair resuled in o he oupu Q-value. B. Dynamic Evolving Fuzzy Neural Nework (DENFIS) he dynamic evolving neural-fuzzy inference sysem, DENFIS (also known as dmefunn), uses he firs-order akagi-sugeno ype of inference engine [4]. DENFIS is similar o EFuNN in some principles. I inheris and develops EFuNN s dynamic feaures ha make DENFIS suiable for online adapive sysems. he DENFIS model uses a local generalizaion. Principally srucure of EFuNN and DENFIS is somewha similar. Dynamic feaure of EFuNN developed wih he idea ha, depending on he posiion of he inpu vecor in he inpu space, a FIS for calculaing he oupu value is formed dynamically bases on m fuzzy rules ha has been creaed during he pas learning process. Evolving clusering mehod (ECM) [5] is used for fuzzy rules creaion and updaion wihin he inpu space pariioning. Alhough DENFIS mees he requiremens of online learning o form adapive inelligen sysems o a grea exen, however here is sill scope of advancemen. Our objecive is o use DENFIS as a funcion approximaor in reinforcemen learning framework. III. A novel value funcion approximaor for online sequenial learning on coninuous sae-acion domain based on DENFIS is proposed in his paper. Fig. shows archiecural view of he DENFIS funcion approximaion approach o RL sysem. APPROXIMAION OF VALUE FUNCION USING DENFIS s A Qs (, ai ) a A i DENFIS Acion selecor ε D error γv s ( ) Fig. DENFIS conroller archiecure s a ; where { } he sae-acion pair (, ) s = s, s 2,, s n S is he curren sysem sae and a is he each possible discree conrol acion in acion se A = { ai}; i =,,m, is he inpu of DENFIS model and he esimaed Q-value corresponding o ( s, a ) is he oupu of he nework. Q( s, a) = y = f( x ) = f( x, x2,, xq) () = β β x β x β x a 2 2 K v a pd a c c q Qs (, a ) wo-link robo q s (desired) Error meric evaluaor s 38

3 Proceedings of he 23 Inernaional Conference on Sysems, Conrol and Informaics where x is he inpu vecor ( x =[ x, x2,, xq] = ( s, a) ) of he DENFIS model and oupu y corresponds o esimaed Q-value associaed wih each sae-acion in rule Ri ; i =,2,...,m. raining samples are obained online as he ineracion beween he learning agen (conroller) and is environmen (plan). he online learning process of DENFIS involves he creaion of new fuzzy rules, and exising fuzzy rules can be updaed incremenally. In addiion, evolving clusering mehod (ECM) is used o pariion he inpu sample space o deermine he fuzzy ses in he aneceden par, i.e., ECM is used o deermine cluser ceners and membership funcions of he aneceden par, and wrls wih forgeing facor deermine he parameers of he consequen par of a fuzzy rule. he agen s acion is seleced based on he oupus of DENFIS. In specific, conrol acions are seleced using an exploraion/exploiaion policy [4] in order o explore he se of possible acions and acquire experience hrough he online RL signals. We use a pseudo-sochasic exploraion ε -greedy as in [4]. In ε -greedy exploraion, we gradually reduce he exploraion (deermined by he ε parameer) according o some schedule; we have reduced ε o is 9 percen value afer every ieraions. he lower limi of parameer ε has been kep fixed a.2 (o mainain exploraion). I is an online learning algorihm ha learns an approximae sae-acion value funcion Qs (, a ) ha converges o he opimal funcion Q (commonly called Q-value). Online version is given by Qs (, a) Qs (, a) η[ c γvs ( ) Qs (, a)] (2) c where s s is he sae ransiion under he conrol a A( s acion )(in fac a = a ( s ) a ( s ) ; where ( apd s ) is he acion generaed by inner PD loop), c is he cos incurred by he conroller, η (,] is he learning rae parameer ha can be used o opimize he speed of learning, and γ (,] is he discoun facor ha conrols he rade-off beween immediae and fuure coss. c pd A. Learning Process in DENFIS online model he firs-order agaki-sugeno fuzzy rules [58] are employed in DENFIS online model. he linear funcions in he consequence pars are creaed and updaed by linear leassquare esimaor (LSE) [5] on he learning daa. he linear funcion for a learning daa se of p daa pairs, { ([ xi, xi2,, xiq ], yi ), i =,2,, p}, can be expressed as y = β βx β2x2 βqxq (3) he leas-square esimaor (LSE) of β = β β β2 β q is calculaed as he coefficiens b b b b2 b q of =, by applying he following weighed leas-square esimaor formula: b= (A WA) A Wy (4) where x x2 x q w x2 x22 x2q 2 A= w ; y y y2 yp and W= = xp xp2 xpq wp Here W is he weigh marix and is elemens, w ij, are defined by d j ( d is he disance beween he j h j sample and he corresponding cluser cener), j =, 2,, p. We can rewrie equaion (4) wih he use of recursive LSE formula [4] as follows: P = (A WA) (5) b= PA Wy In he DENFIS online model, Kasabov and Song [99] used a weighed recursive LSE wih a forgeing facor defined as h follows. Le he k row vecor of marix A is denoed as a k h and he k elemen of y is denoed as y k. hen b can be calculaed ieraively as follows: bk = bk wk Pk a k ( yk a k bk) wk Pa k k ak P (6) k Pk = Pk λ λa k Pka k where k = n, n,... p ; w is he weigh of k -h k sample defined by d k ( d k is he disance beween he k -h sample and he corresponding cluser cenre); and λ (.8,) is forgeing facor. he iniial values of P n and bn can be calculaed direcly from (5) wih he use of firs n daa pairs from he learning daa se. In online DENFIS model, he rules are creaed and updaed a he same ime wih he inpu space pariioning using online ECM, and equaions (4) and (6). IV. SIMULAION EXPRIMENS o demonsrae he usefulness of dynamic evolving fuzzy neural nework funcion approximaor in reinforcemen learning framework, we conduced experimens using he wellknown wo-link robo manipulaor racking conrol problem. In implemenaion, he DENFIS has as inpu he saeacion pair and as oupu, he Q-value corresponding o he sae-acion pair. In paricular, he DENFIS nework begins wih zero cluser. We firs obained a group of fuzzy rules using an DENFIS off-line learning model, wih he use of raining samples available from well defined reinforcemen fuzzy sysems (here we ake raining samples from dynamic fuzzy Q-learning conroller). hen wih agen-environmen ineracion, he raining samples available and he DENFIS model build-up an online mode based on dynamic inference, i.e., clusering and reformulaion of he rules are performed whenever a new raining example is presened o he nework. he DENFIS off-line learning model when used as an 382

4 Proceedings of he 23 Inernaional Conference on Sysems, Conrol and Informaics iniializaion, improves he generalizaion (e.g., improves he learning efficiency). For simpliciy, he conroller uses wo DENFIS models as funcion approximaors; one each for he wo-links. DENFIS is one module of ECOS oolbox working in he MALAB numeric compuing environmen. he disance hreshold D hr is se o.8 and defaul value of he number of rules in dynamic fuzzy inference sysem is se o 3 for consrucing DENFIS. A. Simulaion Resuls and Discussion Simulaions were carried ou o sudy he learning performance, and robusness agains uncerainies, for DENFIS learning approach on wo-link robo manipulaor conrol problem. o analyze he DENFIS algorihm for compuaional cos, accuracy, and robusness, we compare he proposed approach wih dynamic fuzzy reinforcemen learning approach. MALAB 7. (R2a) has been used as simulaion ool. Learning performance sudy he physical sysem has been simulaed for a single run of sec using fourh-order Runge-Kua mehod, wih fixed ime sep of msec. Fig. 2 and Fig. 3 show he oupu racking error (boh he links), for boh he conrollers and. able abulaes he mean square error, absolue maximum error ( ma x e ( ) ), and absolue maximum conrol effor ( max τ ) under nominal operaing condiions..4.3 From he resuls (Figs. 2 3 and able ), we observe ha raining ime for is higher han. ouperforms, in erms of lower racking errors and he low value of absolue error and conrol effor for boh he links Robusness sudy In he following, we compare he performance of DFQ and under uncerainies. For his sudy, we rained he conroller for 2 episodes, and hen evaluaed he performance for wo cases: Effec of payload variaions : he end-effecor mass is varied wih ime, which corresponds o he roboic arm picking up and releasing payloads having differen masses. Fig. 4 and Fig. 5 show he oupu racking errors for link and link 2, respecively, and able 2 abulaes he mean square error, absolue maximum error and absolue maximum conrol effor a payload variaions wih ime ime (sec) Fig. 4 Effec of payload variaion comparison: oupu racking errors (link ) ime (sec) Fig. 2 Sandard wo-link conroller comparison: oupu racking errors (link ) ime (sec) Fig. 3 Sandard wo-link conroller comparison: oupu racking errors (link 2) able Comparison of conrollers: learning performance sudy raining max e ( ) MSE (rad) max τ (Nm) ime Conroller (rad) (sec) Link Link 2 Link Link 2 Link Link ime (sec) Fig. 5 Effec of payload variaion comparison: oupu racking errors (link 2) able 2 Comparison of conrollers: effec of payload variaions MSE (rad) Conroller max e() (rad) max τ (Nm) Link Link 2 Link Link 2 Link Link Effecs of exernal disurbances: A orque disurbance τ dis wih a sinusoidal variaion of frequency 2π rad/sec, was added wih ime o he model. he magniude of orque disurbance is expressed as a percenage of conrol effor.fig. 6 and Fig. 7 show he oupu racking errors for link and link 2, respecively, and able 3 abulaes he mean square error, absolue maximum error ( max e ( ) ), and absolue maximum conrol effor ( max τ ) for orque disurbances added wih ime o he model variaion. 383

5 Proceedings of he 23 Inernaional Conference on Sysems, Conrol and Informaics.4.3 fuzzy Q-learning based RL sysem. his feaure is achieved wihou any loss of performance ime (sec) Fig. 6 Effec of exernal disurbances comparison: oupu racking errors (link ) ime (sec) Fig. 7 Effec of exernal disurbances comparison: oupu racking errors (link 2) able 3 Comparison of conrollers: effec of exernal disurbances Conroller MSE (rad) max e ( ) (rad) max τ (Nm) Link Link 2 Link Link 2 Link Link Simulaion resuls (Figs 4 7, able 2 and able 3) show comparable robusness propery for DENFISQ-learning based conroller and Dynamic fuzzy Q-learning based conroller. V. CONCLUIONS We have explored he poenial of dynamic evolving fuzzyneural nework (DENFIS) for reinforcemen learning algorihms. DENFIS is a sequenial learning archiecure and has abiliy o grow and prune o ensure a parsimonious srucure ha is well suied for real-ime conrol applicaions. From he simulaion resuls, i is obvious ha raining ime in DENFIS based RL sysem is larger compared o he dynamic REFERENCES [] R. S. Suon, A. G. Baro, and R. J. Williams, Reinforcemen learning is direc adapive opimal conrol, IEEE Conrol Sys. Mag., vol. 2, no. 2, pp. 9 22, 992. [2] J. A. Boyan, and A. W. Moore, Generalizaion in reinforcemen learning: Safely approximaing he value funcion, Advances in Neural Informaion Proc. Sys., pp , 995. [3] B. Raich, On characerisics of Markov decision processes and reinforcemen learning in large domains, PhD hesis, Monréal: McGill Universiy, School of Compuer Science, 24. [4] R. S. Suon, and A. G. Baro, Reinforcemen Learning: An Inroducion (adapive compuaion and machine learning), Cambridge: MI Press, 998. [5] K. Hornic, M. Sinchcombe, and H. Whie, Mulilayer feed forward neworks are universal approximaors, Neural Neworks, vol. 2, pp , 989. [6] L. Wang, Fuzzy sysems are universal approximaors, in Proc. In. Conf. Fuzzy Sysem, 992. [7] P. Y. Glorennec, L. Jouffe, Fuzzy Q-learning, Proc. IEEE In. Conf. Fuzzy Sysems; vol. 2, pp , 997. [8] L. Jouffe, Fuzzy inference sysem learning by reinforcemen mehods, IEEE rans. Sysem, Man, and Cyberneics, Par C, vol. 28, no. 3, pp , 998. [9] M. J. Er, and C. Deng, Online uning of fuzzy inference sysems using dynamic fuzzy Q-learning, IEEE rans. on Sysems, Man, and Cyberneics, Par B, vol. 34, no. 3, pp , 24. [] N. Kasabov, Foundaion of Neural neworks, Fuzzy sysems and Knowledge engineering, he MI Press, CA, MA, 996. [] J. Vieira, F.M Dias, and A. Moa, Neuro-fuzzy sysems: A survey, WSEAS rans on Sysems, vol. 3, no. 2, April 24. [2] D. A. Linkes, and H. O. Nyongesa, Learning sysems in inelligen conrol: On appraisal of fuzzy, neural and geneic algorihm conrol applicaions, In Proc. Ins. Elec. Eng. Conrol heory Applicaions, vol. 43, pp , 996. [3] N. Kasabov, Evolving fuzzy neural neworks for supervised/unsupervised online knowledge-based learning, IEEE rans. Sys., Man, Cybern., Par B, vol. 3, no. 6, pp , Dec. 2. [4] N. Kasabov, and Q. Song, DENFIS: Dynamic evolving neuro-fuzzy inference sysem and is applicaion for ime-series predicion, IEEE rans. Fuzzy Sys., vol., no. 2, pp , April 22. [5] M J Was, A decade of Kasabov s evolving connecionis sysems: A review, IEEE rans. Sysems, Man, and Cyberneics-Par C: Applicaions and Reviews, vol. 39, no. 3, pp , May

Neural Network Model of the Backpropagation Algorithm

Neural Network Model of the Backpropagation Algorithm Neural Nework Model of he Backpropagaion Algorihm Rudolf Jakša Deparmen of Cyberneics and Arificial Inelligence Technical Universiy of Košice Lená 9, 4 Košice Slovakia jaksa@neuron.uke.sk Miroslav Karák

More information

An Effiecient Approach for Resource Auto-Scaling in Cloud Environments

An Effiecient Approach for Resource Auto-Scaling in Cloud Environments Inernaional Journal of Elecrical and Compuer Engineering (IJECE) Vol. 6, No. 5, Ocober 2016, pp. 2415~2424 ISSN: 2088-8708, DOI: 10.11591/ijece.v6i5.10639 2415 An Effiecien Approach for Resource Auo-Scaling

More information

Fast Multi-task Learning for Query Spelling Correction

Fast Multi-task Learning for Query Spelling Correction Fas Muli-ask Learning for Query Spelling Correcion Xu Sun Dep. of Saisical Science Cornell Universiy Ihaca, NY 14853 xusun@cornell.edu Anshumali Shrivasava Dep. of Compuer Science Cornell Universiy Ihaca,

More information

Channel Mapping using Bidirectional Long Short-Term Memory for Dereverberation in Hands-Free Voice Controlled Devices

Channel Mapping using Bidirectional Long Short-Term Memory for Dereverberation in Hands-Free Voice Controlled Devices Z. Zhang e al.: Channel Mapping using Bidirecional Long Shor-Term Memory for Dereverberaion in Hands-Free Voice Conrolled Devices 525 Channel Mapping using Bidirecional Long Shor-Term Memory for Dereverberaion

More information

Information Propagation for informing Special Population Subgroups about New Ground Transportation Services at Airports

Information Propagation for informing Special Population Subgroups about New Ground Transportation Services at Airports Downloaded from ascelibrary.org by Basil Sephanis on 07/13/16. Copyrigh ASCE. For personal use only; all righs reserved. Informaion Propagaion for informing Special Populaion Subgroups abou New Ground

More information

More Accurate Question Answering on Freebase

More Accurate Question Answering on Freebase More Accurae Quesion Answering on Freebase Hannah Bas, Elmar Haussmann Deparmen of Compuer Science Universiy of Freiburg 79110 Freiburg, Germany {bas, haussmann}@informaik.uni-freiburg.de ABSTRACT Real-world

More information

1 Language universals

1 Language universals AS LX 500 Topics: Language Uniersals Fall 2010, Sepember 21 4a. Anisymmery 1 Language uniersals Subjec-erb agreemen and order Bach (1971) discusses wh-quesions across SO and SO languages, hypohesizing:...

More information

MyLab & Mastering Business

MyLab & Mastering Business MyLab & Masering Business Efficacy Repor 2013 MyLab & Masering: Business Efficacy Repor 2013 Edied by Michelle D. Speckler 2013 Pearson MyAccouningLab, MyEconLab, MyFinanceLab, MyMarkeingLab, and MyOMLab

More information

Lecture 10: Reinforcement Learning

Lecture 10: Reinforcement Learning Lecture 1: Reinforcement Learning Cognitive Systems II - Machine Learning SS 25 Part III: Learning Programs and Strategies Q Learning, Dynamic Programming Lecture 1: Reinforcement Learning p. Motivation

More information

Learning Methods for Fuzzy Systems

Learning Methods for Fuzzy Systems Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8

More information

Evolutive Neural Net Fuzzy Filtering: Basic Description

Evolutive Neural Net Fuzzy Filtering: Basic Description Journal of Intelligent Learning Systems and Applications, 2010, 2: 12-18 doi:10.4236/jilsa.2010.21002 Published Online February 2010 (http://www.scirp.org/journal/jilsa) Evolutive Neural Net Fuzzy Filtering:

More information

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems

Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems Analysis of Hybrid Soft and Hard Computing Techniques for Forex Monitoring Systems Ajith Abraham School of Business Systems, Monash University, Clayton, Victoria 3800, Australia. Email: ajith.abraham@ieee.org

More information

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Proceedings of 28 ISFA 28 International Symposium on Flexible Automation Atlanta, GA, USA June 23-26, 28 ISFA28U_12 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Amit Gil, Helman Stern, Yael Edan, and

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Axiom 2013 Team Description Paper

Axiom 2013 Team Description Paper Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association

More information

Reinforcement Learning by Comparing Immediate Reward

Reinforcement Learning by Comparing Immediate Reward Reinforcement Learning by Comparing Immediate Reward Punit Pandey DeepshikhaPandey Dr. Shishir Kumar Abstract This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate

More information

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for

Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email Marilyn A. Walker Jeanne C. Fromer Shrikanth Narayanan walker@research.att.com jeannie@ai.mit.edu shri@research.att.com

More information

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria FUZZY EXPERT SYSTEMS 16-18 18 February 2002 University of Damascus-Syria Dr. Kasim M. Al-Aubidy Computer Eng. Dept. Philadelphia University What is Expert Systems? ES are computer programs that emulate

More information

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration INTERSPEECH 2013 Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration Yan Huang, Dong Yu, Yifan Gong, and Chaojun Liu Microsoft Corporation, One

More information

Visual CP Representation of Knowledge

Visual CP Representation of Knowledge Visual CP Representation of Knowledge Heather D. Pfeiffer and Roger T. Hartley Department of Computer Science New Mexico State University Las Cruces, NM 88003-8001, USA email: hdp@cs.nmsu.edu and rth@cs.nmsu.edu

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Exploration. CS : Deep Reinforcement Learning Sergey Levine Exploration CS 294-112: Deep Reinforcement Learning Sergey Levine Class Notes 1. Homework 4 due on Wednesday 2. Project proposal feedback sent Today s Lecture 1. What is exploration? Why is it a problem?

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Kamaldeep Kaur University School of Information Technology GGS Indraprastha University Delhi

Kamaldeep Kaur University School of Information Technology GGS Indraprastha University Delhi Soft Computing Approaches for Prediction of Software Maintenance Effort Dr. Arvinder Kaur University School of Information Technology GGS Indraprastha University Delhi Kamaldeep Kaur University School

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System

QuickStroke: An Incremental On-line Chinese Handwriting Recognition System QuickStroke: An Incremental On-line Chinese Handwriting Recognition System Nada P. Matić John C. Platt Λ Tony Wang y Synaptics, Inc. 2381 Bering Drive San Jose, CA 95131, USA Abstract This paper presents

More information

Softprop: Softmax Neural Network Backpropagation Learning

Softprop: Softmax Neural Network Backpropagation Learning Softprop: Softmax Neural Networ Bacpropagation Learning Michael Rimer Computer Science Department Brigham Young University Provo, UT 84602, USA E-mail: mrimer@axon.cs.byu.edu Tony Martinez Computer Science

More information

Artificial Neural Networks written examination

Artificial Neural Networks written examination 1 (8) Institutionen för informationsteknologi Olle Gällmo Universitetsadjunkt Adress: Lägerhyddsvägen 2 Box 337 751 05 Uppsala Artificial Neural Networks written examination Monday, May 15, 2006 9 00-14

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma

The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma International Journal of Computer Applications (975 8887) The Use of Statistical, Computational and Modelling Tools in Higher Learning Institutions: A Case Study of the University of Dodoma Gilbert M.

More information

TD(λ) and Q-Learning Based Ludo Players

TD(λ) and Q-Learning Based Ludo Players TD(λ) and Q-Learning Based Ludo Players Majed Alhajry, Faisal Alvi, Member, IEEE and Moataz Ahmed Abstract Reinforcement learning is a popular machine learning technique whose inherent self-learning ability

More information

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Mariusz Łapczy ski 1 and Bartłomiej Jefma ski 2 1 The Chair of Market Analysis and Marketing Research,

More information

E mail: Phone: LIBRARY MBA MAIN OFFICE

E mail: Phone: LIBRARY MBA MAIN OFFICE MASTER OF BUSINESS ADMINISTRATION 1 Jennifer Brandow, MBA Director E mail: mba@wsc.edu Phone: 402.375.7587 MBA OFFICE Gardner Hall 106 1111 Main St. Wayne, NE 68787 ADMISSIONS 402.375.7234 admissions@wsc.edu

More information

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots

Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots Continual Curiosity-Driven Skill Acquisition from High-Dimensional Video Inputs for Humanoid Robots Varun Raj Kompella, Marijn Stollenga, Matthew Luciw, Juergen Schmidhuber The Swiss AI Lab IDSIA, USI

More information

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler Machine Learning and Data Mining Ensembles of Learners Prof. Alexander Ihler Ensemble methods Why learn one classifier when you can learn many? Ensemble: combine many predictors (Weighted) combina

More information

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION Han Shu, I. Lee Hetherington, and James Glass Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Cambridge,

More information

Transfer Learning Action Models by Measuring the Similarity of Different Domains

Transfer Learning Action Models by Measuring the Similarity of Different Domains Transfer Learning Action Models by Measuring the Similarity of Different Domains Hankui Zhuo 1, Qiang Yang 2, and Lei Li 1 1 Software Research Institute, Sun Yat-sen University, Guangzhou, China. zhuohank@gmail.com,lnslilei@mail.sysu.edu.cn

More information

SARDNET: A Self-Organizing Feature Map for Sequences

SARDNET: A Self-Organizing Feature Map for Sequences SARDNET: A Self-Organizing Feature Map for Sequences Daniel L. James and Risto Miikkulainen Department of Computer Sciences The University of Texas at Austin Austin, TX 78712 dljames,risto~cs.utexas.edu

More information

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics Machine Learning from Garden Path Sentences: The Application of Computational Linguistics http://dx.doi.org/10.3991/ijet.v9i6.4109 J.L. Du 1, P.F. Yu 1 and M.L. Li 2 1 Guangdong University of Foreign Studies,

More information

AMULTIAGENT system [1] can be defined as a group of

AMULTIAGENT system [1] can be defined as a group of 156 IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART C: APPLICATIONS AND REVIEWS, VOL. 38, NO. 2, MARCH 2008 A Comprehensive Survey of Multiagent Reinforcement Learning Lucian Buşoniu, Robert Babuška,

More information

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology Tiancheng Zhao CMU-LTI-16-006 Language Technologies Institute School of Computer Science Carnegie Mellon

More information

Time series prediction

Time series prediction Chapter 13 Time series prediction Amaury Lendasse, Timo Honkela, Federico Pouzols, Antti Sorjamaa, Yoan Miche, Qi Yu, Eric Severin, Mark van Heeswijk, Erkki Oja, Francesco Corona, Elia Liitiäinen, Zhanxing

More information

Test Effort Estimation Using Neural Network

Test Effort Estimation Using Neural Network J. Software Engineering & Applications, 2010, 3: 331-340 doi:10.4236/jsea.2010.34038 Published Online April 2010 (http://www.scirp.org/journal/jsea) 331 Chintala Abhishek*, Veginati Pavan Kumar, Harish

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education

Applying Fuzzy Rule-Based System on FMEA to Assess the Risks on Project-Based Software Engineering Education Journal of Software Engineering and Applications, 2017, 10, 591-604 http://www.scirp.org/journal/jsea ISSN Online: 1945-3124 ISSN Print: 1945-3116 Applying Fuzzy Rule-Based System on FMEA to Assess the

More information

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH

CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH ISSN: 0976-3104 Danti and Bhushan. ARTICLE OPEN ACCESS CLASSIFICATION OF TEXT DOCUMENTS USING INTEGER REPRESENTATION AND REGRESSION: AN INTEGRATED APPROACH Ajit Danti 1 and SN Bharath Bhushan 2* 1 Department

More information

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and

Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept B.Tech in Computer science and Name Qualification Sonia Thomas Ph.D in Advance Machine Learning (computer science) PhD submitted, degree to be awarded on convocation, sept. 2016. M.Tech in Computer science and Engineering. B.Tech in

More information

Australian Journal of Basic and Applied Sciences

Australian Journal of Basic and Applied Sciences AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Feature Selection Technique Using Principal Component Analysis For Improving Fuzzy C-Mean

More information

A Reinforcement Learning Variant for Control Scheduling

A Reinforcement Learning Variant for Control Scheduling A Reinforcement Learning Variant for Control Scheduling Aloke Guha Honeywell Sensor and System Development Center 3660 Technology Drive Minneapolis MN 55417 Abstract We present an algorithm based on reinforcement

More information

Rule Learning with Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Sanket S. Kalamkar and Adrish Banerjee Department of Electrical Engineering

More information

Cal s Dinner Card Deals

Cal s Dinner Card Deals Cal s Dinner Card Deals Overview: In this lesson students compare three linear functions in the context of Dinner Card Deals. Students are required to interpret a graph for each Dinner Card Deal to help

More information

Detecting English-French Cognates Using Orthographic Edit Distance

Detecting English-French Cognates Using Orthographic Edit Distance Detecting English-French Cognates Using Orthographic Edit Distance Qiongkai Xu 1,2, Albert Chen 1, Chang i 1 1 The Australian National University, College of Engineering and Computer Science 2 National

More information

Learning to Schedule Straight-Line Code

Learning to Schedule Straight-Line Code Learning to Schedule Straight-Line Code Eliot Moss, Paul Utgoff, John Cavazos Doina Precup, Darko Stefanović Dept. of Comp. Sci., Univ. of Mass. Amherst, MA 01003 Carla Brodley, David Scheeff Sch. of Elec.

More information

Discriminative Learning of Beam-Search Heuristics for Planning

Discriminative Learning of Beam-Search Heuristics for Planning Discriminative Learning of Beam-Search Heuristics for Planning Yuehua Xu School of EECS Oregon State University Corvallis,OR 97331 xuyu@eecs.oregonstate.edu Alan Fern School of EECS Oregon State University

More information

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES

PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES PREDICTING SPEECH RECOGNITION CONFIDENCE USING DEEP LEARNING WITH WORD IDENTITY AND SCORE FEATURES Po-Sen Huang, Kshitiz Kumar, Chaojun Liu, Yifan Gong, Li Deng Department of Electrical and Computer Engineering,

More information

On the Combined Behavior of Autonomous Resource Management Agents

On the Combined Behavior of Autonomous Resource Management Agents On the Combined Behavior of Autonomous Resource Management Agents Siri Fagernes 1 and Alva L. Couch 2 1 Faculty of Engineering Oslo University College Oslo, Norway siri.fagernes@iu.hio.no 2 Computer Science

More information

Education: Integrating Parallel and Distributed Computing in Computer Science Curricula

Education: Integrating Parallel and Distributed Computing in Computer Science Curricula IEEE DISTRIBUTED SYSTEMS ONLINE 1541-4922 2006 Published by the IEEE Computer Society Vol. 7, No. 2; February 2006 Education: Integrating Parallel and Distributed Computing in Computer Science Curricula

More information

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education

GCSE Mathematics B (Linear) Mark Scheme for November Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education GCSE Mathematics B (Linear) Component J567/04: Mathematics Paper 4 (Higher) General Certificate of Secondary Education Mark Scheme for November 2014 Oxford Cambridge and RSA Examinations OCR (Oxford Cambridge

More information

Henry Tirri* Petri Myllymgki

Henry Tirri* Petri Myllymgki From: AAAI Technical Report SS-93-04. Compilation copyright 1993, AAAI (www.aaai.org). All rights reserved. Bayesian Case-Based Reasoning with Neural Networks Petri Myllymgki Henry Tirri* email: University

More information

Improving Fairness in Memory Scheduling

Improving Fairness in Memory Scheduling Improving Fairness in Memory Scheduling Using a Team of Learning Automata Aditya Kajwe and Madhu Mutyam Department of Computer Science & Engineering, Indian Institute of Tehcnology - Madras June 14, 2014

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

Reducing Features to Improve Bug Prediction

Reducing Features to Improve Bug Prediction Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science

More information

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT Takuya Yoshioka,, Anton Ragni, Mark J. F. Gales Cambridge University Engineering Department, Cambridge, UK NTT Communication

More information

High-level Reinforcement Learning in Strategy Games

High-level Reinforcement Learning in Strategy Games High-level Reinforcement Learning in Strategy Games Christopher Amato Department of Computer Science University of Massachusetts Amherst, MA 01003 USA camato@cs.umass.edu Guy Shani Department of Computer

More information

Mining Association Rules in Student s Assessment Data

Mining Association Rules in Student s Assessment Data www.ijcsi.org 211 Mining Association Rules in Student s Assessment Data Dr. Varun Kumar 1, Anupama Chadha 2 1 Department of Computer Science and Engineering, MVN University Palwal, Haryana, India 2 Anupama

More information

Pre-vocational training. Unit 2. Being a fitness instructor

Pre-vocational training. Unit 2. Being a fitness instructor Pre-vocational training Unit 2 Being a fitness instructor 1 Contents Unit 2 Working as a fitness instructor: teachers notes Unit 2 Working as a fitness instructor: answers Unit 2 Working as a fitness instructor:

More information

The development and implementation of a coaching model for project-based learning

The development and implementation of a coaching model for project-based learning The development and implementation of a coaching model for project-based learning W. Van der Hoeven 1 Educational Research Assistant KU Leuven, Faculty of Bioscience Engineering Heverlee, Belgium E-mail:

More information

Accessing Higher Education in Developing Countries: panel data analysis from India, Peru and Vietnam

Accessing Higher Education in Developing Countries: panel data analysis from India, Peru and Vietnam Accessing Higher Education in Developing Countries: panel data analysis from India, Peru and Vietnam Alan Sanchez (GRADE) y Abhijeet Singh (UCL) 12 de Agosto, 2017 Introduction Higher education in developing

More information

Speeding Up Reinforcement Learning with Behavior Transfer

Speeding Up Reinforcement Learning with Behavior Transfer Speeding Up Reinforcement Learning with Behavior Transfer Matthew E. Taylor and Peter Stone Department of Computer Sciences The University of Texas at Austin Austin, Texas 78712-1188 {mtaylor, pstone}@cs.utexas.edu

More information

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform doi:10.3991/ijac.v3i3.1364 Jean-Marie Maes University College Ghent, Ghent, Belgium Abstract Dokeos used to be one of

More information

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X

The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, / X The 9 th International Scientific Conference elearning and software for Education Bucharest, April 25-26, 2013 10.12753/2066-026X-13-154 DATA MINING SOLUTIONS FOR DETERMINING STUDENT'S PROFILE Adela BÂRA,

More information

ModellingSpace: A tool for synchronous collaborative problem solving

ModellingSpace: A tool for synchronous collaborative problem solving ModellingSpace: A tool for synchronous collaborative problem solving Nikolaos Avouris, Vassilis Komis, Meletis Margaritis, Christos Fidas University of Patras, GR-265 Rio Patras, Greece^ N.Avouris@ee.upatras.gr,

More information

Kansas Adequate Yearly Progress (AYP) Revised Guidance

Kansas Adequate Yearly Progress (AYP) Revised Guidance Kansas State Department of Education Kansas Adequate Yearly Progress (AYP) Revised Guidance Based on Elementary & Secondary Education Act, No Child Left Behind (P.L. 107-110) Revised May 2010 Revised May

More information

Assignment 1: Predicting Amazon Review Ratings

Assignment 1: Predicting Amazon Review Ratings Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model

Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Unsupervised Learning of Word Semantic Embedding using the Deep Structured Semantic Model Xinying Song, Xiaodong He, Jianfeng Gao, Li Deng Microsoft Research, One Microsoft Way, Redmond, WA 98052, U.S.A.

More information

Knowledge-Based - Systems

Knowledge-Based - Systems Knowledge-Based - Systems ; Rajendra Arvind Akerkar Chairman, Technomathematics Research Foundation and Senior Researcher, Western Norway Research institute Priti Srinivas Sajja Sardar Patel University

More information

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Active Learning. Yingyu Liang Computer Sciences 760 Fall Active Learning Yingyu Liang Computer Sciences 760 Fall 2017 http://pages.cs.wisc.edu/~yliang/cs760/ Some of the slides in these lectures have been adapted/borrowed from materials developed by Mark Craven,

More information

Laboratorio di Intelligenza Artificiale e Robotica

Laboratorio di Intelligenza Artificiale e Robotica Laboratorio di Intelligenza Artificiale e Robotica A.A. 2008-2009 Outline 2 Machine Learning Unsupervised Learning Supervised Learning Reinforcement Learning Genetic Algorithms Genetics-Based Machine Learning

More information

Student Name: OSIS#: DOB: / / School: Grade:

Student Name: OSIS#: DOB: / / School: Grade: Grade 6 ELA CCLS: Reading Standards for Literature Column : In preparation for the IEP meeting, check the standards the student has already met. Column : In preparation for the IEP meeting, check the standards

More information

Erkki Mäkinen State change languages as homomorphic images of Szilard languages

Erkki Mäkinen State change languages as homomorphic images of Szilard languages Erkki Mäkinen State change languages as homomorphic images of Szilard languages UNIVERSITY OF TAMPERE SCHOOL OF INFORMATION SCIENCES REPORTS IN INFORMATION SCIENCES 48 TAMPERE 2016 UNIVERSITY OF TAMPERE

More information

An Estimating Method for IT Project Expected Duration Oriented to GERT

An Estimating Method for IT Project Expected Duration Oriented to GERT An Estimating Method for IT Project Expected Duration Oriented to GERT Li Yu and Meiyun Zuo School of Information, Renmin University of China, Beijing 100872, P.R. China buaayuli@mc.e(iuxn zuomeiyun@263.nct

More information

Modeling user preferences and norms in context-aware systems

Modeling user preferences and norms in context-aware systems Modeling user preferences and norms in context-aware systems Jonas Nilsson, Cecilia Lindmark Jonas Nilsson, Cecilia Lindmark VT 2016 Bachelor's thesis for Computer Science, 15 hp Supervisor: Juan Carlos

More information

Georgetown University at TREC 2017 Dynamic Domain Track

Georgetown University at TREC 2017 Dynamic Domain Track Georgetown University at TREC 2017 Dynamic Domain Track Zhiwen Tang Georgetown University zt79@georgetown.edu Grace Hui Yang Georgetown University huiyang@cs.georgetown.edu Abstract TREC Dynamic Domain

More information

arxiv: v2 [cs.ro] 3 Mar 2017

arxiv: v2 [cs.ro] 3 Mar 2017 Learning Feedback Terms for Reactive Planning and Control Akshara Rai 2,3,, Giovanni Sutanto 1,2,, Stefan Schaal 1,2 and Franziska Meier 1,2 arxiv:1610.03557v2 [cs.ro] 3 Mar 2017 Abstract With the advancement

More information

Segmental Conditional Random Fields with Deep Neural Networks as Acoustic Models for First-Pass Word Recognition

Segmental Conditional Random Fields with Deep Neural Networks as Acoustic Models for First-Pass Word Recognition Segmental Conditional Random Fields with Deep Neural Networks as Acoustic Models for First-Pass Word Recognition Yanzhang He, Eric Fosler-Lussier Department of Computer Science and Engineering The hio

More information

Cooperative evolutive concept learning: an empirical study

Cooperative evolutive concept learning: an empirical study Cooperative evolutive concept learning: an empirical study Filippo Neri University of Piemonte Orientale Dipartimento di Scienze e Tecnologie Avanzate Piazza Ambrosoli 5, 15100 Alessandria AL, Italy Abstract

More information

Universityy. The content of

Universityy. The content of WORKING PAPER #31 An Evaluation of Empirical Bayes Estimation of Value Added Teacher Performance Measuress Cassandra M. Guarino, Indianaa Universityy Michelle Maxfield, Michigan State Universityy Mark

More information

Soft Computing based Learning for Cognitive Radio

Soft Computing based Learning for Cognitive Radio Int. J. on Recent Trends in Engineering and Technology, Vol. 10, No. 1, Jan 2014 Soft Computing based Learning for Cognitive Radio Ms.Mithra Venkatesan 1, Dr.A.V.Kulkarni 2 1 Research Scholar, JSPM s RSCOE,Pune,India

More information

Model Ensemble for Click Prediction in Bing Search Ads

Model Ensemble for Click Prediction in Bing Search Ads Model Ensemble for Click Prediction in Bing Search Ads Xiaoliang Ling Microsoft Bing xiaoling@microsoft.com Hucheng Zhou Microsoft Research huzho@microsoft.com Weiwei Deng Microsoft Bing dedeng@microsoft.com

More information

A General Class of Noncontext Free Grammars Generating Context Free Languages

A General Class of Noncontext Free Grammars Generating Context Free Languages INFORMATION AND CONTROL 43, 187-194 (1979) A General Class of Noncontext Free Grammars Generating Context Free Languages SARWAN K. AGGARWAL Boeing Wichita Company, Wichita, Kansas 67210 AND JAMES A. HEINEN

More information

INSTITUTE OF MANAGEMENT STUDIES NOIDA

INSTITUTE OF MANAGEMENT STUDIES NOIDA INSTITUTE OF MANAGEMENT STUDIES NOIDA MANDATORY DISCLOSURE- PGDM PROGRAMME The information has been provided by the concerned institution and the onus of authenticity lies with the Institution and not

More information

BUSINESS INTELLIGENCE FROM WEB USAGE MINING

BUSINESS INTELLIGENCE FROM WEB USAGE MINING BUSINESS INTELLIGENCE FROM WEB USAGE MINING Ajith Abraham Department of Computer Science, Oklahoma State University, 700 N Greenwood Avenue, Tulsa,Oklahoma 74106-0700, USA, ajith.abraham@ieee.org Abstract.

More information

Improving Action Selection in MDP s via Knowledge Transfer

Improving Action Selection in MDP s via Knowledge Transfer In Proc. 20th National Conference on Artificial Intelligence (AAAI-05), July 9 13, 2005, Pittsburgh, USA. Improving Action Selection in MDP s via Knowledge Transfer Alexander A. Sherstov and Peter Stone

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

Learning Prospective Robot Behavior

Learning Prospective Robot Behavior Learning Prospective Robot Behavior Shichao Ou and Rod Grupen Laboratory for Perceptual Robotics Computer Science Department University of Massachusetts Amherst {chao,grupen}@cs.umass.edu Abstract This

More information