Automatic Inference of Cross-modal Nonverbal Interactions in Multiparty Conversations

Size: px
Start display at page:

Download "Automatic Inference of Cross-modal Nonverbal Interactions in Multiparty Conversations"

Transcription

1 Auomaic Inference of Cross-modal Nonverbal Ineracions in Mulipary Conversaions "Who Responds o Whom, When, and How?" from Gaze, Head Gesures, and Uerances Kazuhiro Osuka NTT Communicaion Science Laboraories 3-1, Morinosao-Wakamiya Asugi, Japan osuka@eye.brl.n.co.jp Hiroshi Sawada NTT Communicaion Science Laboraories 2-4, Hikaridai, Seika-cho Kyoo, Japan sawada@cslab.n.co.jp Junji Yamao NTT Communicaion Science Laboraories 3-1, Morinosao-Wakamiya Asugi, Japan yamao@brl.n.co.jp ABSTRACT A novel probabilisic framework is proposed for analyzing cross-modal nonverbal ineracions in mulipary face-o-face conversaions. The goal is o deermine who responds o whom, when, and how from mulimodal cues including gaze, head gesures, and uerances. We formulae his problem as he probabilisic inference of he causal relaionship among paricipans behaviors involving head gesures and uerances. To solve his problem, his paper proposes a hierarchical probabilisic model; he srucures of ineracions are probabilisically deermined from high-level conversaion regimes (such as monologue or dialogue) and gaze direcions. Based on he model, he ineracion srucures, gaze, and conversaion regimes, are simulaneously inferred from observed head moion and uerances, using a Markov chain Mone Carlo mehod. The head gesures, including nodding, shaking and il, are recognized wih a novel Wavele-based echnique from magneic sensor signals. The uerances are deeced using daa capured by lapel microphones. Experimens on four-person conversaions confirm he effeciveness of he framework in discovering ineracions such as quesion-and-answer and addressing behavior followed by back-channel responses. Caegories and Subjec Descripors H1.2 [Models and Principles]: User/Machine Sysem Human Informaion Processing General Terms ALGORITHMS, HUMAN FACTORS Permission o make digial or hard copies of all or par of his work for personal or classroom use is graned wihou fee provided ha copies are no made or disribued for profi or commercial advanage and ha copies bear his noice and he full ciaion on he firs page. To copy oherwise, o republish, o pos on servers or o redisribue o liss, requires prior specific permission and/or a fee. ICMI November 12-15, 2007, Nagoya, Aichi, Japan. Copyrigh 2007 ACM /07/ $5.00. Keywords Face-o-face mulipary conversaion, Eye gaze, Head gesures, Nonverbal behaviors, Bayesian nework, Markov chain Mone Carlo, Gibbs sampler, Semi-Markov process 1. INTRODUCTION Face-o-face conversaion is one of he mos basic forms of communicaion in our life and is used for conveying/sharing informaion, undersanding ohers inenion/emoion, and making decisions. To enhance our communicaion capabiliy beyond conversaions on he spo, he auomaic analysis of conversaion scenes is a basic echnical requisie o enable effecive eleconferencing, archiving/summarizing meeings, and o realize communicaion via social agens and robos. The conversaion scene analysis arges various aspecs of conversaions, from individual/group behaviors such as who is speaking now? and who is alking/lisening o whom?, o conex/menal saus such as who made him angry? and why is she laughing?. In he face-o-face seing, he messages include no only verbal bu also nonverbal messages. The nonverbal messages are expressed by nonverbal behaviors in mulimodal channels such as eye gaze, facial expressions, head moion, hand gesure, body posure and prosody; psychologiss have elucidaed is imporance in human communicaions [1, 26]. Therefore, i is expeced ha conversaion scenes can be largely undersood by observing people s nonverbal behaviors wih sensing devices such as cameras and microphones. As a preliminary sep, auhors have focused on eye gaze as a nonverbal cue for recognizing addressing/lisening behaviors [22], based on he imporance of gaze funcionaliy in conversaions [16, 12]. We have conduced a frequency analysis of he se of gaze direcions of all paricipans, we call i gaze paern, and hypohesized ha he opology of gaze paerns (convergence and muual gaze) can indicae he paern of conversaions such as monologue and dialogue; we call hese he conversaion regimes. As an example, one relaionship is hearers end o look a speaker in monologues. To model he relaionships beween gaze paerns and regimes, we proposed a probabilisic conversaion model based on a dynamic Bayesian nework; he conversaional regime conrols he dynamics of gaze paerns and uerances; he gaze paern is a hidden variable and is esi-

2 maed from head-direcion measuremens. Wih his model, he regimes and he gaze paerns are joinly esimaed from he uerances and he head direcions measured wih sensors [22] or face racking in videos [23]; he esimaion is implemened using he MCMC(Markov chain Mone Carlo) mehod. This paper ries o exend our framework o a new arge, he auomaic inference of nonverbal ineracion srucures in mulipary conversaions; he goal is o deermine who responds o whom, when, and how. In conras o auhors previous work [22, 23], his paper ries o recognize more direc nonverbal ineracions in conversaions. We paricularly focus on head gesures (nod, shake, and il) and uerances as nonverbal cues, and ry o discover he acion-reacion pairs of paricipans behaviors such as quesion-and-answer and addressing followed by back-channel responses. The arge ineracions will yield cross-modal formaions such as an uerance acknowledged by nodding where he nod riggers he oher s uerances. The ineracion srucure is he basic primiive in conversaions and can reveal how messages are exchanged among people; i can be a clue for inferring how he aiude and minds of paricipans change. As far as he auhors know, his paper is he firs one o shed ligh on he explici srucural analysis of nonverbal ineracions in conversaions. We formulae his problem as he probabilisic inference of he causal relaionships among paricipans behaviors, we call hese relaionships he ineracion srucures. To solve his problem, his paper proposes a hierarchical probabilisic model; he ineracion srucures are probabilisically generaed from gaze and conversaion regimes; he ineracion srucures hen deermine how head gesures and uerances relae o each oher, i.e. which behavior is riggered by which behavior, as well as which behaviors are sponaneous and which are reacive. Based on he model, he ineracion srucures, gaze paerns, and conversaion regimes, are simulaneously inferred from head direcions, head gesure inervals, and uerance inervals, using a MCMC. One of he key feaures differeniaing our model from exising ineracion models is he modeling concep: explici represenaion of he causal relaionships among behaviors in Bayesian nework form; he configuraion of which is ruled by upper-layer processes, i.e. regimes and gaze. Anoher key feaure is he use of a semi-markov process [14] o accuraely model he emporal srucures of ineracions; i permis arbirary disribuions of behavior iming, such as duraion and pause lengh. As one such iming disribuion, his sudy employs a Weibull disribuion [21] due o is expressiveness. So far, several ineracion models have been proposed for conversaion scene analysis, based on he coupled-hmm [4] and is derivaives such as he influence model [2]. However, due o he Markov propery of hese models, he only exponenial emporal disribuions are suppored, which does no necessarily mach acual phenomena. Moreover, ineracion modeling has, so far, mainly argeed audio modaliy [2, 7], and he modeling of mulimodal ineracions remains an open problem. This paper focuses on head gesures such as nodding, shaking, il, for he following reasons. Firs, i is well known ha head gesures play imporan roles in face-oface conversaion for boh speakers and hearers [18]. The speaker s head gesures appear as visible signs of acions such as addressing, quesioning, and sressing. The hearers head gesures can be inerpreed as signs of lisening, acknowledgemen, agreemen/disagreemen, and he level of undersanding. These gesures are used o regulae various ineracions in conversaions, such as quesion & answer, addressing & back-channel response, and urn-aking/yielding. Therefore, head gesures are considered o be a rich informaion source for undersanding conversaions. Several head gesure recogniion mehods have been proposed for man-machine inerfaces using echniques such as HMM [15] and FFT+SVM(Suppor Vecor Machine)[20]. Unlike ineracions wih arificial agens, human-human conversaions exhibi a wide variey of gesures, in erms of periodiciy, speed, and dynamic range, which are mixed ogeher wih oher head moions such as hose synchronized o uerances, urning head when changing gaze direcion, and so on. To handle such gesures, his paper proposes a novel gesure recogniion echnique ha consiss of Wavele-analysis of head pose sequences; SVM is used as a discriminaor. This paper is organized as follows. Secion 2 overviews relaed works. Secion 3 proposes our conversaion model, and Secion 4 presens an esimaion algorihm based on he model. Secion 5 describes he experimen conduced o verify he effeciveness of our mehod. Secion 6 presens our conclusion and some discussions. 2. RELATED WORKS In recen years, conversaion scene analysis has emerged as an aracive research area [10], and wo sreams of research have been gaining aenion: he auomaic recogniion of meeing acions, and using annoaed daa o explore human mechanisms in meeings. The former sudy sream aims o realize he auomaic recogniion of meeing acions such as monologue, dialogue, discussion, noe-aking, and presenaion, from audio/visual signals. To do his, mos prior sudies employed low-level feaures such as global image moion and geomeric image primiives deeced from video, and ried o build saisical models on machine learning echniques ha linked he signals o meeing acions. So far, a number of models have been proposed based on he HMM(Hidden Markov Model) [19], layered-hmm [27], coupled-hmm [2], and dynamic Bayesian neworks [9]. However, he explici measuremen and modeling of human behaviors in meeings remain as open problems. On he oher hand, anoher line of sudies, moivaed by he desire o explore human mechanisms in meeings, akes he psychological poin of view. In pioneering work, a group led by Quek focused on he floor conrol funcion; hey used a mulimodal meeing corpus [6], creaed by a human exper, for analyzing speech paerns such as inerrupion and delegaion of he floor [5]. So far, heir research has revealed ha mulimodal cues such as gaze, gesure, speech, have imporan roles in floor conrol. However, full-auomaic daa annoaion remains a fuure work. Given he curren saus of he field, we have been rying o bridge he gap beween he wo sreams of sudies menioned above: auomaic undersanding of meeing scenes from direc measuremens of nonverbal behaviors, and explicily modeling he relaionship beween individual behaviors and he saus of conversaion, wih he help of psychological findings.

3 Top Regime Layer Middle Ineracion Layer Hidden gaze paerns X S conversaion regimes E ineracion srucures Head direcions Gaze paerns Regimes S 1 H 1 X 1 X X +1 S H H +1 S +1 H +2 X S A (a) Boom Behavior Layer Observable (b) H head direcions G U head gesures uerances Figure 1: Conversaion model, (a) concep, (b) graphical model. (a) Speaker Addressee Addressee Addressee Side-paricipan Speaker/Adressee Speaker/Addressee (b) Side-paricipan Figure 2: Gaze paerns and ineracion paerns in (a)convergence regime and (b)dyad-link regime. 3. CONVERSATION MODEL 3.1 Model Concep This sudy focuses on group conversaions held in a closed environmen; he number of paricipans is N 3. As shown in Fig. 1(a), we assume ha conversaions have hierarchical srucures. Furher, we hypohesize ha a high-level process, called a conversaion regime, governs how people inerac wih each oher on he ineracion layer, and he ineracion process governs how each individual behaves on he behavior layer. In [22], auhors have proposed conversaion regimes as a global saus of conversaions, which correspond o addressing/lisening paerns such as monologue and dialogue. We focused on he eye gaze of each paricipan as an ineracive behavior. Also, we hypohesized ha he gaze paern of paricipans can indicae he srucure of conversaions, and proposed hree regimes: Convergence, Dyad-Link, and Divergence. The regime called Convergence (also called monologue) corresponds o he siuaion ha a speaker addresses all ohers, and he addressees lisen o he speaker. This regime is indicaed by he convergence of he addressees gaze ono he speaker, as shown in Fig. 2(a). Second, he regime called Dyad-Link (also called dialogue) corresponds o he siuaion ha wo people are alking o each oher, and he ohers are side-paricipans. This regime is indicaed by muual gaze beween he wo, as shown in Fig. 2(b). Third, he regime called Divergence (also called ohers) corresponds o siuaions oher han convergence and dyad-link regimes; every one is silen and/or no organized conversaion exiss. The gaze paern does no exhibi any organized paern. This paper exends he auhors framework in [22] o infer cross-modal nonverbal ineracions. Of paricular noe, his paper newly inroduces head gesures (nod, shake, il) and uerances as he ineracing nonverbal cues, and ries o find he causal relaionship amongs hem hem. Fig. 1(b) provides a graphical represenaion of our new conver- Ineracion srucures E' E'' E''' B' B'' Behaviors B Figure 3: Temporal represenaion of model. P1 P2 P3 E B''' Nei(E)={B 1,1, B 3,2 } B 1,1 B 1,2 B 3,1 B 3,2 B 3,3 B 2,2 Figure 4: Ineracion nework represening causal relaionship deermined from ineracion srucures. saion model, consising of hidden variables (conversaion regimes S, gaze paerns X, and ineracion srucures E) and observable variables (head direcions H, head gesures G, and uerances U). This model assumes ha he ineracion srucures are probabilisically generaed depending on he conversaion regimes and gaze paerns and he head gesures and uerances are probabilisically generaed by he ineracion srucures. To esablish he link beween conversaion regimes and he ineracion srucures, we hypohesize ha he paern of ineracions resemble he gaze paerns as shown in Fig. 2. For example, in regime convergence, he addressees ofen respond o he speaker wih nods as he sign of lisening, someimes accompanied wih shor uerances like hmm and yeah. These addressing/back-channel responses are basic primiives of ineracions in conversaion. I is assumed ha he direcion of he responses follows he same paern noed in he case of gaze, as shown in Fig. 2(a). On he oher hand, here is anoher ype of ineracion, called quesion-and-answer; i is an ineracion for exchanging messages beween wo persons. We assume ha i ofen appears in he regime dyad-link, and i akes he form depiced in Fig. 2(b). 3.2 Model Srucure Fig. 3 provides a graphical represenaion of he proposed conversaion model wih emporal informaion. The upper par (A) is he same as he proposed in [22], and he lower par (B) is he novel exension made in his paper. In he upper par, we denoe he sequence of regime saes as S = {S 1, S 2,, S T }; we arge he discree emporal inerval [1, T ]. The regime sae S a ime akes one of N convergence regimes, N C 2 dyad-link regimes and he divergence regime. The regime changes are considered o follow a discree Markov process. The sequence of gaze paerns is represened as X = {X 1, X 2,, X T }, where

4 he gaze paern X a ime is composed of he se of gaze direcions of all paricipans, X = {X i, } N i=1; i akes N discree direcions: look a oher s face or aver from all of hem. The sequences of head direcions is denoed as H = {H 1, H 2, H T } T =1, and he head direcion of each paricipans, H = {h i, } N i=1, is observed as coninuous azimuh (horizonal) angle. The lower par (B) in Fig. 3 represens he relaionship beween he ineracion srucures E and behaviors consising of emporal inervals of head gesures G and uerances U. In his paper, a head-gesure deecor deecs he presence/absence of head gesures a each ime sep, and a voice aciviy deecor deecs ha of uerances a each ime sep (See also 5.4 and 5.3). From he deecion resuls, he emporal inervals of coninuous head gesures are exraced; here we define a emporal inerval G G as ha is bounded by he beginning/ending ime sep a which head moion sars/ends. The same definiion can be applied o he uerance inerval U U. Here, we denoe a se of gesure and uerance inervals as B = G U; hereafer we refer o behavior B B unless i is necessary o disinguish beween gesures and uerances. Noe his paper arges nod, shake, il, and reas hem as he same behavior, because i mainly focuses on he emporal aspec of gesures, no he meaning of gesures. This paper assumes ha each behavior is riggered by anoher s behavior, or appears sponaneously. The ineracion srucures E deermine he causal relaionship among behaviors. The proposed model assumes ha he ineracion is probabilisically generaed based on he saes of regimes and gaze paerns. Fig. 4, which corresponds o he lower par of Fig. 3, visualizes he causal relaionship among behaviors assigned by he ineracion srucures; we call i he ineracion nework. In Fig. 4, boxes indicae behaviors and an arrow from a box indicaes he reacion arge ha riggered he behavior. A box wihou any ougoing arrow indicaes sponaneous behavior. The ineracion nework can be considered as a Bayesian nework wih inverse arrow direcions. The ineracion srucures consis of a se of elemens, called acion uni, E E, which corresponds o each behavior inerval, where gesure and uerance inervals are linked if heir beginning imes are similar. Each acion uni, E, has aribues including sponaneous-reacive class and a reacion arge. The sponaneous-reacive class indicaes ha he acion is sponaneous (denoed E ø) or a reacion o anoher s behavior (denoed E B, B Nei(E) B), where B denoes he reacion arge ha riggered acion E. Nei(E) denoes a se of ohers behaviors ha occur in he emporal viciniy of E, as shown in Fig. 4. This paper assumes here is only one reacion arge for each acion uni, a mos. Here, reacive behavior is defined as he direc and immediae response o ohers behavior. Sponaneous behavior is one ha is no reacive behavior. Typical examples of sponaneous behaviors are addressing and quesioning behavior of speakers. On he oher hand, ypical reacive behaviors include he hearers back-channel responses and answers o quesions posed. 3.3 Model Definiion Based on he condiional dependency depiced in Fig. 1(b), he join probabiliy disribuion of he model is de- Table 1: Sponaneous probabiliies Regime Monologue Dialogue Ohers Role speaker addressee dyad ohers Uerance η SMSU η SMAU η SDDU η SDSU η SOU (0.95) (0.06) (0.00) (0.00) (0.78) Gesure η SMSG η SMAG η SDDG η SDSG η SOG (0.93) (0.05) (0.50) (0.00) (0.71) fined as Table 2: Direcional probabiliies Monologue Addressee o speaker η DA (0.88) Dialogue One in dyad o anoher η DD (1.00) Look a response arge η DG (0.88) p(x, S, E, H, U, G, ϕ) F H (H X, ϕ) F B (U, G E, ϕ) P (E X, S, ϕ) P (X S, ϕ) P (S ϕ) p(ϕ), (1) where ϕ denoes he se of all model parameers. Eq.(1) is composed of he produc of he likelihood funcions for observed daa and he prior disribuion of all hidden variables. This paper employs he same definiions used in [22] for he priors of regimes P (S ϕ) and gaze paerns P (X S, ϕ), and for he likelihood for head direcions F H ( ), which assumes ha head direcion follows a Gaussian disribuion for any given gaze direcion. The prior p(ϕ) of model parameers is defined as he produc of ha of each of he parameers; his assumes he independency of individual parameers. In Eq. (1), P (E X, S, ϕ) represens he probabiliy ha ineracions E occur in given regimes S and gaze paerns X. This paper decomposes his ino he produc of he probabiliies of each acion uni E E, as wrien in P (E X, S, ϕ) = Q E EP (E X, S) QB Nei(E) ψ(e, B) (2) where he firs erm, P (E X, S), represens he probabiliy of he acion uni sae and he second erm is a penaly o suppress he case of wo behaviors responding o each oher (ψ(e, B) = 0), oherwise ψ(e, B) = 1. In Eq. (2), P (E X, S, ϕ) represens he probabiliy ha acion uni E is in response o anoher s behavior or is sponaneous. To define his, his paper inroduces sponaneous probabiliies and direcional probabiliies, and defines P (E X, S, ϕ) as heir produc. The former is he probabiliy ha an acion is sponaneous behavior. This paper assumes ha i depends on he regime and role of he person making acion uni E, as summarized in Table 1; each probabiliy is a hidden variable o be esimaed. In Table 1, values inside he parenheses are examples obained from manually-annoaed daa (C1) (See 5.1); hey indicae ha for a monologue, speaker s behaviors is far more sponaneous han ha of he addressees. On he oher hand, he direcional probabiliy represens he probabiliy ha acion uni E is in response o he arge person; i is assumed o depend on gaze and regime. Table 2 summarizes he direcional probabiliies. As menioned earlier, we assume ha addressees ofen respond o he speaker in regime convergence, and respond o each oher in dyad-link regime. Also, he reacion arge ends o be a gazee, because people end o look a he arge when hey respond. The second componen, which is newly defined for he join densiy in Eq. (1), is he likelihood of ineracive behavior F B(U, G E, ϕ) for given ineracion srucures E in Eq. (1). This paper defines his componen as he produc of he likelihood funcion of each behavior inerval, f B (B E),

5 Sponaneous behaviors duraion pause Sender Receiver (Responder) Reacive behaviors rigger ime reacion ime duraion Figure 5: Duraion, pause lengh, and reacion ime. Uerance Gesure Duraion S,D U Sponaneous Pause lengh S,P f U f ( d ) ( p ) d p S,D G S,P f G f ( d ) ( p ) d p The uni of horizonal axis is second. Hisogram is obained from mannual annoaion for daa C1. E Reacive Duraion R,D f U ( d ) d R,D f G ( d ) d Reacion ime R,R f ( r ) r Figure 6: Weibull disribuions for duraion, pause lengh, and response ime. in each acion uni, by assuming he condiional independency of each behavior for given ineracion srucures E. This likelihood calculaion is based on he emporal disribuions of duraion, pause, and reacion ime of gesures and uerances (See Fig. 5). Fig. 6 summarizes Weibull models employed o represen he disribuions. The model in Fig. 6 indicaes he endency observed in he iming of behaviors; e.g. sponaneous uerances are longer han reacive ones, bu sponaneous gesures end o be shorer han reacive ones. Using hese models, he likelihood f B (B E) can be defined separaely for each case as in f B(B E) = j f S,D B ( d) f S,P B (p) if E is sponaneous, B ( d) f R,R ( r ) if E is reacive, f R,D (3) where d, p, and r denoe he duraion, pause lengh, and reacion ime of behavior B, respecively. Noe ha he Weibull parameers are hidden variables o be esimaed. If he arge behavior is a gesure, he rigger ime is se o he beginning of he inerval. Oherwise, rigger ime is considered o be a hidden random variable, which follows he probabiliy disribuion of response ime. 4. ESTIMATION ALGORITHM Based on he model defined above, he problem is o esimae he ineracion E, he regime S, gaze paern X, and model parameers ϕ from measuremens Z = {H, G, U}. We employ a Bayesian approach [3] o esimae he join poserior disribuion p(e, S, X, ϕ Z) of all unknown variables from he given measuremens. To esimae he join poserior, his sudy uses he Markov chain Mone Carlo mehod called he Gibbs sampler [11], which has an advanage when dealing wih complex models. The Gibbs sampler repeaedly generaes random samples from he fullcondiional poserior disribuions of each unknown variable, which consiue a Markov chain whose invarian disribuion equals he desired join poserior. The full-condiional disribuion is he disribuion of a variable when oher variables are given. From he random samples afer he Markov chain has converged, he maximum a poserior esimae is calculaed for discree variables, and he minimum meansquared error esimaes are calculaed for coninuous variables. Noe his esimaion algorihm is a form of unsupervised learning, which does no need raining daa o obain he model parameers ϕ. Insead, we need o experienially deermine hyper-parameers of he prior disribuions of he parameers ϕ. The full-condiional disribuion of he ineracion srucure of an acion uni can be derived from he join disribuion in Eq. (1), and is wrien as P (E B S, X, E \ E, ϕ, Z) (4) Q B B(E) f B(B E) P (E B X, S) ψ(e, B), where B(E) denoes a se of behaviors included in acion uni E. According o Eq. (4), he reacion arge (also sponaneous-reacive class) B Nei(E) ø of each acion uni E is sampled. The rigger ime for each of candidae uerance inervals in neighborhood Nei(E) is sampled from he reacion ime disribuion. The full condiional of each sponaneous acion and direcional probabiliy becomes a Bea disribuion when assuming Bea priors, and hese probabiliies are sampled from he corresponding Bea full-condiional poseriors. For Weibull models, we assume Gamma priors for he Weibull s shape and scale parameers, and runcaed uniform prior for he locaion parameer; hese priors are used o represen a priori knowledge abou he iming disribuions. For oher variables and parameers, his paper follows he procedures described in [22]. 5. EXPERIMENTS 5.1 Daa This paper arges 4-person group conversaions. The paricipans were four women wihin he same age bracke; hey were seaed as shown in Figure 7. They were insruced o hold a discussion and ry o reach a conclusion as a group for a given discussion opic wihin five minues. The discussion opics were Should ax breaks be given o full-ime housewives, or no? and Is marriage and romanic love he same or differen? ; hereafer he recorded conversaions are referred C1 and C2, respecively. The head direcions were measured a 30 Hz using magneic-based sensors (POL- HEMUS Fasrak TM ), which were aached o heir heads on hair bands. Audio daa were recorded by lapel microphones aached o each paricipan. Also, video sequences, whole sho (Figure 7(b)) and bus shos (Figure 9(a)), were recorded a 30 frames/sec. These daa were synchronized a he uni-ime sep of 1/30 sec. The lenghs of daa were and 9100 frames (5.6 and 5.1 min) for C1 and C2, respecively. 5.2 Manual Annoaion The raw daa was manually annoaed o permi a quaniaive evaluaion. Annoaion was mainly performed by one female in her 20s. Fig. 8(a) shows a par of he

6 Person 4 Person 3 Y O 1m Person 1 Person 2 (a) (b) 1 X 2 3 Figure 7: Overview of scene. (a)plan view of paricipans locaion, (b)whole view of paricipans. manual annoaion ( 20 sec.). For each person, P1 P4, he hick bands shows he uerance inervals, manually deeced based on IPU(Iner Pausal Uni) of 0.3 sec. The line segmens beneah he uerances indicae gesure inervals, manually deeced by visual inspecion of he video. The arge gesures were nodding, shaking, and iling. Oher head moions were excluded. Nex, for each uerance inerval and each gesure inerval, sponaneous-reacive class and he reacion arge were deermined. Also, rigger ime was given for each reacive behavior. In Fig. 8(a), small circles represen he beginning of acion unis and he arrows from hem indicae he reacion arges, while circles wih no arrow indicae sponaneous acions. The posiion of arrow s head indicaes he rigger ime sep. Each vericallyelongaed ellipse indicaes an inegraed acion uni consising of an uerance and a gesure. Also, he ground ruh of gaze direcions and regimes was manually creaed by waching he video sequences. 5.3 Voice Aciviy Deecion To auomaically deec uerance inervals from he audio signals capured by lapel microphones, his paper employed a voice aciviy deecion (VAD) mehod [24] ha can robusly deec each person s uerance separaely by clusering each person s signal in he ime-frequency domain. The deeced uerances oupu by he VAD mehod were reformed by filling shor-erm gaps o saisfy he IPU crieria, and eliminaing very shor inervals as noise. Fig. 8(b) shows some of he uerance inervals so deeced. Compared o he manual deecion resuls in Fig. 8(a), he auomaic resul includes deecion lapses due o whisper-like uerances, and over-deecion due o breahing, rusling, and coughing. Table 3(a) shows he accuracy of uerance deecion in erms of precision, recall, and hi raio. Here, he hi raio is he raio of correc frames o all frames. Table 3(a) confirms ha he auomaic voice deecion mehod used was highly accurae and robus even hough he amoun of cross-alk was significan. 5.4 Gesure Recogniion The head gesures were deeced wih a new echnique based on discree Wavele ransform (DWT). Firs, DWT feaures are separaely calculaed for each head pose componen; he componens are azimuh (horizonal), elevaion (verical), and roll (in-plane roaion). This sudy applied he Daubechies wavele of order 10 (db10) and decomposiion scale was se o 2-4; windows size was 16. A each ime sep, we calculaed he DWT coefficiens of deails D2-D4 and final approximaion A4, and hen calculaed he maximum, minimum, mean, sandard deviaion of he wavele coefficiens in each sub-band, as he feaure vecor of gesures. These saisics were used in EEG signal analysis [13]. 4 Nex, we rained an SVM o classify he feaure vecor ino wo caegories; gesure or non-gesure, a each ime sep. This paper employed a polynomial kernel of order 5 and a sof margin crierion. Training and classificaion was done for each person in each conversaion. C1 was classified using he SVM rained wih manually deeced daa of C2, and vice versa. The oupu of he SVM was hen reformed in a manner similar o he uerance inervals o yield he final gesure inervals. Fig. 8(b) shows some of he deeced gesures. A comparison o Fig. 8(a) shows ha here are some errors; overdeecion occurs due o coninuous gesures and small head movemens. Table 3(b) shows he accuracy of gesure deecion, and indicaes ha he deecion was moderaely successful, despie he huge dynamic range of gesures, from almos invisible ones o very large ones. 5.5 Experimen Seing This paper employed he same values as used in [22], for hyper-parameers, which deermine he prior disribuions of gaze, regime, and head direcions. The Gibbs sampling ieraion was 10000, and saisics were calculaed from samples obained from he 5000h 10000h ieraions. The same parameer se were used for boh daa C1 and C Qualiaive Evaluaion Fig. 8(b) shows a par of he ineracion srucures inferred from auomaically deeced behaviors. Fig. 9 shows hree snap shos o illusrae he flow of he conversaion. In his scene, speaking urn changes over ime; P4 P2 P1. We have confirmed ha gaze paerns and regimes were successfully esimaed for his scene. Firs, P4 sared o give her opinion o ohers who lisened o P4. During his (P4 s) urn, ohers responded o P4 wih uerances and gesures. They synchronized heir responses o a break poin in P4 s discourse. Fig. 8(b) indicaes ha hese back-channel responses were successfully deermined. Nex, a he end of P4 s uerances, she asked he ohers for agreemen and ried o confirm heir aiudes; heir answers were correcly inferred. A he same ime, P2 overlaid her uerance wih he end of P4 s senence, and ook over he speaking urn. P1, P3, and P4 urned heir gaze o P2 and acknowledged her urn. Also, hey responded o her ag-quesion wih posiive answers; heir responses were successfully deermined, even hough he urn aking by P2 was abrup. P1 hen ook advanage of a momenary chance, and ook he urn. P2, P3, and P4 laughed a wha P1 said; i appeared o be a humorous phrase. These responses oward P1 were successfully deermined. Fig. 8(b) indicaes ha mos quesion/answer and addressees back-channel responses oward speakers were accuraely esimaed, and followed he changes in speaking urn. A visual inspecion of all inferred ineracions confirmed ha he inferred ineracion srucures were reasonably accurae; a few flaws were presen. 5.7 Quaniaive Evaluaion Table 4 shows he resul of he quaniaive evaluaion of ineracions. Table 4(a) shows he raio of sponaneousreacive class (correcly esimaed) o he manual annoaion. Table 4(b) shows he raio of he number of acion unis whose arge person was correcly inferred, o he number of all acion unis whose sponaneous-reacive class was

7 uerance inervals gesure inervals (a) (a) =1 =2 =1 ah =3 Tha's righ!. if here's no sysem proecing people like ha, I's gonna be like a... yes hm! ah Tha's why, indeed, people who are selfish and si in he do nohing posiion as housewives should no be given any benefis. I hink i's beer. so hmm hmmm ha ha ha ha ha ha yes ah yeah I see ha ha ha yes, yes (b) ha ha ha They cerainly are Tha's righ, isn' i Tha's righ. common, righ? They migh have parens who need care, or somehing. Say, depending on family siuaion, people, who can' go for work, Figure 8: Ineracion nework represenaion of ineracion srucures, (a)manual annoaion, (b)inference resul from auomaically deeced behaviors, (daa = C1), 1 frame = 1/30 sec., Display lengh ' 20 sec. 1 correcly deermined. Table 4(c) shows he raio of he number of acion unis in which he difference beween he esimaed rigger ime and he one from he corresponding annoaion was less han or equal o 0.3 sec.; he denominaor of his raio is he number of acion unis whose arge person was correcly inferred. Table 4(a) shows ha he accuracy of deermining sponaneous-reacive class is raher modes; miss-classificaion happens ofen, especially when he acion uni has a shor preceding pausal lengh. Table 4(b) indicaes ha arge persons were accuraely inferred, and Table 4(c) suggess ha he accuracy of idenifying he rigger ime was reasonably high. C1 yielded superior performance o C2, because C2 was a more complex conversaion, wih a lo of rapid urn changes. In general, he resuls gained from auomaically deeced behaviors were basically comparable o hose from he manually deeced ones; his verifies he effeciveness of VAD and he gesure deecion echnique described here. Despie he limied daase and annoaion, he above resuls sugges he effeciveness of he proposed mehod in analyzing nonverbal ineracions in mulipary conversaions. Fuure work includes evaluaions using a comprehensive daase ha includes various group and opics, as well as examining he consisency of manual annoaions (used as ground ruh) given by differen annoaors P2 C R4 P3 P1 P4 = C P2 P3 P1 P4 = P2 R2 P3 P1 C R1 P4 (a) =3 (b) Figure 9: Snap shos of hree ime seps, 1, 2, 3 in Fig. 8. (a)each paricipan, (b)regime esimaes and gaze paerns (solid arrows: esimaes, wide arrows: ground ruh). RiC denoes Pi s monologue regime. 6. CONCLUSION AND DISCUSSION This paper proposed a novel arge of conversaion scene analysis, he auomaic inference of ineracion paerns from paricipans nonverbal behaviors in mulipary conversaions. To ha purpose, a hierarchical probabilisic conversaion model was inroduced. Even hough his paper focused on simple ineracions condiioned on conversaion regimes, gaze paerns, and emporal srucures such as duraion,

8 Table 3: Deecion accuracy of uerance (a) and gesure (b) (a)uerance (b)head Gesure Precision Recall Hi Precision Recall Hi C C Table 4: Acuracy of esimaed ineracions: (Manual)manual annoaion, (Auo)auomaically deeced behaviors (a)spon. (b)person (c)trigger C1 (Manual) C2 (Manual) C1 (Auo) C2 (Auo) pause, and reacion imes, he proposed framework is considered o be noeworhy in ha i provides a basic mehodology for analyzing nonverbal cross-modal ineracions in face-o-face conversaions, and offers several prospecive direcions. Firs, he ineracion srucures discovered wih he proposed framework can be used as a clue for undersanding he menal/conex level aspecs of conversaions. The firs sep would be classifying he responses ino posiive and negaive ypes, which can be indicaed by head gesure classes. The proposed gesure deecor is powerful enough o disinguish various head moions such as nod, shake, and il. The problem is o esablish useful links beween moion feaures and he inner sae of he person, like he degree of agreemen. Moreover, i can provide a basic elemen for analyzing how one person s opinion can spread hroughou a human nework and how a group s concordance is formed over ime. I is also ineresing work o relae our framework o psychological/linguisic sudies such as adjacency pair analysis [25] and synchrony analysis [8]. The ineracion srucures can be a useful elemen of meeing annoaion for auomaic archiving/summarizing sysems. For example, i could provide more semanic-based rerieval capabiliy such as who had posiive/negaive on his opinion? and who s opinion was he mos influenial?. Also, he idenified ineracion srucures can be used o improve auomaic video ediing so ha viewers can more clearly undersand who responds o whom. Furhermore, i is worh considering a sysem ha can quanify communicaion skill and provide users wih feedback o improve human communicaion skill in organizaions. To realize real-ime applicaions, fuure works include he real-ime simulaneous racking of faces from low-resoluion video sequences, image-based head gesure recogniion, and voice deecion/separaion capured wih microphone arrays. Our framework can also easily incorporae oher modaliies such as prosody and facial expressions. We are currenly developing a facial expression recogniion echnique ha is robus agains head-pose changes [17]. Finally, auhors believe ha his work will conribue o opening up a new research field ha can explore various aspecs of nonverbal cross-modal ineracions in mulipary conversaions, and bridge relaed disciplines such as psychology, social linguisics, and mulimodal applicaions. 7. REFERENCES [1] M. Argyle. Bodily Communicaion 2nd ed. Rouledge, London and New York, [2] S. Basu. Conversaional Scene Analysis. Ph.D hesis, Massachuses Insiue of Technology, [3] J. M. Bernardo and A. F. M. Smih. Bayesian Theory. John Wiley & Sons, Ld., [4] M. Brand, N. Oliver, and A. Penland. Coupled hidden Markov models for complex acion recogniion. In Proc. CVPR 97, pages , [5] L. Chen, M. Harper, A. Franklin, T. R. Rose, I. Kimbara, Z. Huang, and F. Quek. A mulimodal analysis of floor conrol in meeings. In Proc. MLMI 06, [6] L. Chen, R. T. Rose, F. Parrill, X. Han, J. Tu, Z. Huang, M. Harper, D. M. F. Quek, R. Tule, and T. Huang. VACE mulimodal meeing corpus. In Proc. MLMI, [7] T. K. Choudhury. Sensing and Modeling Human Neworks. Ph.D hesis, MIT, [8] W. S. Condon and M. B. Ogson. Sound film analysis of normal and pahological behavior paerns. J. Nervous and Disease, 143: , [9] A. Dielmann and S. Renals. Dynamic Bayesian neworks for meeing srucuring. In Proc. IEEE ICASSP 04, [10] D. Gaica-Perez. Analyzing group ineracions in conversaions: a review. In Proc. IEEE In. Conf. Mulisensor Fusion and Inegraion for Inelligen Sysems 06, pages 41 46, [11] W. R. Gilks, S. Richardson, and D. J. Spiegelhaler. Markov Chain Mone Carlo in Pracice. Chapman & Hall/CRC, [12] C. Goodwin. Conversaional Organizaion: Ineracion beween Speakers and Hearers. Academic Press, [13] Ï. Güler and E. D. Übeyli. Muliclass suppor vecor machines for EEG-signals classificaion. IEEE Trans. Informaion Technology in Biomedicine, 11: , [14] J. Janssen and R. Manca. Applied Semi-Markov Processes. Springer, [15] A. Kapoor and R. W. Picard. A real-ime head nod and shake deecor. In Proc. Workshop on PUI, [16] A. Kendon. Some funcions of gaze-direcion in social ineracion. Aca Psychologica, 26:22 63, [17] S. Kumano, K. Osuka, J. Yamao, E. Maeda, and Y. Sao. Pose-invarian facial expression recogniion using variable-inensiy emplaes. In Proc. ACCV 07, [18] S. K. Maynard. Ineracional funcions of a nonverbal sign: Head movemen in japanese dyadic casual conversaion. J. Pragmaics, 11: , [19] I. McCowan, D. Gaica-Perez, S. Bengio, G. Lahoud, M. Barnard, and D. Zhang. Auomaic analysis of mulimodal group acions in meeings,. IEEE Trans. PAMI, 27(3), [20] L.-P. Morency, C. Sidner, C. Lee, and T. Darrell. Conexual recogniion of head gesures. In Proc. ICMI 05, pages 18 24, [21] D. N. P. Murhy, M. Xie, and R. Jiang. Weibull Models. John Wiley & Sons, Ld., [22] K. Osuka, Y. Takemae, J. Yamao, and H. Murase. A probabilisic inference of mulipary-conversaion srucure based on Markov-swiching models of gaze paerns, head direcions, and uerances. In Proc. ICMI 05, [23] K. Osuka, J. Yamao, and H. Murase. Conversaion scene analysis wih dynamic Bayesian nework based on visual head racking. In Proc. ICME 06, [24] H. Sawada, S. Araki, K. Osuka, M. Fujimoo, and K. Ishizuka. Voice aciviy deecion for muliple speakers wih muliple pin microhpones. In 2007 Spring Meeing, Acousical Sociey of Japan, [25] E. A. Schegloff and H. Sacks. Opening up closings. Semioica, 8: , [26] R. Virginia and M. James. Nonverbal behavior in inerpersonal relaions 5h Ed. Allyn & Bacon, [27] D. Zhang, D. Gaica-Perez, S. Bengio, I. McCowan, and G. Lahoud. Modeling individual and group acions in meeings: A wo-layers HMM framework. In Proc. 2nd. IEEE Workshop on Even Mining, 2004.

Neural Network Model of the Backpropagation Algorithm

Neural Network Model of the Backpropagation Algorithm Neural Nework Model of he Backpropagaion Algorihm Rudolf Jakša Deparmen of Cyberneics and Arificial Inelligence Technical Universiy of Košice Lená 9, 4 Košice Slovakia jaksa@neuron.uke.sk Miroslav Karák

More information

More Accurate Question Answering on Freebase

More Accurate Question Answering on Freebase More Accurae Quesion Answering on Freebase Hannah Bas, Elmar Haussmann Deparmen of Compuer Science Universiy of Freiburg 79110 Freiburg, Germany {bas, haussmann}@informaik.uni-freiburg.de ABSTRACT Real-world

More information

1 Language universals

1 Language universals AS LX 500 Topics: Language Uniersals Fall 2010, Sepember 21 4a. Anisymmery 1 Language uniersals Subjec-erb agreemen and order Bach (1971) discusses wh-quesions across SO and SO languages, hypohesizing:...

More information

Fast Multi-task Learning for Query Spelling Correction

Fast Multi-task Learning for Query Spelling Correction Fas Muli-ask Learning for Query Spelling Correcion Xu Sun Dep. of Saisical Science Cornell Universiy Ihaca, NY 14853 xusun@cornell.edu Anshumali Shrivasava Dep. of Compuer Science Cornell Universiy Ihaca,

More information

An Effiecient Approach for Resource Auto-Scaling in Cloud Environments

An Effiecient Approach for Resource Auto-Scaling in Cloud Environments Inernaional Journal of Elecrical and Compuer Engineering (IJECE) Vol. 6, No. 5, Ocober 2016, pp. 2415~2424 ISSN: 2088-8708, DOI: 10.11591/ijece.v6i5.10639 2415 An Effiecien Approach for Resource Auo-Scaling

More information

Channel Mapping using Bidirectional Long Short-Term Memory for Dereverberation in Hands-Free Voice Controlled Devices

Channel Mapping using Bidirectional Long Short-Term Memory for Dereverberation in Hands-Free Voice Controlled Devices Z. Zhang e al.: Channel Mapping using Bidirecional Long Shor-Term Memory for Dereverberaion in Hands-Free Voice Conrolled Devices 525 Channel Mapping using Bidirecional Long Shor-Term Memory for Dereverberaion

More information

Information Propagation for informing Special Population Subgroups about New Ground Transportation Services at Airports

Information Propagation for informing Special Population Subgroups about New Ground Transportation Services at Airports Downloaded from ascelibrary.org by Basil Sephanis on 07/13/16. Copyrigh ASCE. For personal use only; all righs reserved. Informaion Propagaion for informing Special Populaion Subgroups abou New Ground

More information

MyLab & Mastering Business

MyLab & Mastering Business MyLab & Masering Business Efficacy Repor 2013 MyLab & Masering: Business Efficacy Repor 2013 Edied by Michelle D. Speckler 2013 Pearson MyAccouningLab, MyEconLab, MyFinanceLab, MyMarkeingLab, and MyOMLab

More information

Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard

Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard Multi-modal Sensing and Analysis of Poster Conversations toward Smart Posterboard Tatsuya Kawahara Kyoto University, Academic Center for Computing and Media Studies Sakyo-ku, Kyoto 606-8501, Japan http://www.ar.media.kyoto-u.ac.jp/crest/

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

Human Emotion Recognition From Speech

Human Emotion Recognition From Speech RESEARCH ARTICLE OPEN ACCESS Human Emotion Recognition From Speech Miss. Aparna P. Wanare*, Prof. Shankar N. Dandare *(Department of Electronics & Telecommunication Engineering, Sant Gadge Baba Amravati

More information

Dublin City Schools Mathematics Graded Course of Study GRADE 4

Dublin City Schools Mathematics Graded Course of Study GRADE 4 I. Content Standard: Number, Number Sense and Operations Standard Students demonstrate number sense, including an understanding of number systems and reasonable estimates using paper and pencil, technology-supported

More information

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition Hua Zhang, Yun Tang, Wenju Liu and Bo Xu National Laboratory of Pattern Recognition Institute of Automation, Chinese

More information

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration

Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration INTERSPEECH 2013 Semi-Supervised GMM and DNN Acoustic Model Training with Multi-system Combination and Confidence Re-calibration Yan Huang, Dong Yu, Yifan Gong, and Chaojun Liu Microsoft Corporation, One

More information

Extending Place Value with Whole Numbers to 1,000,000

Extending Place Value with Whole Numbers to 1,000,000 Grade 4 Mathematics, Quarter 1, Unit 1.1 Extending Place Value with Whole Numbers to 1,000,000 Overview Number of Instructional Days: 10 (1 day = 45 minutes) Content to Be Learned Recognize that a digit

More information

Reducing Features to Improve Bug Prediction

Reducing Features to Improve Bug Prediction Reducing Features to Improve Bug Prediction Shivkumar Shivaji, E. James Whitehead, Jr., Ram Akella University of California Santa Cruz {shiv,ejw,ram}@soe.ucsc.edu Sunghun Kim Hong Kong University of Science

More information

Modeling Dialogue Building Highly Responsive Conversational Agents

Modeling Dialogue Building Highly Responsive Conversational Agents Modeling Dialogue Building Highly Responsive Conversational Agents ESSLLI 2016 David Schlangen, Stefan Kopp with Sören Klett CITEC // Bielefeld University Who we are Stefan Kopp, Professor for Computer

More information

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method

Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Malicious User Suppression for Cooperative Spectrum Sensing in Cognitive Radio Networks using Dixon s Outlier Detection Method Sanket S. Kalamkar and Adrish Banerjee Department of Electrical Engineering

More information

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction

Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction INTERSPEECH 2015 Robust Speech Recognition using DNN-HMM Acoustic Model Combining Noise-aware training with Spectral Subtraction Akihiro Abe, Kazumasa Yamamoto, Seiichi Nakagawa Department of Computer

More information

Word Segmentation of Off-line Handwritten Documents

Word Segmentation of Off-line Handwritten Documents Word Segmentation of Off-line Handwritten Documents Chen Huang and Sargur N. Srihari {chuang5, srihari}@cedar.buffalo.edu Center of Excellence for Document Analysis and Recognition (CEDAR), Department

More information

A Case-Based Approach To Imitation Learning in Robotic Agents

A Case-Based Approach To Imitation Learning in Robotic Agents A Case-Based Approach To Imitation Learning in Robotic Agents Tesca Fitzgerald, Ashok Goel School of Interactive Computing Georgia Institute of Technology, Atlanta, GA 30332, USA {tesca.fitzgerald,goel}@cc.gatech.edu

More information

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh The Effect of Discourse Markers on the Speaking Production of EFL Students Iman Moradimanesh Abstract The research aimed at investigating the relationship between discourse markers (DMs) and a special

More information

A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique

A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique Hiromi Ishizaki 1, Susan C. Herring 2, Yasuhiro Takishima 1 1 KDDI R&D Laboratories, Inc. 2 Indiana University

More information

Eyebrows in French talk-in-interaction

Eyebrows in French talk-in-interaction Eyebrows in French talk-in-interaction Aurélie Goujon 1, Roxane Bertrand 1, Marion Tellier 1 1 Aix Marseille Université, CNRS, LPL UMR 7309, 13100, Aix-en-Provence, France Goujon.aurelie@gmail.com Roxane.bertrand@lpl-aix.fr

More information

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION

ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION Mitchell McLaren 1, Yun Lei 1, Luciana Ferrer 2 1 Speech Technology and Research Laboratory, SRI International, California, USA 2 Departamento

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass

BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION. Han Shu, I. Lee Hetherington, and James Glass BAUM-WELCH TRAINING FOR SEGMENT-BASED SPEECH RECOGNITION Han Shu, I. Lee Hetherington, and James Glass Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology Cambridge,

More information

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Koshi Odagiri 1, and Yoichi Muraoka 1 1 Graduate School of Fundamental/Computer Science and Engineering, Waseda University,

More information

A Model of Knower-Level Behavior in Number Concept Development

A Model of Knower-Level Behavior in Number Concept Development Cognitive Science 34 (2010) 51 67 Copyright Ó 2009 Cognitive Science Society, Inc. All rights reserved. ISSN: 0364-0213 print / 1551-6709 online DOI: 10.1111/j.1551-6709.2009.01063.x A Model of Knower-Level

More information

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq 835 Different Requirements Gathering Techniques and Issues Javaria Mushtaq Abstract- Project management is now becoming a very important part of our software industries. To handle projects with success

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS Arizona s English Language Arts Standards 11-12th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS 11 th -12 th Grade Overview Arizona s English Language Arts Standards work together

More information

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation SLSP-2016 October 11-12 Natalia Tomashenko 1,2,3 natalia.tomashenko@univ-lemans.fr Yuri Khokhlov 3 khokhlov@speechpro.com Yannick

More information

Linking Task: Identifying authors and book titles in verbose queries

Linking Task: Identifying authors and book titles in verbose queries Linking Task: Identifying authors and book titles in verbose queries Anaïs Ollagnier, Sébastien Fournier, and Patrice Bellot Aix-Marseille University, CNRS, ENSAM, University of Toulon, LSIS UMR 7296,

More information

Lecture 1: Machine Learning Basics

Lecture 1: Machine Learning Basics 1/69 Lecture 1: Machine Learning Basics Ali Harakeh University of Waterloo WAVE Lab ali.harakeh@uwaterloo.ca May 1, 2017 2/69 Overview 1 Learning Algorithms 2 Capacity, Overfitting, and Underfitting 3

More information

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION L I S T E N I N G Individual Component Checklist for use with ONE task ENGLISH VERSION INTRODUCTION This checklist has been designed for use as a practical tool for describing ONE TASK in a test of listening.

More information

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT

INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT INVESTIGATION OF UNSUPERVISED ADAPTATION OF DNN ACOUSTIC MODELS WITH FILTER BANK INPUT Takuya Yoshioka,, Anton Ragni, Mark J. F. Gales Cambridge University Engineering Department, Cambridge, UK NTT Communication

More information

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE

Course Outline. Course Grading. Where to go for help. Academic Integrity. EE-589 Introduction to Neural Networks NN 1 EE EE-589 Introduction to Neural Assistant Prof. Dr. Turgay IBRIKCI Room # 305 (322) 338 6868 / 139 Wensdays 9:00-12:00 Course Outline The course is divided in two parts: theory and practice. 1. Theory covers

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

WHEN THERE IS A mismatch between the acoustic

WHEN THERE IS A mismatch between the acoustic 808 IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 14, NO. 3, MAY 2006 Optimization of Temporal Filters for Constructing Robust Features in Speech Recognition Jeih-Weih Hung, Member,

More information

Using EEG to Improve Massive Open Online Courses Feedback Interaction

Using EEG to Improve Massive Open Online Courses Feedback Interaction Using EEG to Improve Massive Open Online Courses Feedback Interaction Haohan Wang, Yiwei Li, Xiaobo Hu, Yucong Yang, Zhu Meng, Kai-min Chang Language Technologies Institute School of Computer Science Carnegie

More information

Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed.

Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed. Teachers: Use this checklist periodically to keep track of the progress indicators that your learners have displayed. Speaking Standard Language Aspect: Purpose and Context Benchmark S1.1 To exit this

More information

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Proceedings of 28 ISFA 28 International Symposium on Flexible Automation Atlanta, GA, USA June 23-26, 28 ISFA28U_12 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Amit Gil, Helman Stern, Yael Edan, and

More information

ACOUSTIC EVENT DETECTION IN REAL LIFE RECORDINGS

ACOUSTIC EVENT DETECTION IN REAL LIFE RECORDINGS ACOUSTIC EVENT DETECTION IN REAL LIFE RECORDINGS Annamaria Mesaros 1, Toni Heittola 1, Antti Eronen 2, Tuomas Virtanen 1 1 Department of Signal Processing Tampere University of Technology Korkeakoulunkatu

More information

Modeling function word errors in DNN-HMM based LVCSR systems

Modeling function word errors in DNN-HMM based LVCSR systems Modeling function word errors in DNN-HMM based LVCSR systems Melvin Jose Johnson Premkumar, Ankur Bapna and Sree Avinash Parchuri Department of Computer Science Department of Electrical Engineering Stanford

More information

Learning Methods for Fuzzy Systems

Learning Methods for Fuzzy Systems Learning Methods for Fuzzy Systems Rudolf Kruse and Andreas Nürnberger Department of Computer Science, University of Magdeburg Universitätsplatz, D-396 Magdeburg, Germany Phone : +49.39.67.876, Fax : +49.39.67.8

More information

Dialog Act Classification Using N-Gram Algorithms

Dialog Act Classification Using N-Gram Algorithms Dialog Act Classification Using N-Gram Algorithms Max Louwerse and Scott Crossley Institute for Intelligent Systems University of Memphis {max, scrossley } @ mail.psyc.memphis.edu Abstract Speech act classification

More information

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012

International Journal of Computational Intelligence and Informatics, Vol. 1 : No. 4, January - March 2012 Text-independent Mono and Cross-lingual Speaker Identification with the Constraint of Limited Data Nagaraja B G and H S Jayanna Department of Information Science and Engineering Siddaganga Institute of

More information

On the Combined Behavior of Autonomous Resource Management Agents

On the Combined Behavior of Autonomous Resource Management Agents On the Combined Behavior of Autonomous Resource Management Agents Siri Fagernes 1 and Alva L. Couch 2 1 Faculty of Engineering Oslo University College Oslo, Norway siri.fagernes@iu.hio.no 2 Computer Science

More information

Eye Movements in Speech Technologies: an overview of current research

Eye Movements in Speech Technologies: an overview of current research Eye Movements in Speech Technologies: an overview of current research Mattias Nilsson Department of linguistics and Philology, Uppsala University Box 635, SE-751 26 Uppsala, Sweden Graduate School of Language

More information

REVIEW OF CONNECTED SPEECH

REVIEW OF CONNECTED SPEECH Language Learning & Technology http://llt.msu.edu/vol8num1/review2/ January 2004, Volume 8, Number 1 pp. 24-28 REVIEW OF CONNECTED SPEECH Title Connected Speech (North American English), 2000 Platform

More information

Linking the Ohio State Assessments to NWEA MAP Growth Tests *

Linking the Ohio State Assessments to NWEA MAP Growth Tests * Linking the Ohio State Assessments to NWEA MAP Growth Tests * *As of June 2017 Measures of Academic Progress (MAP ) is known as MAP Growth. August 2016 Introduction Northwest Evaluation Association (NWEA

More information

Grammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs

Grammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs Grammar Lesson Plan: Yes/No Questions with No Overt Auxiliary Verbs DIALOGUE: Hi Armando. Did you get a new job? No, not yet. Are you still looking? Yes, I am. Have you had any interviews? Yes. At the

More information

5.1 Sound & Light Unit Overview

5.1 Sound & Light Unit Overview 5.1 Sound & Light Unit Overview Enduring Understanding: Sound and light are forms of energy that travel and interact with objects in various ways. Essential Question: How is sound energy transmitted, absorbed,

More information

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,

More information

Annotation and Taxonomy of Gestures in Lecture Videos

Annotation and Taxonomy of Gestures in Lecture Videos Annotation and Taxonomy of Gestures in Lecture Videos John R. Zhang Kuangye Guo Cipta Herwana John R. Kender Columbia University New York, NY 10027, USA {jrzhang@cs., kg2372@, cjh2148@, jrk@cs.}columbia.edu

More information

Lecturing Module

Lecturing Module Lecturing: What, why and when www.facultydevelopment.ca Lecturing Module What is lecturing? Lecturing is the most common and established method of teaching at universities around the world. The traditional

More information

Corpus Linguistics (L615)

Corpus Linguistics (L615) (L615) Basics of Markus Dickinson Department of, Indiana University Spring 2013 1 / 23 : the extent to which a sample includes the full range of variability in a population distinguishes corpora from archives

More information

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation Vivek Kumar Rangarajan Sridhar, John Chen, Srinivas Bangalore, Alistair Conkie AT&T abs - Research 180 Park Avenue, Florham Park,

More information

DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES

DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES DEVELOPMENT OF AN INTELLIGENT MAINTENANCE SYSTEM FOR ELECTRONIC VALVES Luiz Fernando Gonçalves, luizfg@ece.ufrgs.br Marcelo Soares Lubaszewski, luba@ece.ufrgs.br Carlos Eduardo Pereira, cpereira@ece.ufrgs.br

More information

Course Law Enforcement II. Unit I Careers in Law Enforcement

Course Law Enforcement II. Unit I Careers in Law Enforcement Course Law Enforcement II Unit I Careers in Law Enforcement Essential Question How does communication affect the role of the public safety professional? TEKS 130.294(c) (1)(A)(B)(C) Prior Student Learning

More information

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning

Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning Distributed Weather Net: Wireless Sensor Network Supported Inquiry-Based Learning Ben Chang, Department of E-Learning Design and Management, National Chiayi University, 85 Wenlong, Mingsuin, Chiayi County

More information

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models Stephan Gouws and GJ van Rooyen MIH Medialab, Stellenbosch University SOUTH AFRICA {stephan,gvrooyen}@ml.sun.ac.za

More information

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach

Data Integration through Clustering and Finding Statistical Relations - Validation of Approach Data Integration through Clustering and Finding Statistical Relations - Validation of Approach Marek Jaszuk, Teresa Mroczek, and Barbara Fryc University of Information Technology and Management, ul. Sucharskiego

More information

On-Line Data Analytics

On-Line Data Analytics International Journal of Computer Applications in Engineering Sciences [VOL I, ISSUE III, SEPTEMBER 2011] [ISSN: 2231-4946] On-Line Data Analytics Yugandhar Vemulapalli #, Devarapalli Raghu *, Raja Jacob

More information

Common Core Exemplar for English Language Arts and Social Studies: GRADE 1

Common Core Exemplar for English Language Arts and Social Studies: GRADE 1 The Common Core State Standards and the Social Studies: Preparing Young Students for College, Career, and Citizenship Common Core Exemplar for English Language Arts and Social Studies: Why We Need Rules

More information

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification

Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Class-Discriminative Weighted Distortion Measure for VQ-Based Speaker Identification Tomi Kinnunen and Ismo Kärkkäinen University of Joensuu, Department of Computer Science, P.O. Box 111, 80101 JOENSUU,

More information

Axiom 2013 Team Description Paper

Axiom 2013 Team Description Paper Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association

More information

Trend Survey on Japanese Natural Language Processing Studies over the Last Decade

Trend Survey on Japanese Natural Language Processing Studies over the Last Decade Trend Survey on Japanese Natural Language Processing Studies over the Last Decade Masaki Murata, Koji Ichii, Qing Ma,, Tamotsu Shirado, Toshiyuki Kanamaru,, and Hitoshi Isahara National Institute of Information

More information

Ohio s New Learning Standards: K-12 World Languages

Ohio s New Learning Standards: K-12 World Languages COMMUNICATION STANDARD Communication: Communicate in languages other than English, both in person and via technology. A. Interpretive Communication (Reading, Listening/Viewing) Learners comprehend the

More information

WiggleWorks Software Manual PDF0049 (PDF) Houghton Mifflin Harcourt Publishing Company

WiggleWorks Software Manual PDF0049 (PDF) Houghton Mifflin Harcourt Publishing Company WiggleWorks Software Manual PDF0049 (PDF) Houghton Mifflin Harcourt Publishing Company Table of Contents Welcome to WiggleWorks... 3 Program Materials... 3 WiggleWorks Teacher Software... 4 Logging In...

More information

Grade 6: Correlated to AGS Basic Math Skills

Grade 6: Correlated to AGS Basic Math Skills Grade 6: Correlated to AGS Basic Math Skills Grade 6: Standard 1 Number Sense Students compare and order positive and negative integers, decimals, fractions, and mixed numbers. They find multiples and

More information

Language Acquisition Chart

Language Acquisition Chart Language Acquisition Chart This chart was designed to help teachers better understand the process of second language acquisition. Please use this chart as a resource for learning more about the way people

More information

The Evolution of Random Phenomena

The Evolution of Random Phenomena The Evolution of Random Phenomena A Look at Markov Chains Glen Wang glenw@uchicago.edu Splash! Chicago: Winter Cascade 2012 Lecture 1: What is Randomness? What is randomness? Can you think of some examples

More information

Semi-Supervised Face Detection

Semi-Supervised Face Detection Semi-Supervised Face Detection Nicu Sebe, Ira Cohen 2, Thomas S. Huang 3, Theo Gevers Faculty of Science, University of Amsterdam, The Netherlands 2 HP Research Labs, USA 3 Beckman Institute, University

More information

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH

IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, VOL. 17, NO. 3, MARCH 2009 423 Adaptive Multimodal Fusion by Uncertainty Compensation With Application to Audiovisual Speech Recognition George

More information

Matching Similarity for Keyword-Based Clustering

Matching Similarity for Keyword-Based Clustering Matching Similarity for Keyword-Based Clustering Mohammad Rezaei and Pasi Fränti University of Eastern Finland {rezaei,franti}@cs.uef.fi Abstract. Semantic clustering of objects such as documents, web

More information

Australian Journal of Basic and Applied Sciences

Australian Journal of Basic and Applied Sciences AENSI Journals Australian Journal of Basic and Applied Sciences ISSN:1991-8178 Journal home page: www.ajbasweb.com Feature Selection Technique Using Principal Component Analysis For Improving Fuzzy C-Mean

More information

Exemplar Grade 9 Reading Test Questions

Exemplar Grade 9 Reading Test Questions Exemplar Grade 9 Reading Test Questions discoveractaspire.org 2017 by ACT, Inc. All rights reserved. ACT Aspire is a registered trademark of ACT, Inc. AS1006 Introduction Introduction This booklet explains

More information

A Privacy-Sensitive Approach to Modeling Multi-Person Conversations

A Privacy-Sensitive Approach to Modeling Multi-Person Conversations A Privacy-Sensitive Approach to Modeling Multi-Person Conversations Danny Wyatt Dept. of Computer Science University of Washington danny@cs.washington.edu Jeff Bilmes Dept. of Electrical Engineering University

More information

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING

BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING BUILDING CONTEXT-DEPENDENT DNN ACOUSTIC MODELS USING KULLBACK-LEIBLER DIVERGENCE-BASED STATE TYING Gábor Gosztolya 1, Tamás Grósz 1, László Tóth 1, David Imseng 2 1 MTA-SZTE Research Group on Artificial

More information

TEKS Correlations Proclamation 2017

TEKS Correlations Proclamation 2017 and Skills (TEKS): Material Correlations to the Texas Essential Knowledge and Skills (TEKS): Material Subject Course Publisher Program Title Program ISBN TEKS Coverage (%) Chapter 114. Texas Essential

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Secondary English-Language Arts

Secondary English-Language Arts Secondary English-Language Arts Assessment Handbook January 2013 edtpa_secela_01 edtpa stems from a twenty-five-year history of developing performance-based assessments of teaching quality and effectiveness.

More information

School Inspection in Hesse/Germany

School Inspection in Hesse/Germany Hessisches Kultusministerium School Inspection in Hesse/Germany Contents 1. Introduction...2 2. School inspection as a Procedure for Quality Assurance and Quality Enhancement...2 3. The Hessian framework

More information

Voice conversion through vector quantization

Voice conversion through vector quantization J. Acoust. Soc. Jpn.(E)11, 2 (1990) Voice conversion through vector quantization Masanobu Abe, Satoshi Nakamura, Kiyohiro Shikano, and Hisao Kuwabara A TR Interpreting Telephony Research Laboratories,

More information

Case study Norway case 1

Case study Norway case 1 Case study Norway case 1 School : B (primary school) Theme: Science microorganisms Dates of lessons: March 26-27 th 2015 Age of students: 10-11 (grade 5) Data sources: Pre- and post-interview with 1 teacher

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Interactive Whiteboard

Interactive Whiteboard 50 Graphic Organizers for the Interactive Whiteboard Whiteboard-ready graphic organizers for reading, writing, math, and more to make learning engaging and interactive by Jennifer Jacobson & Dottie Raymer

More information

Evolutive Neural Net Fuzzy Filtering: Basic Description

Evolutive Neural Net Fuzzy Filtering: Basic Description Journal of Intelligent Learning Systems and Applications, 2010, 2: 12-18 doi:10.4236/jilsa.2010.21002 Published Online February 2010 (http://www.scirp.org/journal/jilsa) Evolutive Neural Net Fuzzy Filtering:

More information

Non-Secure Information Only

Non-Secure Information Only 2006 California Alternate Performance Assessment (CAPA) Examiner s Manual Directions for Administration for the CAPA Test Examiner and Second Rater Responsibilities Completing the following will help ensure

More information

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s)) Ohio Academic Content Standards Grade Level Indicators (Grade 11) A. ACQUISITION OF VOCABULARY Students acquire vocabulary through exposure to language-rich situations, such as reading books and other

More information

Meta Comments for Summarizing Meeting Speech

Meta Comments for Summarizing Meeting Speech Meta Comments for Summarizing Meeting Speech Gabriel Murray 1 and Steve Renals 2 1 University of British Columbia, Vancouver, Canada gabrielm@cs.ubc.ca 2 University of Edinburgh, Edinburgh, Scotland s.renals@ed.ac.uk

More information

Seminar - Organic Computing

Seminar - Organic Computing Seminar - Organic Computing Self-Organisation of OC-Systems Markus Franke 25.01.2006 Typeset by FoilTEX Timetable 1. Overview 2. Characteristics of SO-Systems 3. Concern with Nature 4. Design-Concepts

More information

INPE São José dos Campos

INPE São José dos Campos INPE-5479 PRE/1778 MONLINEAR ASPECTS OF DATA INTEGRATION FOR LAND COVER CLASSIFICATION IN A NEDRAL NETWORK ENVIRONNENT Maria Suelena S. Barros Valter Rodrigues INPE São José dos Campos 1993 SECRETARIA

More information

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282) B. PALTRIDGE, DISCOURSE ANALYSIS: AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC. 2012. PP. VI, 282) Review by Glenda Shopen _ This book is a revised edition of the author s 2006 introductory

More information

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY Chen, Hsin-Hsi Department of Computer Science and Information Engineering National Taiwan University Taipei, Taiwan E-mail: hh_chen@csie.ntu.edu.tw Abstract

More information

Calibration of Confidence Measures in Speech Recognition

Calibration of Confidence Measures in Speech Recognition Submitted to IEEE Trans on Audio, Speech, and Language, July 2010 1 Calibration of Confidence Measures in Speech Recognition Dong Yu, Senior Member, IEEE, Jinyu Li, Member, IEEE, Li Deng, Fellow, IEEE

More information

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016 AGENDA Advanced Learning Theories Alejandra J. Magana, Ph.D. admagana@purdue.edu Introduction to Learning Theories Role of Learning Theories and Frameworks Learning Design Research Design Dual Coding Theory

More information