A Fast Bandit Algorithm for Recommendations to Users with Heterogeneous Tastes


Pushmeet Kohli and Mahyar Salek
Microsoft Research, Cambridge, United Kingdom
{pkohli, mahyar}@microsoft.com

Greg Stoddard
Northwestern University, Evanston, Illinois, USA
gregs@u.northwestern.edu

Abstract

We study recommendation in scenarios where there's no prior information about the quality of content in the system. We present an online algorithm that continually optimizes recommendation relevance based on behavior of past users. Our method trades weaker theoretical guarantees in asymptotic performance than the state-of-the-art for stronger theoretical guarantees in the online setting. We test our algorithm on real-world data collected from previous recommender systems and show that our algorithm learns faster than existing methods and performs equally well in the long run.

1 Introduction

The market for online content consumption and the rate at which content is produced have experienced immense growth over the past few years. New content is generated on a daily or even hourly basis, creating an incredibly fast turn-over time for relevant content. While traditional search and recommendation engines have the ability to discover quality content in an offline manner, services such as news aggregators need to constantly adjust their recommendations to cater to current hot topics. For example, articles about the U.S. presidential inauguration may be quite popular on January 21st, the day of the inauguration, but they're likely to fall out of favor on the morning of the 22nd. In the face of such rapid changes in relevance, online algorithms which continually optimize recommendations based on user usage data provide an attractive solution.

We propose a simple online recommendation algorithm which learns quickly from user click data to minimize abandonment, the event that a user does not click on any articles in the recommended set (also known as %no in the information retrieval community).
Our algorithm operates with minimal assumptions and no knowledge of features of users or articles, and thus is well-suited to address changing environments induced by frequent turn-over in the set of potential articles and shifts in user preferences. We focus on content such as news articles, jokes, or movies, where users have varying tastes but there's no notion of a single correct recommendation.

Copyright © 2013, Association for the Advancement of Artificial Intelligence. All rights reserved.

Recommending relevant content is a key challenge for search engines and recommendation systems and has been extensively studied in the information retrieval community. The early guiding principle in the IR literature was the probability ranking principle (PRP) (Robertson 1977), stating that articles should be ranked in decreasing order of relevance probability. (Chen and Karger 2006) noted that optimizing with PRP in mind may yield sub-optimal outcomes, particularly when the objective is minimizing abandonment. In recent years, the concept of diversity in recommended sets of content has emerged as a guiding principle which better serves in addressing goals such as abandonment minimization. The intuitive goal behind a diverse set of content is to use each article in the set to satisfy a different type of user. This approach is particularly applicable to the canonical problem of handling a variety of user intents; when a user searches for a term such as "jaguar", their intended meaning could be the car, the animal, the American football team, or a number of different meanings.

This paper compares the PRP and diversity principle from an online algorithm perspective. We compare our online algorithm, which is implicitly based on the PRP, with the Ranked Bandit Algorithm (RBA) of (Radlinski, Kleinberg, and Joachims 2008), which is based on the diversity principle. While the diversity principle yields superior offline performance, our approach has stronger theoretical guarantees in the online case. Our empirical work focuses on a fundamentally different sort of user preference than the previous diversity work.
Instead of intent, we cater to a heterogeneity of users' tastes, i.e. does the user find this joke funny or will the user like this news article. Surprisingly, we find that explicitly incorporating diversity in this setting doesn't yield a large gain; the offline PRP-based solution gives nearly the same performance as the offline diversity-based solution.

At the heart of our method is the use of a stochastic multi-armed bandit algorithm to control the trade-off between exploration and exploitation of articles. A multi-armed bandit problem is an abstract game where a player is in a room with many different slot machines (slot machines are sometimes called one-armed bandits), with no prior knowledge of the payoffs of any machines. His goal is to maximize his total reward from the slot machines and in doing so, he must explore machines to test which machine has the highest average payoff but also exploit those he knows to have high rewards. Similar to (Radlinski, Kleinberg, and Joachims 2008), the primary contribution of our algorithm is the method by which we combine instances of several MAB algorithms to efficiently approximate this combinatorial set recommendation problem.

1.1 Our Contributions

We present an online algorithm for the minimization of abandonment. Our method uses several instances of a multi-armed bandit algorithm working (almost) independently to recommend a set of articles. Although the independence between bandit instances carries all the drawbacks of the PRP, we use a stochastic optimization concept known as the correlation gap (Agrawal et al. 2010) to prove that our algorithm has near-optimal performance in the online setting. Furthermore, the independence between bandit instances allows for a faster learning rate than online algorithms based on the diversity principle.

Our second contribution is an empirical study of bandit-based recommendation algorithms on real-world datasets collected from previous recommendation algorithm research. We find that while in theory the diversity-based solutions yield superior offline solutions, in practice there are only small differences between the offline diversity-based solution and the offline PRP-based solution. We also empirically verify that the learning rate of our method is faster than that of existing methods.

2 Previous Work

Previous work in information retrieval and machine learning has addressed recommendation to heterogeneous populations via the goal of maximizing diversity in search results, but the literature varies widely in modeling assumptions. In some work, diversity refers to increasing the set of topics that a recommended set of articles or search results may cover (Agrawal et al. 2009) and (Panigrahi et al. 2012). Other works assume users intrinsically value diversity; (Raman, Shivaswamy, and Joachims 2012) and (Yue and Guestrin 2011) both assume a rich feature model and use online learning techniques to learn user utility functions. (Li et al. 2010) give an online approach for news recommendation using a user's profile as a feature vector.
(Chen and Karger 2006) prove in a general sense that the standard submodular greedy algorithm is the optimal way to incorporate diversity into search result rankings.

By contrast, our work makes few assumptions. In this sense, our work is closer to the literature on online stochastic submodular maximization, particularly in the bandit setting. (Calinescu et al. 2011) prove that a continuous version of the standard submodular greedy algorithm yields an optimal approximation for all matroid constraints and (Streeter, Golovin, and Krause 2009) give a similar method (though less general) which can be extended to the online setting. The work most closely related to ours is (Radlinski, Kleinberg, and Joachims 2008) and (Slivkins, Radlinski, and Gollapudi 2010), although the latter work makes strong use of a similarity measure between documents whereas we assume no such construct. Their ranked bandit algorithm serves as our baseline in this paper and we discuss the relationships between our methods in later sections.

3 Problem Formalization

We consider the problem of minimizing abandonment for an article recommendation system. At the beginning of the day, n articles are submitted to the system. When a user visits our site, they're presented with a set of k articles; if the user finds any of the articles relevant, he clicks on it and we receive a payoff of 1. If the user finds no articles relevant, we receive a payoff of 0. We receive no additional payoffs if the user clicks on more than one article.

Each user j can be represented by a $\{0, 1\}^n$-vector $X^j$, where $X^j_i = 1$ indicates that user j finds article i relevant. These relevance vectors $X^j$ are distributed according to some unknown distribution D. These relevance vectors can be thought of as representing the type of a user. This type structure allows for a large degree of correlation between article relevances. At each time period t, a random user arrives, corresponding to choosing a vector $X^t$ i.i.d. from D, and we present a set of k articles $S^t$. Let $F(S^t, X^t)$ denote the payoff for showing set $S^t$ to a user with relevance vector $X^t$.
We'll refer to F as the set relevance function; it has the following form:

$$F(S^t, X^t) = \begin{cases} 1 & \text{if } X^t_i = 1 \text{ for some } i \in S^t \\ 0 & \text{otherwise} \end{cases} \qquad (1)$$

The user's relevance vector $X^t$ is not observed before the set is chosen. Thus the value of displaying a set $S^t$ is the expected value $E[F(S^t, X)]$, where the expectation is taken over the realization of the relevance vector X from the distribution D. When it is clear, we will write $E[F(S)]$ as shorthand for $E[F(S, X)]$. In words, $E[F(S)]$ is the fraction of users who will be satisfied by at least one article in S. The problem of minimizing abandonment is equivalent to the problem of maximizing $E[F(S)]$ subject to $|S| \le k$. For the remainder of this paper, we'll focus on maximizing expected set relevance $E[F(S)]$.

Before turning to the online version of this problem, we consider optimization in the offline setting. In the offline setting, an algorithm would have access to the distribution, but even with such assumptions the problem is NP-hard [1].

[1] This can be shown by a standard reduction from the max coverage problem. See (Radlinski, Kleinberg, and Joachims 2008) for details.

Despite this intractability, we can take advantage of the structure of $F(S)$, namely that it is submodular, and use the greedy algorithm of (Nemhauser, Wolsey, and Fisher 1978). This yields a $(1 - \frac{1}{e})$ approximation, which is the best possible approximation under complexity theoretic assumptions. (Chen and Karger 2006) argue this greedy approach yields an optimally diverse set of articles. A set function G is said to be submodular if, for all elements a and sets S, T such that $S \subseteq T$,

$$G(S \cup \{a\}) - G(S) \ge G(T \cup \{a\}) - G(T).$$

The set relevance function $F(S, X)$, as defined in equation 1, is submodular and this property forms the theoretical basis for the online approaches given in the next section.
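As a rough illustration of the offline setting, the following Python sketch estimates $E[F(S)]$ from a sample of relevance vectors and builds a set greedily by marginal gain. The function names and the six-user sample are hypothetical and chosen only for illustration; the greedy loop follows the general idea of the submodular greedy algorithm rather than any specific implementation from the paper.

```python
def set_relevance(S, x):
    """F(S, X): 1 if the user with relevance vector x finds any article in S relevant."""
    return 1 if any(x[i] == 1 for i in S) else 0


def expected_relevance(S, users):
    """Empirical estimate of E[F(S)]: fraction of sampled users satisfied by S."""
    return sum(set_relevance(S, x) for x in users) / len(users)


def greedy_set(users, k):
    """Add, k times, the article with the largest marginal gain in E[F(S)]."""
    n = len(users[0])
    chosen = []
    for _ in range(k):
        candidates = [i for i in range(n) if i not in chosen]
        best = max(candidates,
                   key=lambda i: expected_relevance(chosen + [i], users))
        chosen.append(best)
    return chosen


# Hypothetical sample: article 0 pleases four users, article 1 pleases a
# subset of those same users, article 2 pleases the two remaining users.
USERS = [
    (1, 1, 0), (1, 1, 0), (1, 1, 0), (1, 0, 0),
    (0, 0, 1), (0, 0, 1),
]

picked = greedy_set(USERS, 2)
print(picked, expected_relevance(picked, USERS))
```

On this sample the second greedy pick is article 2 rather than the more popular article 1, because article 1 adds no users who are not already satisfied by article 0.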

4 The Online Problem

We now turn to the online version of this problem, which presents a classic explore-exploit tradeoff: we must balance the need to learn the average relevance of articles with no feedback against the need to exploit the good articles that we've already discovered. We solve this problem using theoretical results from the multi-armed bandit (MAB) literature, a class of algorithms which solve exploration-exploitation problems. Bandit problems can be distinguished by the assumptions made on the rewards. In the stochastic bandit problem, rewards for each option are drawn from a stationary distribution, while in the adversarial setting, payoffs for each option are determined by an adversary who has knowledge of past play, history of rewards, and the strategy that the player is using.

The objective of an online algorithm is the minimization of regret, where the regret of an algorithm is defined as the expected difference between the accumulated rewards of the single best option and the rewards accumulated by that algorithm. In our context, this is the difference between the fraction of users satisfied by the optimal set of articles and the fraction of users satisfied by the recommendation algorithm. However, as we noted in the previous section, maximization of $E[F(S)]$ is intractable, so we follow the approach of (Streeter, Golovin, and Krause 2009) and (Radlinski, Kleinberg, and Joachims 2008) and use $(1 - \frac{1}{e})\,OPT$ as the offline benchmark. The regret after time T is defined as

$$R(T) = \left(1 - \frac{1}{e}\right) \sum_{t=0}^{T} E[F(S^*)] - \sum_{t=0}^{T} E[F(S^t)]$$

There are known bandit algorithms which achieve provably-minimal regret (up to constant factors), but direct application of these bandit algorithms requires exploring all possible options at least once. In our setting, each subset of articles is a potential option and hence there are exponentially many options, making standard MAB algorithms impractical. In the next section we present two approaches, one from previous work and our algorithm, for combining several instances of a bandit algorithm to yield a low-regret and computationally efficient solution to this recommendation problem.
4.1 Ranked Bandit Approach

The work of (Radlinski, Kleinberg, and Joachims 2008) and (Streeter, Golovin, and Krause 2009) introduced the ranked bandit algorithm to solve the problem of minimizing abandonment. The pseudocode is given in algorithm 1. The idea behind the ranked bandit algorithm is to use k instances of a MAB algorithm to learn the greedy-optimal solution (which is also the diversity-optimal solution). Specifically, k instances of a bandit algorithm are created, where bandit i is responsible for selecting the article to be displayed in slot i. The algorithm is designed such that the bandit in slot i attempts to maximize the marginal gain of the article in slot i. In the context of minimizing abandonment, bandit i attempts to maximize the click-through-rate of the article in slot i given that the user has not clicked on any earlier articles.

Algorithm 1 Ranked Bandit
  1: MAB_i : Bandit algorithm for slot i
  2: for t = 1...T do
  3:   s_i ← selectArticle(MAB_i, N)
  4:   S^t ← ∪_i s_i
  5:   Display S^t to user, receive feedback vector X^t
  6:   Feedback: z_i = 1 if article s_i was the first click, 0 otherwise
  7:   update(MAB_i, z_i)
  8: end for

While RBA works with any bandit algorithm, the regret of RBA depends on the choice of bandit algorithm. (Radlinski, Kleinberg, and Joachims 2008) use an adversarial bandit algorithm known as EXP3 in their work and show that RBA inherits the regret bounds guaranteed by EXP3. However, the adversarial assumption is overly pessimistic in this problem and ideally we could make use of the stochastic nature of user behavior. Stochastic bandit algorithms such as UCB1 have better theoretical and practical performance, but the dependence between slots in RBA violates the necessary independence assumptions for the stochastic setting. In their work, (Radlinski, Kleinberg, and Joachims 2008) show RBA to have regret on the order of $O(k\sqrt{Tn \lg(n)})$. Our approach, discussed in the next section, is able to leverage the stochastic nature of the problem without complication and thus achieves a provable regret of $O(kn \lg(T))$.
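One round of the ranked bandit scheme can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation: the `EpsilonGreedy` class and `ranked_bandit_round` function are hypothetical names, an ε-greedy learner stands in for the bandit algorithm, and two details are assumptions not spelled out in the pseudocode above (a duplicate pick is replaced by a random unused article, and a slot is rewarded only if its own pick was the one displayed).

```python
import random


class EpsilonGreedy:
    """Simple stochastic bandit: explore uniformly with probability epsilon."""

    def __init__(self, n_arms, epsilon=0.05, rng=None):
        self.epsilon = epsilon
        self.rng = rng or random.Random()
        self.counts = [0] * n_arms
        self.means = [0.0] * n_arms

    def select(self, allowed):
        allowed = list(allowed)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(allowed)
        return max(allowed, key=lambda a: self.means[a])

    def update(self, arm, reward):
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]


def ranked_bandit_round(bandits, n_articles, x, rng):
    """Run one RBA round for a user with relevance vector x; return F(S, x)."""
    picks, shown = [], []
    for mab in bandits:
        pick = mab.select(range(n_articles))  # every slot chooses from all of N
        picks.append(pick)
        if pick in shown:
            # Assumption: swap a duplicate for a random article not yet shown.
            pick = rng.choice([a for a in range(n_articles) if a not in shown])
        shown.append(pick)
    # Slot index of the first relevant (clicked) article, if any.
    first_click = next((s for s, a in enumerate(shown) if x[a] == 1), None)
    for slot, mab in enumerate(bandits):
        # z_i = 1 only for the slot that received the first click.
        z = 1 if slot == first_click and picks[slot] == shown[slot] else 0
        mab.update(picks[slot], z)
    return 0 if first_click is None else 1


rng = random.Random(0)
bandits = [EpsilonGreedy(3, rng=rng) for _ in range(2)]
payoff = ranked_bandit_round(bandits, 3, (0, 1, 1), rng)
print(payoff)
```

Because later slots are only rewarded when no earlier slot was clicked, the feedback seen by slot i + 1 depends on what slots 1 to i happen to display in that round.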
In addition to the lack of theoretical guarantees, the learning rate of RBA can be quite slow because of wrong feedback. The correct value of an article in slot i + 1 is the marginal value of that article given that slots 1 to i are displaying the correct articles, that is, the first i articles in the greedy solution. In any time period where those articles aren't displayed, the marginal value of any article in slot i + 1 will not necessarily be correct. Although early slots should display the correct articles most of the time, later slots can't begin learning correctly until the earlier slots converge. This effectively induces sequential learning across slots, and back-of-the-envelope calculations suggest that correct learning will only begin in slot k + 1 after $\Omega(n^k)$ time steps have passed.

4.2 Independent Bandit Approach

In this section we describe our method, which we call the independent bandit algorithm (IBA) and which is implicitly based on the probability ranking principle. Rather than learning the marginal values as in the ranked bandit algorithm, the independent bandit algorithm optimizes the click-through-rate of each slot independently of the other slots. Using tools from stochastic optimization theory, we prove that the independent bandit algorithm has near-optimal regret and our simulations demonstrate that IBA converges to its offline-optimal solution much quicker than RBA.

The pseudocode for the independent bandit algorithm is given in algorithm 2. Line 5 ensures that the bandit algorithms don't select the same articles by temporarily removing articles already displayed from the set of potential articles for bandits in later slots. The main difference between the independent and the ranked bandit algorithm is the feedback; IBA gives a reward of 1 to any article that was clicked on while RBA only gives a reward of 1 to the first article that was clicked on. This independence between bandit instances in IBA allows for learning to happen in parallel, enabling a faster rate of learning for IBA.

To analyze the regret of IBA, we must first derive an approximation guarantee for what the offline version of the independent algorithm would compute. The independent-optimal solution consists of the k articles with the highest click-through-rates. If article relevances were all independent then the independent-optimal solution would be the optimal solution; however, the independent-optimal solution will be sub-optimal when there are correlations between article relevances. We use the correlation gap result of (Agrawal et al. 2010) to show that the independent-optimal solution yields a $(1 - \frac{1}{e})$ approximation to the optimal solution for any distribution over user relevance vectors.

The correlation gap is a concept in stochastic optimization which quantifies the loss incurred by optimizing under the assumption that all random variables are independent. Formally, let $G(S, X)$ be some function where S is the decision variable and X is a vector of {0, 1} random variables, where X is drawn from some arbitrary distribution D. Let $D_I$ be the product distribution if each $X_i$ were an independent Bernoulli variable with probability equal to its marginal probability under D. When G is a nondecreasing, submodular function the correlation gap is quite small.

Theorem (Agrawal et al. 2010). Let G be a nondecreasing, submodular function. Let $S^*$ and $S_I^*$ be the optimizers for $E_D[G(S, X)]$ and $E_{D_I}[G(S, X)]$ respectively. Then $E_D[G(S_I^*, X)] \ge (1 - \frac{1}{e})\, E_D[G(S^*, X)]$.

Now we consider the independent bandit algorithm.
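The gap between the independent-optimal set and the true optimum can be checked numerically on a small example. The Python sketch below is only an illustration under stated assumptions: the function names and the six-user sample are hypothetical, and the optimum is found by brute force, which is feasible only for tiny n and k.

```python
from itertools import combinations
import math


def expected_relevance(S, users):
    """Fraction of sampled users with at least one relevant article in S."""
    return sum(1 for x in users if any(x[i] for i in S)) / len(users)


def independent_optimal(users, k):
    """The k articles with the highest marginal click-through-rates."""
    n = len(users[0])
    ctr = [sum(x[i] for x in users) / len(users) for i in range(n)]
    return sorted(range(n), key=lambda i: ctr[i], reverse=True)[:k]


def brute_force_optimal(users, k):
    """Exhaustive search over all size-k sets (exponential in general)."""
    n = len(users[0])
    return max(combinations(range(n), k),
               key=lambda S: expected_relevance(S, users))


# Hypothetical, strongly correlated sample: fans of article 1 all like article 0.
USERS = [
    (1, 1, 0), (1, 1, 0), (1, 1, 0), (1, 0, 0),
    (0, 0, 1), (0, 0, 1),
]

s_ind = independent_optimal(USERS, 2)
s_opt = brute_force_optimal(USERS, 2)
ratio = expected_relevance(s_ind, USERS) / expected_relevance(s_opt, USERS)
print(s_ind, s_opt, ratio, ratio >= 1 - 1 / math.e)
```

Here the independent-optimal set satisfies four of six users while the optimal set satisfies all six, so the ratio is 2/3, which on this toy instance stays above the $1 - \frac{1}{e} \approx 0.632$ factor stated in the theorem.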
The key property of IBA is that individual bandit instances do not affect each other, and this allows us to prove that IBA inherits the low regret of the underlying stochastic bandit algorithms, yielding better regret bounds than RBA. For the purposes of the next theorem, we use the UCB1 algorithm (details are given in section 5), which has regret $O(n \lg(T))$.

Algorithm 2 Independent Bandit
  1: MAB_i : Bandit algorithm for slot i
  2: for t = 1...T do
  3:   S^t_0 = ∅
  4:   for i = 1...k do
  5:     S^t_i ← selectArticle(MAB_i, N \ S^t_{i-1})
  6:   end for
  7:   Display S^t to user, receive feedback vector X^t
  8:   Feedback: z_i = 1 if article s_i was clicked on, 0 otherwise
  9:   update(MAB_i, z_i)
  10: end for

Theorem 1. When UCB1 is used as the bandit algorithm for IBA, the accumulated rewards satisfy

$$E\left[\sum_{t=1}^{T} F(S^t, X^t)\right] \ge \left(1 - \frac{1}{e}\right) OPT - O(kn \lg(T))$$

Proof. The high-level idea is to first show that IBA has low regret when compared with the independent-optimal set. We then apply the correlation gap of (Agrawal et al. 2010) to conclude the regret is close to $(1 - \frac{1}{e})\,OPT$.

For a given document displayed in slot i, let $p_i$ denote the marginal probability of relevance, that is $p_i = E_X[X_i]$. Assume for now that all $X_i$ are independent. Using this independence assumption, for a given set $S^t$ we can write the expected value of $F(S^t, X)$ as follows

$$E[F(S^t)] = \sum_{i=1}^{k} \prod_{j=1}^{i-1} (1 - p_j)\, p_i \qquad (2)$$

(note, this equation gives the same value for any permutation of $S^t$). Let $S_I^*$ denote the set which maximizes the above function under the assumption that all $X_i$ are independent. Trivially, this set consists of the k articles with the largest $p_i$. Label these elements $p_i^*$ for i = 1...k. At a given time t let $S^t$ denote the set played and let $S^t_i$ represent the ith element of this set. Define $\delta_i^t = p_i^* - p_i^t$, that is, the difference between the relevance probability of the best article and the relevance probability of the article actually played at time t. Then

$$E[F(S^t)] = \sum_{i=1}^{k} \prod_{j=1}^{i-1} \left(1 - (p_j^* - \delta_j^t)\right)(p_i^* - \delta_i^t) \ge \sum_{i=1}^{k} \prod_{j=1}^{i-1} (1 - p_j^*)\, p_i^* - \sum_{i=1}^{k} \delta_i^t = E[F(S_I^*)] - \sum_{i=1}^{k} \delta_i^t$$

Now taking the sum of the $F(S^t, X)$ over time yields

$$E\left[\sum_{t=1}^{T} F(S^t, X)\right] \ge \sum_{t=1}^{T} E[F(S_I^*)] - \sum_{i=1}^{k} \sum_{t=1}^{T} \delta_i^t$$

The term $\sum_t \delta_i^t$ is the regret incurred in slot i. (Auer, Cesa-Bianchi, and Fischer 2002) prove that the regret of UCB1 is bounded by $O(n \lg(T))$, so $\sum_t \delta_i^t \le O(n \lg(T))$ for each slot.

In the above analysis, we assumed that the probability of an article being relevant was independent of each other $X_i$, which is usually a faulty assumption. However, the work of (Agrawal et al. 2010) shows that optimizing under the independence assumption yields a provable approximation. Let $S^*$ denote the set which maximizes $E[F(S, X)]$. Then the correlation gap implies $E[F(S_I^*)] \ge (1 - \frac{1}{e})\, E[F(S^*)]$. Combining this with the above regret bound yields the result

$$E\left[\sum_{t=1}^{T} F(S^t, X^t)\right] \ge \left(1 - \frac{1}{e}\right) OPT - O(kn \lg(T))$$
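A single round of the independent bandit scheme with a UCB1 learner in each slot might look like the Python sketch below. The class and function names are hypothetical, the natural logarithm is used in the exploration bonus as in Auer et al. (a different base only rescales it), and the user is assumed to click every relevant article shown, as in the experimental protocol of section 5.

```python
import math


class UCB1:
    """UCB1 index policy over a fixed set of arms."""

    def __init__(self, n_arms):
        self.counts = [0] * n_arms
        self.means = [0.0] * n_arms
        self.t = 0  # number of updates seen by this bandit

    def select(self, allowed):
        allowed = list(allowed)
        for a in allowed:
            if self.counts[a] == 0:
                return a  # play every available arm once before using the index
        return max(
            allowed,
            key=lambda a: self.means[a]
            + math.sqrt(2 * math.log(self.t) / self.counts[a]),
        )

    def update(self, arm, reward):
        self.t += 1
        self.counts[arm] += 1
        self.means[arm] += (reward - self.means[arm]) / self.counts[arm]


def independent_bandit_round(bandits, n_articles, x):
    """Run one IBA round for a user with relevance vector x; return F(S, x)."""
    shown = []
    for mab in bandits:
        # Later slots choose only among articles not already displayed.
        allowed = [a for a in range(n_articles) if a not in shown]
        shown.append(mab.select(allowed))
    for mab, article in zip(bandits, shown):
        # z_i = 1 if the article in slot i was clicked, regardless of other slots.
        mab.update(article, x[article])
    return 1 if any(x[a] for a in shown) else 0


bandits = [UCB1(3) for _ in range(2)]
print(independent_bandit_round(bandits, 3, (1, 0, 1)))
```

Each slot's reward depends only on its own article, so all k bandits receive usable feedback in every round.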

Figure 1: Movie-Lens-100 dataset with relevance threshold θ = 2, the low threshold. The Ranked-εGreedy method starts performing better after t = [...] but fails to achieve the theoretical optimum performance within [...] time steps. The Independent-εGreedy algorithm achieves its offline optimum after [...] time steps.

It is worth noting that the independent-optimal solution is (weakly) worse than the greedy-optimal solution, so RBA will asymptotically outperform IBA. However, the previous theorem shows that IBA has the same worst-case guarantee along with a better regret bound that holds uniformly throughout time. In the next section, we simulate both algorithms using real-world datasets and show that the asymptotic performances of the two methods are essentially equal but IBA performs better in the short term.

5 Experimental Results

In this section, we give the results of experiments we used to test the empirical difference in performance between the ranked bandit algorithm and the independent bandit algorithm.

Datasets. We used two publicly available datasets as our input for user preferences. Our first dataset is from the Jester project (Goldberg et al. 2001) and is a collection of user ratings on jokes, ranging from -10 (very not funny) to 10 (very funny). Our second dataset comes from the MovieLens project (movieslens.umn.edu) and consists of user ratings assigned to movies, where each rating is from 1 (bad) to 5 (very good). Each dataset consists of a collection of <userid, articleid, rating>-tuples denoting the rating that the user gave to this article (either a joke or a movie). With the Jester dataset, we used two separate datasets. Jester-Small consists of users' ratings on 10 articles where each user had rated most articles in the set. Jester-Large consists of users' ratings on 100 articles but there are many unrated articles for each user. In the case where a user didn't rate an article, we assign that article the lowest score. Movie-Lens-100 consists of ratings by 943 users on a sub-sampled set of 100 articles from the MovieLens dataset.

Figure 2: Movie-Lens-100 dataset with relevance threshold θ = 4, the high threshold. The Independent-εGreedy method performs the best out of all four methods.

For all datasets, we convert real-valued ratings to binary relevant-or-not scores by using a threshold rule; if the rating assigned by a user to an article exceeds a threshold θ, then that article is deemed relevant to that user. For each dataset, we tested a high and a low threshold for relevance [2].

The data we use is of a fundamentally different nature than that generated by (Radlinski, Kleinberg, and Joachims 2008). In that work, they model user intent, i.e. is a user that searches for the term "jaguar" talking about the car, the animal, or some other meaning? In our work, we care about user taste, i.e. which joke or movie will a user like? In the case of intent, there's generally a correct answer and a single article rarely satisfies multiple types of users. For the case of taste, there is rarely a single correct answer and a single article may satisfy many different types of users.

Baselines. In our experiments, we used two well-known stochastic multi-armed bandit algorithms to test the Ranked Bandit Algorithm and the Independent Bandit Algorithm. Both algorithms, UCB1 and ε-greedy, are examined in detail in (Auer, Cesa-Bianchi, and Fischer 2002) but we briefly review them here. In each time step t, UCB1 plays the option which maximizes

$$\bar{x}_i + \sqrt{\frac{2 \lg(t)}{t_i}}$$

where $\bar{x}_i$ denotes the current average reward of option i and $t_i$ denotes the number of times that option i has been played so far. The second term in this equation naturally induces exploration since this term grows for options that have not been played in a while.

The second MAB algorithm is the ε-greedy algorithm. At each time t, with probability ε a uniformly random arm is played, and with probability 1 − ε the option with the current highest average reward is played. Note that this algorithm requires the ε parameter to be tuned; for these experiments, we set ε = 0.05, which proved to give the best average performance during initial tests.

[2] We only show the results for a few different datasets due to space constraints. These datasets are representative of the qualitative results from the entire set of experiments.
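The threshold rule from the Datasets paragraph can be sketched in Python as below. This is a minimal illustration assuming articles are indexed 0 to n − 1; the function name, the parameter names, and the sample ratings are hypothetical, and the real Jester and MovieLens files would need their own parsing.

```python
def to_relevance_vectors(ratings, n_articles, theta, lowest_score):
    """Turn (user_id, article_id, rating) tuples into binary relevance vectors.

    Unrated articles receive lowest_score; an article is relevant to a user
    when its rating strictly exceeds the threshold theta.
    """
    scores = {}
    for user_id, article_id, rating in ratings:
        row = scores.setdefault(user_id, [lowest_score] * n_articles)
        row[article_id] = rating
    return {
        user: tuple(1 if r > theta else 0 for r in row)
        for user, row in scores.items()
    }


# Hypothetical ratings on a 1-to-5 scale with threshold theta = 2.
sample = [("u1", 0, 5), ("u1", 1, 2), ("u2", 1, 4), ("u2", 2, 3)]
vectors = to_relevance_vectors(sample, n_articles=3, theta=2, lowest_score=1)
print(vectors)
```

With θ = 2 a rating of exactly 2 does not count as relevant, since the rule requires the rating to exceed the threshold.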

Figure 3: Jester-Large dataset with relevance threshold θ = 7, the high threshold. Ranked-εGreedy and Independent-εGreedy perform similarly; Independent-εGreedy performs better until t = 20000 but both remain very close, and well below the offline greedy optimum, for all time steps.

Our experiment consists of the following steps: at each time t, we draw a random user from the dataset and the algorithm recommends a set of k = 5 articles to display. We assume that the user clicks on any relevant articles displayed. If the user clicks on any articles, we get a payoff of 1 and a payoff of 0 otherwise. Each experiment consists of T = [...] time steps and we average our results over 200 repetitions of each experiment. Performance of each algorithm was measured by the percent of sets that contained at least one article relevant to the user. We show datapoints at 1000 time step increments and each datapoint shown is the average set relevance over the last 1000 time steps.

Key Results. The results of our experiments are displayed in figures 1, 2, 3, and 4. Each plot shows the performance of the online algorithms as well as the offline benchmarks. The performance of Independent-εGreedy and Independent-UCB were roughly the same in all cases, so we omit the results for Independent-UCB for clarity.

Our most surprising finding is the closeness of the greedy-optimal and the independent-optimal solutions. The largest difference between the two solutions, shown in figure 1, is 4%; if we displayed the greedy-optimal set of articles, approximately 92% of users would find at least one relevant article, while if we displayed the independent-optimal set, 88% of users would find at least one relevant article. This finding suggests that in settings where a recommendation algorithm is catering to tastes (as opposed to intents), explicit consideration of diversity may not be necessary since the independent-optimal solution yields similar results to the greedy-optimal solution.
Our second finding, which goes hand in hand with the previous one, is the favorable performance of the Independent Bandit Algorithm versus the performance of the Ranked Bandit Algorithm. In half of our experiments, either Independent-εGreedy or Independent-UCB performs strictly better than Ranked-εGreedy. In the experiments shown in figures 1 and 3, Ranked-εGreedy performs better than the independent solutions but only begins to perform better after [...] or [...] time steps.

Figure 4: Jester-Small dataset with relevance threshold θ = 3.5, the low threshold. In this case, the offline greedy and offline independent solutions were the exact same set.

The faster learning rate of IBA compared to RBA demonstrates a key feature of IBA; the independence between bandit instances in different slots allows learning to happen in parallel as opposed to the de facto sequential learning in RBA. This parallel learning allows for a quicker convergence to the independent-optimal solution. In all cases, the Ranked-εGreedy algorithm doesn't converge to the value of the greedy-optimal solution within [...] time steps.

Lastly, our experiments demonstrate a stark difference between the performance of Ranked-εGreedy and Ranked-UCB. As we noted at the end of section 4.1, learning for later slots in RBA is hindered by exploration in early slots. This effect is especially pronounced in the UCB1 algorithm when there are multiple articles that have high average rewards. The relatively low exploration rate of the ε-greedy algorithm allows for faster convergence in earlier slots and hence a faster learning rate for later slots. In RBA, low exploration raises the risk of playing a sub-optimal article in an earlier slot but the gain from the faster learning rate outweighs that potential loss.

6 Conclusion

We've presented a simple online algorithm for the problem of abandonment minimization in recommendation systems which has near-optimal performance in the online problem. We have demonstrated, theoretically and empirically, that our approach trades off a small loss in offline performance for a faster learning rate and stronger performance in the online setting.

In the future, we would like to investigate the extension of these MAB techniques to general submodular utility functions. Additionally, we would like to investigate how to run algorithms such as IBA or RBA when it is only possible to observe user feedback on the set of articles but not on the individual articles within the set.

References

Agrawal, R.; Gollapudi, S.; Halverson, A.; and Ieong, S. 2009. Diversifying search results. In Proceedings of the Second ACM International Conference on Web Search and Data Mining. ACM.

Agrawal, S.; Ding, Y.; Saberi, A.; and Ye, Y. 2010. Correlation robust stochastic optimization. In Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics.

Auer, P.; Cesa-Bianchi, N.; and Fischer, P. 2002. Finite-time analysis of the multiarmed bandit problem. Machine Learning 47(2).

Calinescu, G.; Chekuri, C.; Pál, M.; and Vondrák, J. 2011. Maximizing a monotone submodular function subject to a matroid constraint. SIAM Journal on Computing 40(6).

Chen, H., and Karger, D. 2006. Less is more: probabilistic models for retrieving fewer relevant documents. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM.

Goldberg, K.; Roeder, T.; Gupta, D.; and Perkins, C. 2001. Eigentaste: A constant time collaborative filtering algorithm. Information Retrieval 4(2).

Li, L.; Chu, W.; Langford, J.; and Schapire, R. E. 2010. A contextual-bandit approach to personalized news article recommendation. In Proceedings of the 19th International Conference on World Wide Web. ACM.

Mahajan, D. K.; Rastogi, R.; Tiwari, C.; and Mitra, A. LogUCB: an explore-exploit algorithm for comments recommendation. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management. ACM.

Nemhauser, G.; Wolsey, L.; and Fisher, M. 1978. An analysis of approximations for maximizing submodular set functions-I. Mathematical Programming 14(1).

Panigrahi, D.; Das Sarma, A.; Aggarwal, G.; and Tomkins, A. 2012. Online selection of diverse results. In Proceedings of the Fifth ACM International Conference on Web Search and Data Mining. ACM.

Radlinski, F.; Kleinberg, R.; and Joachims, T. 2008. Learning diverse rankings with multi-armed bandits. In Proceedings of the 25th International Conference on Machine Learning. ACM.

Raman, K.; Shivaswamy, P.; and Joachims, T. 2012. Online learning to diversify from implicit feedback. In Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '12. New York, NY, USA: ACM.

Robertson, S. E. 1977. The probability ranking principle in IR. Journal of Documentation 33(4).

Slivkins, A.; Radlinski, F.; and Gollapudi, S. 2010. Learning optimally diverse rankings over large document collections. arXiv preprint.

Streeter, M.; Golovin, D.; and Krause, A. 2009. Online learning of assignments. In Neural Information Processing Systems (NIPS).

Yue, Y., and Guestrin, C. 2011. Linear submodular bandits and their application to diversified retrieval. In Neural Information Processing Systems (NIPS).


More information

Active Learning. Yingyu Liang Computer Sciences 760 Fall

Active Learning. Yingyu Liang Computer Sciences 760 Fall Active Learning Yingyu Liang Computer Sciences 760 Fall 2017 http://pages.cs.wisc.edu/~yliang/cs760/ Some of the slides in these lectures have been adapted/borrowed from materials developed by Mark Craven,

More information

Exploration. CS : Deep Reinforcement Learning Sergey Levine

Exploration. CS : Deep Reinforcement Learning Sergey Levine Exploration CS 294-112: Deep Reinforcement Learning Sergey Levine Class Notes 1. Homework 4 due on Wednesday 2. Project proposal feedback sent Today s Lecture 1. What is exploration? Why is it a problem?

More information

Go fishing! Responsibility judgments when cooperation breaks down

Go fishing! Responsibility judgments when cooperation breaks down Go fishing! Responsibility judgments when cooperation breaks down Kelsey Allen (krallen@mit.edu), Julian Jara-Ettinger (jjara@mit.edu), Tobias Gerstenberg (tger@mit.edu), Max Kleiman-Weiner (maxkw@mit.edu)

More information

Objective: Total Time. (60 minutes) (6 minutes) (6 minutes) starting at 0. , 8, 10 many fourths? S: 4 fourths. T: (Beneat , 2, 4, , 14 , 16 , 12

Objective: Total Time. (60 minutes) (6 minutes) (6 minutes) starting at 0. , 8, 10 many fourths? S: 4 fourths. T: (Beneat , 2, 4, , 14 , 16 , 12 Lesson 9 5 Lesson 9 Objective: Estimate sums and differences using benchmark numbers. Suggested Lesson Structure F Fluency Practice ( minutes) A Application Problem (3 minutes) C Concept Development (35

More information

On the Combined Behavior of Autonomous Resource Management Agents

On the Combined Behavior of Autonomous Resource Management Agents On the Combined Behavior of Autonomous Resource Management Agents Siri Fagernes 1 and Alva L. Couch 2 1 Faculty of Engineering Oslo University College Oslo, Norway siri.fagernes@iu.hio.no 2 Computer Science

More information

Lecture 10: Reinforcement Learning

Lecture 10: Reinforcement Learning Lecture 1: Reinforcement Learning Cognitive Systems II - Machine Learning SS 25 Part III: Learning Programs and Strategies Q Learning, Dynamic Programming Lecture 1: Reinforcement Learning p. Motivation

More information

Assignment 1: Predicting Amazon Review Ratings

Assignment 1: Predicting Amazon Review Ratings Assignment 1: Predicting Amazon Review Ratings 1 Dataset Analysis Richard Park r2park@acsmail.ucsd.edu February 23, 2015 The dataset selected for this assignment comes from the set of Amazon reviews for

More information

Proof Theory for Syntacticians

Proof Theory for Syntacticians Department of Linguistics Ohio State University Syntax 2 (Linguistics 602.02) January 5, 2012 Logics for Linguistics Many different kinds of logic are directly applicable to formalizing theories in syntax

More information

Rule Learning with Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

Machine Learning and Development Policy

Machine Learning and Development Policy Machine Learning and Development Policy Sendhil Mullainathan (joint papers with Jon Kleinberg, Himabindu Lakkaraju, Jure Leskovec, Jens Ludwig, Ziad Obermeyer) Magic? Hard not to be wowed But what makes

More information

Georgetown University at TREC 2017 Dynamic Domain Track

Georgetown University at TREC 2017 Dynamic Domain Track Georgetown University at TREC 2017 Dynamic Domain Track Zhiwen Tang Georgetown University zt79@georgetown.edu Grace Hui Yang Georgetown University huiyang@cs.georgetown.edu Abstract TREC Dynamic Domain

More information

Reinforcement Learning by Comparing Immediate Reward

Reinforcement Learning by Comparing Immediate Reward Reinforcement Learning by Comparing Immediate Reward Punit Pandey DeepshikhaPandey Dr. Shishir Kumar Abstract This paper introduces an approach to Reinforcement Learning Algorithm by comparing their immediate

More information

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler Machine Learning and Data Mining Ensembles of Learners Prof. Alexander Ihler Ensemble methods Why learn one classifier when you can learn many? Ensemble: combine many predictors (Weighted) combina

More information

In Workflow. Viewing: Last edit: 10/27/15 1:51 pm. Approval Path. Date Submi ed: 10/09/15 2:47 pm. 6. Coordinator Curriculum Management

In Workflow. Viewing: Last edit: 10/27/15 1:51 pm. Approval Path. Date Submi ed: 10/09/15 2:47 pm. 6. Coordinator Curriculum Management 1 of 5 11/19/2015 8:10 AM Date Submi ed: 10/09/15 2:47 pm Viewing: Last edit: 10/27/15 1:51 pm Changes proposed by: GODWINH In Workflow 1. BUSI Editor 2. BUSI Chair 3. BU Associate Dean 4. Biggio Center

More information

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM

ISFA2008U_120 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Proceedings of 28 ISFA 28 International Symposium on Flexible Automation Atlanta, GA, USA June 23-26, 28 ISFA28U_12 A SCHEDULING REINFORCEMENT LEARNING ALGORITHM Amit Gil, Helman Stern, Yael Edan, and

More information

Learning to Rank with Selection Bias in Personal Search

Learning to Rank with Selection Bias in Personal Search Learning to Rank with Selection Bias in Personal Search Xuanhui Wang, Michael Bendersky, Donald Metzler, Marc Najork Google Inc. Mountain View, CA 94043 {xuanhui, bemike, metzler, najork}@google.com ABSTRACT

More information

Discriminative Learning of Beam-Search Heuristics for Planning

Discriminative Learning of Beam-Search Heuristics for Planning Discriminative Learning of Beam-Search Heuristics for Planning Yuehua Xu School of EECS Oregon State University Corvallis,OR 97331 xuyu@eecs.oregonstate.edu Alan Fern School of EECS Oregon State University

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

A General Class of Noncontext Free Grammars Generating Context Free Languages

A General Class of Noncontext Free Grammars Generating Context Free Languages INFORMATION AND CONTROL 43, 187-194 (1979) A General Class of Noncontext Free Grammars Generating Context Free Languages SARWAN K. AGGARWAL Boeing Wichita Company, Wichita, Kansas 67210 AND JAMES A. HEINEN

More information

Artificial Neural Networks written examination

Artificial Neural Networks written examination 1 (8) Institutionen för informationsteknologi Olle Gällmo Universitetsadjunkt Adress: Lägerhyddsvägen 2 Box 337 751 05 Uppsala Artificial Neural Networks written examination Monday, May 15, 2006 9 00-14

More information

ALL-IN-ONE MEETING GUIDE THE ECONOMICS OF WELL-BEING

ALL-IN-ONE MEETING GUIDE THE ECONOMICS OF WELL-BEING ALL-IN-ONE MEETING GUIDE THE ECONOMICS OF WELL-BEING LeanIn.0rg, 2016 1 Overview Do we limit our thinking and focus only on short-term goals when we make trade-offs between career and family? This final

More information

Automatic Discretization of Actions and States in Monte-Carlo Tree Search

Automatic Discretization of Actions and States in Monte-Carlo Tree Search Automatic Discretization of Actions and States in Monte-Carlo Tree Search Guy Van den Broeck 1 and Kurt Driessens 2 1 Katholieke Universiteit Leuven, Department of Computer Science, Leuven, Belgium guy.vandenbroeck@cs.kuleuven.be

More information

Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2

Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2 Assessing System Agreement and Instance Difficulty in the Lexical Sample Tasks of SENSEVAL-2 Ted Pedersen Department of Computer Science University of Minnesota Duluth, MN, 55812 USA tpederse@d.umn.edu

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

Reduce the Failure Rate of the Screwing Process with Six Sigma Approach

Reduce the Failure Rate of the Screwing Process with Six Sigma Approach Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Reduce the Failure Rate of the Screwing Process with Six Sigma Approach

More information

College Pricing and Income Inequality

College Pricing and Income Inequality College Pricing and Income Inequality Zhifeng Cai U of Minnesota, Rutgers University, and FRB Minneapolis Jonathan Heathcote FRB Minneapolis NBER Income Distribution, July 20, 2017 The views expressed

More information

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany

Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Entrepreneurial Discovery and the Demmert/Klein Experiment: Additional Evidence from Germany Jana Kitzmann and Dirk Schiereck, Endowed Chair for Banking and Finance, EUROPEAN BUSINESS SCHOOL, International

More information

STUDENT LEARNING ASSESSMENT REPORT

STUDENT LEARNING ASSESSMENT REPORT STUDENT LEARNING ASSESSMENT REPORT PROGRAM: Sociology SUBMITTED BY: Janine DeWitt DATE: August 2016 BRIEFLY DESCRIBE WHERE AND HOW ARE DATA AND DOCUMENTS USED TO GENERATE THIS REPORT BEING STORED: The

More information

Age Effects on Syntactic Control in. Second Language Learning

Age Effects on Syntactic Control in. Second Language Learning Age Effects on Syntactic Control in Second Language Learning Miriam Tullgren Loyola University Chicago Abstract 1 This paper explores the effects of age on second language acquisition in adolescents, ages

More information

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition Todd Holloway Two Lecture Series for B551 November 20 & 27, 2007 Indiana University Outline Introduction Bias and

More information

An Introduction to Simio for Beginners

An Introduction to Simio for Beginners An Introduction to Simio for Beginners C. Dennis Pegden, Ph.D. This white paper is intended to introduce Simio to a user new to simulation. It is intended for the manufacturing engineer, hospital quality

More information

How to get the most out of EuroSTAR 2013

How to get the most out of EuroSTAR 2013 Overview The idea of a conference like EuroSTAR can be a little daunting, even if this is not the first time that you have attended this or a similar gather of testers. So we (and who we are is covered

More information

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT PRACTICAL APPLICATIONS OF RANDOM SAMPLING IN ediscovery By Matthew Verga, J.D. INTRODUCTION Anyone who spends ample time working

More information

A Study of the Effectiveness of Using PER-Based Reforms in a Summer Setting

A Study of the Effectiveness of Using PER-Based Reforms in a Summer Setting A Study of the Effectiveness of Using PER-Based Reforms in a Summer Setting Turhan Carroll University of Colorado-Boulder REU Program Summer 2006 Introduction/Background Physics Education Research (PER)

More information

School Size and the Quality of Teaching and Learning

School Size and the Quality of Teaching and Learning School Size and the Quality of Teaching and Learning An Analysis of Relationships between School Size and Assessments of Factors Related to the Quality of Teaching and Learning in Primary Schools Undertaken

More information

Team Formation for Generalized Tasks in Expertise Social Networks

Team Formation for Generalized Tasks in Expertise Social Networks IEEE International Conference on Social Computing / IEEE International Conference on Privacy, Security, Risk and Trust Team Formation for Generalized Tasks in Expertise Social Networks Cheng-Te Li Graduate

More information

Experience Corps. Mentor Toolkit

Experience Corps. Mentor Toolkit Experience Corps Mentor Toolkit 2 AARP Foundation Experience Corps Mentor Toolkit June 2015 Christian Rummell Ed. D., Senior Researcher, AIR 3 4 Contents Introduction and Overview...6 Tool 1: Definitions...8

More information

Houghton Mifflin Online Assessment System Walkthrough Guide

Houghton Mifflin Online Assessment System Walkthrough Guide Houghton Mifflin Online Assessment System Walkthrough Guide Page 1 Copyright 2007 by Houghton Mifflin Company. All Rights Reserved. No part of this document may be reproduced or transmitted in any form

More information

Aspectual Classes of Verb Phrases

Aspectual Classes of Verb Phrases Aspectual Classes of Verb Phrases Current understanding of verb meanings (from Predicate Logic): verbs combine with their arguments to yield the truth conditions of a sentence. With such an understanding

More information

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA Testing a Moving Target How Do We Test Machine Learning Systems? Peter Varhol, Technology

More information

12- A whirlwind tour of statistics

12- A whirlwind tour of statistics CyLab HT 05-436 / 05-836 / 08-534 / 08-734 / 19-534 / 19-734 Usable Privacy and Security TP :// C DU February 22, 2016 y & Secu rivac rity P le ratory bo La Lujo Bauer, Nicolas Christin, and Abby Marsh

More information

INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE

INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE Please read the following carefully: The completed application packet with all materials listed below must be submitted and reviewed by an ISF

More information

The Evolution of Random Phenomena

The Evolution of Random Phenomena The Evolution of Random Phenomena A Look at Markov Chains Glen Wang glenw@uchicago.edu Splash! Chicago: Winter Cascade 2012 Lecture 1: What is Randomness? What is randomness? Can you think of some examples

More information

Cooperative Game Theoretic Models for Decision-Making in Contexts of Library Cooperation 1

Cooperative Game Theoretic Models for Decision-Making in Contexts of Library Cooperation 1 Cooperative Game Theoretic Models for Decision-Making in Contexts of Library Cooperation 1 Robert M. Hayes Abstract This article starts, in Section 1, with a brief summary of Cooperative Economic Game

More information

Getting Started with Deliberate Practice

Getting Started with Deliberate Practice Getting Started with Deliberate Practice Most of the implementation guides so far in Learning on Steroids have focused on conceptual skills. Things like being able to form mental images, remembering facts

More information

Conceptual Framework: Presentation

Conceptual Framework: Presentation Meeting: Meeting Location: International Public Sector Accounting Standards Board New York, USA Meeting Date: December 3 6, 2012 Agenda Item 2B For: Approval Discussion Information Objective(s) of Agenda

More information

Undergraduates Views of K-12 Teaching as a Career Choice

Undergraduates Views of K-12 Teaching as a Career Choice Undergraduates Views of K-12 Teaching as a Career Choice A Report Prepared for The Professional Educator Standards Board Prepared by: Ana M. Elfers Margaret L. Plecki Elise St. John Rebecca Wedel University

More information

Red Flags of Conflict

Red Flags of Conflict CONFLICT MANAGEMENT Introduction Webster s Dictionary defines conflict as a battle, contest of opposing forces, discord, antagonism existing between primitive desires, instincts and moral, religious, or

More information

3. Improving Weather and Emergency Management Messaging: The Tulsa Weather Message Experiment. Arizona State University

3. Improving Weather and Emergency Management Messaging: The Tulsa Weather Message Experiment. Arizona State University 3. Improving Weather and Emergency Management Messaging: The Tulsa Weather Message Experiment Kenneth J. Galluppi 1, Steven F. Piltz 2, Kathy Nuckles 3*, Burrell E. Montz 4, James Correia 5, and Rachel

More information

FF+FPG: Guiding a Policy-Gradient Planner

FF+FPG: Guiding a Policy-Gradient Planner FF+FPG: Guiding a Policy-Gradient Planner Olivier Buffet LAAS-CNRS University of Toulouse Toulouse, France firstname.lastname@laas.fr Douglas Aberdeen National ICT australia & The Australian National University

More information

INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE

INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE INTERDISCIPLINARY STUDIES FIELD MAJOR APPLICATION TO DECLARE Please read the following carefully: The completed application packet with all materials listed below must be submitted and reviewed by an ISF

More information

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

San José State University Department of Psychology PSYC , Human Learning, Spring 2017 San José State University Department of Psychology PSYC 155-03, Human Learning, Spring 2017 Instructor: Valerie Carr Office Location: Dudley Moorhead Hall (DMH), Room 318 Telephone: (408) 924-5630 Email:

More information

Usability Design Strategies for Children: Developing Children Learning and Knowledge in Decreasing Children Dental Anxiety

Usability Design Strategies for Children: Developing Children Learning and Knowledge in Decreasing Children Dental Anxiety Presentation Title Usability Design Strategies for Children: Developing Child in Primary School Learning and Knowledge in Decreasing Children Dental Anxiety Format Paper Session [ 2.07 ] Sub-theme Teaching

More information

ACTIVITY: Comparing Combination Locks

ACTIVITY: Comparing Combination Locks 5.4 Compound Events outcomes of one or more events? ow can you find the number of possible ACIVIY: Comparing Combination Locks Work with a partner. You are buying a combination lock. You have three choices.

More information

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only.

AP Calculus AB. Nevada Academic Standards that are assessable at the local level only. Calculus AB Priority Keys Aligned with Nevada Standards MA I MI L S MA represents a Major content area. Any concept labeled MA is something of central importance to the entire class/curriculum; it is a

More information

Learning From the Past with Experiment Databases

Learning From the Past with Experiment Databases Learning From the Past with Experiment Databases Joaquin Vanschoren 1, Bernhard Pfahringer 2, and Geoff Holmes 2 1 Computer Science Dept., K.U.Leuven, Leuven, Belgium 2 Computer Science Dept., University

More information

Procedia Computer Science

Procedia Computer Science Available online at www.sciencedirect.com Procedia Computer Science 00 (2012) 000 000 Procedia Computer Science www.elsevier.com/locate/procedia New Challenges in Systems Engineering and Architecting Conference

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

Understanding and Changing Habits

Understanding and Changing Habits Understanding and Changing Habits We are what we repeatedly do. Excellence, then, is not an act, but a habit. Aristotle Have you ever stopped to think about your habits or how they impact your daily life?

More information

1 3-5 = Subtraction - a binary operation

1 3-5 = Subtraction - a binary operation High School StuDEnts ConcEPtions of the Minus Sign Lisa L. Lamb, Jessica Pierson Bishop, and Randolph A. Philipp, Bonnie P Schappelle, Ian Whitacre, and Mindy Lewis - describe their research with students

More information

Virtually Anywhere Episodes 1 and 2. Teacher s Notes

Virtually Anywhere Episodes 1 and 2. Teacher s Notes Virtually Anywhere Episodes 1 and 2 Geeta and Paul are final year Archaeology students who don t get along very well. They are working together on their final piece of coursework, and while arguing over

More information

A Comparison of Standard and Interval Association Rules

A Comparison of Standard and Interval Association Rules A Comparison of Standard and Association Rules Choh Man Teng cmteng@ai.uwf.edu Institute for Human and Machine Cognition University of West Florida 4 South Alcaniz Street, Pensacola FL 325, USA Abstract

More information

hmhco.com The Power of Blended Learning Maximizing Instructional Time, Accelerating Student Achievement

hmhco.com The Power of Blended Learning Maximizing Instructional Time, Accelerating Student Achievement hmhco.com he ower of Blended earning Maximizing nstructional ime, ccelerating Student chievement mplementing 180 With Success From its inception, 180 has provided districts with a powerful blended learning

More information

Centralized Assignment of Students to Majors: Evidence from the University of Costa Rica. Job Market Paper

Centralized Assignment of Students to Majors: Evidence from the University of Costa Rica. Job Market Paper Centralized Assignment of Students to Majors: Evidence from the University of Costa Rica Job Market Paper Allan Hernandez-Chanto December 22, 2016 Abstract Many countries use a centralized admissions process

More information

Probability estimates in a scenario tree

Probability estimates in a scenario tree 101 Chapter 11 Probability estimates in a scenario tree An expert is a person who has made all the mistakes that can be made in a very narrow field. Niels Bohr (1885 1962) Scenario trees require many numbers.

More information

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming Data Mining VI 205 Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming C. Romero, S. Ventura, C. Hervás & P. González Universidad de Córdoba, Campus Universitario de

More information

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology Essentials of Ability Testing Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology Basic Topics Why do we administer ability tests? What do ability tests measure? How are

More information

What Am I Getting Into?

What Am I Getting Into? 01-Eller.qxd 2/18/2004 7:02 PM Page 1 1 What Am I Getting Into? What lies behind us is nothing compared to what lies within us and ahead of us. Anonymous You don t invent your mission, you detect it. Victor

More information

Direct and Indirect Passives in East Asian. C.-T. James Huang Harvard University

Direct and Indirect Passives in East Asian. C.-T. James Huang Harvard University Direct and Indirect Passives in East Asian C.-T. James Huang Harvard University 8.20-22.2002 I. Direct and Indirect Passives (1) Direct (as in 2a) Passive Inclusive (as in 2b) Indirect Exclusive (Adversative,

More information

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees

Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Impact of Cluster Validity Measures on Performance of Hybrid Models Based on K-means and Decision Trees Mariusz Łapczy ski 1 and Bartłomiej Jefma ski 2 1 The Chair of Market Analysis and Marketing Research,

More information

FREE COLLEGE Can Happen to You!

FREE COLLEGE Can Happen to You! FREE COLLEGE Can Happen to You! LECTURE NOTES by Jean Burk Thank you so much for coming to my seminars/ booth or purchasing this lecture on tape. Here is the handout from my speech, complete with notes.

More information

Developing Grammar in Context

Developing Grammar in Context Developing Grammar in Context intermediate with answers Mark Nettle and Diana Hopkins PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United

More information

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR ROLAND HAUSSER Institut für Deutsche Philologie Ludwig-Maximilians Universität München München, West Germany 1. CHOICE OF A PRIMITIVE OPERATION The

More information

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Read Online and Download Ebook ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF Click link bellow and free register to download

More information

COOPERATIVE WORK EXPERIENCE - OVERVIEW

COOPERATIVE WORK EXPERIENCE - OVERVIEW Register for Course Complete Application Prepare for Orientation Create Learning Objectives Complete Work Hours Worksite Visit Final Meeting with Instructor COOPERATIVE WORK EXPERIENCE - OVERVIEW Prior

More information

Rote rehearsal and spacing effects in the free recall of pure and mixed lists. By: Peter P.J.L. Verkoeijen and Peter F. Delaney

Rote rehearsal and spacing effects in the free recall of pure and mixed lists. By: Peter P.J.L. Verkoeijen and Peter F. Delaney Rote rehearsal and spacing effects in the free recall of pure and mixed lists By: Peter P.J.L. Verkoeijen and Peter F. Delaney Verkoeijen, P. P. J. L, & Delaney, P. F. (2008). Rote rehearsal and spacing

More information

Axiom 2013 Team Description Paper

Axiom 2013 Team Description Paper Axiom 2013 Team Description Paper Mohammad Ghazanfari, S Omid Shirkhorshidi, Farbod Samsamipour, Hossein Rahmatizadeh Zagheli, Mohammad Mahdavi, Payam Mohajeri, S Abbas Alamolhoda Robotics Scientific Association

More information

A Version Space Approach to Learning Context-free Grammars

A Version Space Approach to Learning Context-free Grammars Machine Learning 2: 39~74, 1987 1987 Kluwer Academic Publishers, Boston - Manufactured in The Netherlands A Version Space Approach to Learning Context-free Grammars KURT VANLEHN (VANLEHN@A.PSY.CMU.EDU)

More information

(Includes a Detailed Analysis of Responses to Overall Satisfaction and Quality of Academic Advising Items) By Steve Chatman

(Includes a Detailed Analysis of Responses to Overall Satisfaction and Quality of Academic Advising Items) By Steve Chatman Report #202-1/01 Using Item Correlation With Global Satisfaction Within Academic Division to Reduce Questionnaire Length and to Raise the Value of Results An Analysis of Results from the 1996 UC Survey

More information

NCEO Technical Report 27

NCEO Technical Report 27 Home About Publications Special Topics Presentations State Policies Accommodations Bibliography Teleconferences Tools Related Sites Interpreting Trends in the Performance of Special Education Students

More information

How to read a Paper ISMLL. Dr. Josif Grabocka, Carlotta Schatten

How to read a Paper ISMLL. Dr. Josif Grabocka, Carlotta Schatten How to read a Paper ISMLL Dr. Josif Grabocka, Carlotta Schatten Hildesheim, April 2017 1 / 30 Outline How to read a paper Finding additional material Hildesheim, April 2017 2 / 30 How to read a paper How

More information

International Business BADM 455, Section 2 Spring 2008

International Business BADM 455, Section 2 Spring 2008 International Business BADM 455, Section 2 Spring 2008 Call #: 11947 Class Meetings: 12:00 12:50 pm, Monday, Wednesday & Friday Credits Hrs.: 3 Room: May Hall, room 309 Instruct or: Rolf Butz Office Hours:

More information

Improving Action Selection in MDP s via Knowledge Transfer

Improving Action Selection in MDP s via Knowledge Transfer In Proc. 20th National Conference on Artificial Intelligence (AAAI-05), July 9 13, 2005, Pittsburgh, USA. Improving Action Selection in MDP s via Knowledge Transfer Alexander A. Sherstov and Peter Stone

More information
