Meta Learning & Self Play
- Chad Powers
Transcription
1 Meta Learning & Self Play Ilya Sutskever MARCH 24, 2018
2 The Reinforcement Learning Problem
3 Reinforcement Learning (RL). Good framework for building intelligent agents. Acting to achieve goals is a key part of intelligence. Can specify nearly any AI problem. RL is interesting because interesting RL algorithms exist. [Diagram: the agent sends an action to the environment; the environment returns an observation and a reward.]
4 Reinforcement Learning Formulation: find a policy that maximizes expected reward. Rewards are given by the environment. In the real world, environments don't specify rewards; it is up to the agent to determine that a reward has occurred. [Diagram: agent, action, environment, observation, reward.]
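The formulation on slide 4 can be written compactly as follows. This is a sketch under assumptions that are not on the slide: a finite horizon T, a trajectory \tau sampled by running the policy \pi in the environment, and an undiscounted sum of per-step rewards r_t.

```latex
\pi^{*} \;=\; \arg\max_{\pi} \; \mathbb{E}_{\tau \sim \pi}\!\left[ \sum_{t=0}^{T} r_t \right]
```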
5 Reinforcement Learning. Agent = neural network. [Diagram: agent, action, environment, observation, reward.]
6 Reinforcement Learning algorithms in a nutshell. Add randomness to your actions. If the result was better than expected, do more of the same in the future.
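The two-step recipe on slide 6 can be illustrated with a minimal sketch on a two-armed bandit. Everything specific here is an assumption for illustration: the arm payoffs, the softmax policy, the running-average baseline standing in for "expected", and the name `run_bandit`. It is one simple policy-gradient instance of the idea, not the algorithm used in the talk.

```python
import math
import random


def run_bandit(steps=2000, lr=0.1, seed=0):
    """Gradient-bandit sketch: sample actions, reinforce better-than-expected ones."""
    rng = random.Random(seed)
    true_means = [0.2, 0.8]  # hypothetical win probability of each arm
    prefs = [0.0, 0.0]       # action preferences (softmax logits)
    baseline = 0.0           # running estimate of the expected reward

    for _ in range(steps):
        # Softmax over preferences gives a stochastic policy (the "randomness").
        top = max(prefs)
        exps = [math.exp(p - top) for p in prefs]
        total = sum(exps)
        probs = [e / total for e in exps]

        action = 0 if rng.random() < probs[0] else 1
        reward = 1.0 if rng.random() < true_means[action] else 0.0

        # "Better than expected" is a positive advantage over the baseline.
        advantage = reward - baseline

        # Gradient of log softmax w.r.t. preference i is 1[i == action] - probs[i].
        for i in range(len(prefs)):
            indicator = 1.0 if i == action else 0.0
            prefs[i] += lr * advantage * (indicator - probs[i])

        baseline += 0.05 * (reward - baseline)

    return prefs


prefs = run_bandit()
print("learned preferences:", prefs)
```

With these toy payoffs, the preference for the higher-paying arm should end up larger, because its samples beat the baseline more often.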
7 RL's potential. An agent running a really good RL algorithm can accomplish an overwhelming variety of tasks: the goal achiever. A truly good RL algorithm will combine elements of supervised learning, unsupervised and representation learning, reasoning and inference at test time, and more! Today's RL algorithms have a very long way to go, but that doesn't mean progress will be slow.
8 Hindsight Experience Replay [Andrychowicz et al., 2017]
9 Exploration can be hard. When rewards are sparse, most random attempts result in failure, and thus no learning. Can we learn from failure?
10 Learn From Failure. Setup: build a system that can reach any state. Goal: reach state A. Any trajectory ends up in some other state B. Use this as training data to reach state B? [Diagram: from the starting point, try to reach A; the result shows how to reach B.]
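The relabeling idea on slide 10 can be sketched as below. The integer states, the sparse 0/1 reward, the tuple layout, and the function names are all assumptions for illustration. The sketch shows only the simplest choice, which is to relabel with the final state actually reached. It is not the replay buffer or training loop from the paper.

```python
from typing import List, Tuple

# (state, action, next_state, goal, reward); integer states are a toy assumption.
Transition = Tuple[int, int, int, int, float]


def sparse_reward(next_state: int, goal: int) -> float:
    """Reward is 1 only when the goal is reached exactly, otherwise 0."""
    return 1.0 if next_state == goal else 0.0


def relabel_with_final_state(episode: List[Transition]) -> List[Transition]:
    """Pretend the state the episode ended in (B) was the goal all along."""
    achieved = episode[-1][2]
    return [
        (s, a, s2, achieved, sparse_reward(s2, achieved))
        for (s, a, s2, _goal, _reward) in episode
    ]


# A failed attempt: the goal was state 5 (A), but the walk ended in state 3 (B).
episode = [(0, 1, 1, 5, 0.0), (1, 1, 2, 5, 0.0), (2, 1, 3, 5, 0.0)]
relabeled = relabel_with_final_state(episode)
print(relabeled)
```

The original episode carries no reward signal at all. The relabeled copy ends with a reward of 1, so the failed attempt becomes a successful demonstration of reaching B.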
11 Cool visual explanation of HER
12 Dynamics randomization for Sim2Real [Peng et al., 2017]
13 Sim2Real with meta learning [Peng et al., 2017]. It would be nice to train robots in simulation and have the policies succeed on the real robot.
14 Key idea: simulation randomization. Randomize simulation parameters: gravity, friction, torques, width and length of different geometric shapes, type of contact simulation, etc. Train a policy that can adapt to all settings of simulation parameters.
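The randomization step on slide 14 can be sketched as drawing a fresh set of physics parameters for every training episode. The parameter names, the ranges, and the helpers `sample_sim_params` and `training_episodes` are hypothetical. A real setup would pass the sampled values to a simulator and roll out the policy, which is omitted here.

```python
import random
from dataclasses import dataclass
from typing import Iterator


@dataclass
class SimParams:
    gravity: float       # m/s^2 (hypothetical range below)
    friction: float      # friction coefficient
    torque_scale: float  # multiplier on commanded torques


def sample_sim_params(rng: random.Random) -> SimParams:
    """Draw one random configuration of the simulated world."""
    return SimParams(
        gravity=rng.uniform(8.0, 11.0),
        friction=rng.uniform(0.5, 1.5),
        torque_scale=rng.uniform(0.8, 1.2),
    )


def training_episodes(num_episodes: int, seed: int = 0) -> Iterator[SimParams]:
    """Yield a new world per episode, so the policy never sees one fixed dynamics."""
    rng = random.Random(seed)
    for _ in range(num_episodes):
        # A real loop would reset the simulator with these values and collect a rollout.
        yield sample_sim_params(rng)


for params in training_episodes(3):
    print(params)
```

Because the policy cannot rely on any single setting, it has to infer the current dynamics from what it observes within an episode.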
15 This is a meta learning approach. Policy quickly infers simulation parameters. Could it infer the simulation parameters of the real world?
16 Baseline
17 Results
18 Learning a hierarchy of actions with meta learning [Frans et al., 2017]
19 It would be nice if learning was hierarchical. Current RL learns by trying out random actions at each timestep. Downsides: hard to explore in a persistent direction; hard to do credit assignment over long horizons. Example: suppose all your agents want to maximize your GDP. Should each agent decide if it should go to work on the basis of GDP fluctuations? May require a real model to really solve this problem.
20 Meta learning approach to hierarchy. Ingredients: a distribution over tasks. Goal: learn a set of meta-actions that solve training tasks as quickly as possible.
21 Evolved Policy Gradients [Houthooft et al., 2018]
22 Goal: learn a cost function that leads to rapid learning. Train a cost function such that RL on this cost function learns very quickly. Ingredients: a distribution over tasks. Use evolution strategies to learn the cost function.
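The outer/inner structure on slide 22 can be illustrated with a deliberately tiny sketch. It is a toy under heavy assumptions, not the method from the paper: the learned cost is a single scalar `w` defining the loss (x - w)^2, the inner learner is five gradient steps on that loss, the true task is to end at x = 3, and there is one task rather than a distribution. All names are hypothetical.

```python
import random

TRUE_TARGET = 3.0  # hypothetical task: the learner should end up at x = 3


def inner_learning(w: float, steps: int = 5, lr: float = 0.2) -> float:
    """Inner loop: a short run of gradient descent on the learned cost (x - w)^2."""
    x = 0.0
    for _ in range(steps):
        x -= lr * 2.0 * (x - w)
    return x


def fitness(w: float) -> float:
    """True task score after the short inner run (higher is better)."""
    x = inner_learning(w)
    return -((x - TRUE_TARGET) ** 2)


def evolve_cost(generations=200, pop=20, sigma=0.1, lr=0.05, seed=0) -> float:
    """Outer loop: evolution strategies on the cost parameter w."""
    rng = random.Random(seed)
    w = 0.0
    for _ in range(generations):
        noise = [rng.gauss(0.0, 1.0) for _ in range(pop)]
        scores = [fitness(w + sigma * n) for n in noise]
        mean_score = sum(scores) / pop
        # ES gradient estimate: correlate score deviations with the perturbations.
        grad = sum((s - mean_score) * n for s, n in zip(scores, noise)) / (pop * sigma)
        w += lr * grad
    return w


w = evolve_cost()
print("learned cost parameter:", w)
print("learner ends at:", inner_learning(w))
```

In this toy the evolved `w` should settle slightly above 3: the outer loop shapes the cost to compensate for the inner learner's short training budget, which is the sense in which the cost is tuned for rapid learning.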
23 Result: a single learning trial
24 Result: a single learning trial Learned cost: never learned to move right
25 Self Play
26 Self Play: TD-Gammon (Tesauro, 1992). Incredibly old work: Q-learning + neural networks + self-play. Beat all humans, discovered unconventional strategies that were deemed to be better! Approach was dormant until DQN for Atari.
27 Self Play: AlphaGo Zero
28 Self Play: Dota 2. Pure self play. Popular competitive online e-sports game. Serious professional scene: $140M awarded in prizes. 5v5 is the main variant; 1v1 is also played. OpenAI beat all the pros at 1v1.
29 Appealing properties of Self Play. Simple environment, extremely complex strategy. Convert compute into data. Perfect curriculum.
30 Karl Sims, 1994. Self Play: Artificial Life
31 Karl Sims, 1994. Self Play: Artificial Life
32 Self Play for physicality and dexterity. Environment is simple, behavior is very complex. Pre-train general dexterity by competing against an opponent [Bansal et al., 2017].
33 What's next? Main open question: design the self play environment so that the result will be useful to some external task.
34 Can Self Play lead all the way to AGI? Social life incentivizes evolution of intelligence. "Because corvids and apes share these cognitive tools, we argue that complex cognitive abilities evolved multiple times in distantly related species with vastly different brain structures in order to solve similar socioecological problems." (Science, Vol. 306, Issue 5703, pp.) Mate selection. Open-ended self play produces: theory of mind, negotiation, social skills, empathy, real language understanding. [Figure: cranial capacity (500, 1000, 1500 cm³) against millions of years ago, for Sahelanthropus, Australopithecus, Homo habilis, Homo erectus, Homo neanderthalensis, and Homo sapiens.]
35 AI Alignment: Learning from human feedback [Christiano et al., 2017]
36 How to communicate goals quickly? One approach: have humans judge the behavior of an algorithm.
37 Human judges select good behavior
38 Fit a scalar reward function to the human feedback. Optimize a triplet loss: if a human judge deems that A > B, learn a real-valued reward consistent with the human feedback. [Diagram: the environment sends observations to the RL algorithm, which sends back actions; human feedback trains a reward predictor, which supplies the predicted reward to the RL algorithm.]
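Fitting a scalar reward to pairwise judgments, as slide 38 describes, can be sketched as follows. Note the substitution: instead of the loss named on the slide, this sketch assumes a pairwise logistic (Bradley-Terry style) loss, where the probability that A is preferred is the sigmoid of r(A) - r(B). The linear reward, the hand-made feature vectors, and the names `reward` and `fit_reward` are also assumptions for illustration.

```python
import math
from typing import List, Sequence, Tuple

Preference = Tuple[Sequence[float], Sequence[float]]  # (preferred, rejected) features


def reward(w: Sequence[float], x: Sequence[float]) -> float:
    """Linear reward model: a real-valued score for one behavior's features."""
    return sum(wi * xi for wi, xi in zip(w, x))


def fit_reward(preferences: List[Preference], dim: int,
               epochs: int = 200, lr: float = 0.5) -> List[float]:
    """Gradient descent on -log sigmoid(r(A) - r(B)) over all judged pairs."""
    w = [0.0] * dim
    for _ in range(epochs):
        for a, b in preferences:
            diff = reward(w, a) - reward(w, b)
            p_a = 1.0 / (1.0 + math.exp(-diff))  # model's P(A preferred over B)
            # Descent step: the loss gradient w.r.t. w is -(1 - p_a) * (a - b).
            for i in range(dim):
                w[i] += lr * (1.0 - p_a) * (a[i] - b[i])
    return w


# Hypothetical judgments: in each pair, the human preferred the first behavior.
preferences = [
    ([1.0, 0.0], [0.0, 1.0]),
    ([0.8, 0.1], [0.2, 0.9]),
    ([0.6, 0.3], [0.5, 0.7]),
]
w = fit_reward(preferences, dim=2)
print("reward weights:", w)
```

After fitting, the learned reward should rank every preferred behavior above its rejected counterpart. An RL algorithm could then optimize that predicted reward in place of an environment reward.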
39 500 bits of interaction. It works.
40 It works. Several thousand bits of interaction to solve Atari games.
41 Drive right behind the competitor. Can easily convey unusual goals.
42 Alignment: the future. The technical problem of subtle communication will likely be solved. But what are the right goals? Political problem.
43 Thanks! Visit openai.com to learn more.