UNIVERSITY OF CALIFORNIA SANTA CRUZ TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL

UNIVERSITY OF CALIFORNIA
SANTA CRUZ

TOWARDS A UNIVERSAL PARAMETRIC PLAYER MODEL

A thesis submitted in partial satisfaction of the requirements for the degree of
DOCTOR OF PHILOSOPHY
in
COMPUTER SCIENCE
by
JOSEPH OSBORN
November 2012

The Thesis of Joseph Osborn is approved:
Professor X, Chair
Professor Y
Professor Z

Tyrus Miller
Vice Provost and Dean of Graduate Studies

Copyright © by Joseph Osborn 2012

Table of Contents
List of Figures
List of Tables
Abstract
Dedication
Acknowledgments
1 Introduction
2 Related Work
3 Method
4 Experiments
5 Results
6 Discussion
7 Figure/Table/Ref Competency
References

List of Figures
1 Combat in Dragon Quest: A toy environment

List of Tables
1 Edges in a DFS forest

Abstract

Towards A Universal Parametric Player Model
by Joseph Osborn

1. State the problem briefly.
2. Describe the methodology.
3. Summarize the findings.

ProQuest recommends that the Abstract be no longer than 350 words, as it may be posted to sites with limited file size.

DEDICATION!

ACKNOWLEDGMENTS!

1 Introduction

Automated design support for goal-oriented dynamic systems with human users (e.g. games, software, or amusement parks) relies on three types of model to make useful recommendations: a model of the task under consideration, a model of the human doing the design, and a model of the human(s) using the designed system. In the domain of game design, this last concept is called a player model [1]. There are many types of player model, used for different analysis or user-adaptation tasks, but here I am concerned with generalizing a set of Individual Generative Action models from individual games to a Universal Induced Generative Action model applicable to any of a class of games. These terms indicate that the first models exist to Generate game Actions representative of an Individual player; if generalization succeeds, I gain a model Induced from the data which describes the Universe of players, and from which I can synthesize any individual by instantiating a parameter vector. Specifically, I intend to lift a fingerprint from agents trained in the same style across multiple toy environments: hidden parameters which inform the specific policies in each case. For the sake of data gathering and labeling, I make the simplifying assumption that I may use Synthetic (justified by reference to an internal belief or external theory) models in the data to be classified, rather than requiring that the Individual models be Induced from actual human play traces; this is an expedience, not a requirement of the theory. In other words, I want to identify whether it is possible to classify distinct play traces, possibly from distinct games, as possessing the same style or some other identifying characteristics, and to describe those characteristics.

I expect from intuition that candidate parameters include, but are not limited to, risk-aversion, greed, forethought, stubbornness, and curiosity, all of which can be operationalized in terms of expected negative reward (loss), expected positive reward, time discounting factors, and selected actions over time. Additional parameters such as reaction time or social consciousness might also be exhibited in real-time or multi-user environments, but this is an area for future research. It is enough for this project if the parameters cover the players of the class of turn-taking games.

Figure 1: Combat in Dragon Quest: A toy environment.

2 Related Work

Könik et al. have studied how an agent can transfer task knowledge across environments [2], but I am more interested in how an agent with the same generic parameters but no knowledge transfer would learn in the new environment. Researchers have used play metrics [3] and specific actions [4, 5] to classify players into groups, and some have identified game-specific player parameters [6]. Of the parametric approaches, Spronck and den Teuling's [6] considers hours of play and requires that players signal changes in intention, while Gold's [5] considers only a single task in the game under test and, further, finds that non-synthetic players cut across all of the preconceived categories in the model. Thue et al. [4] use an online and incremental approach which hand-codes the parameter shifts based on in-game events, whereas I hope to induce those parameter shifts by other means. All of these classifications are closely and pragmatically tied to the games or genres under test.

3 Method

...
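As an illustration of the parameter vector described in the Introduction, the following minimal sketch shows one way a few of the candidate parameters (risk-aversion, greed, and a time-discounting factor) could inform action selection in a turn-taking toy environment. Everything here is a hypothetical assumption made for illustration: the names `PlayerParams`, `Action`, `utility`, and `choose_action`, the linear utility form, and the action values are not taken from the thesis.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlayerParams:
    """Hypothetical parameter vector for one synthetic player."""
    risk_aversion: float  # weight on expected negative reward (loss)
    greed: float          # weight on expected positive reward
    discount: float       # per-turn time-discounting factor in (0, 1]


@dataclass(frozen=True)
class Action:
    """Hypothetical summary of one action in a turn-taking game."""
    name: str
    expected_gain: float
    expected_loss: float
    delay: int  # number of turns before the outcome is realized


def utility(params: PlayerParams, action: Action) -> float:
    """Discounted, parameter-weighted value of an action (assumed linear form)."""
    weighted = (params.greed * action.expected_gain
                - params.risk_aversion * action.expected_loss)
    return (params.discount ** action.delay) * weighted


def choose_action(params: PlayerParams, actions: list[Action]) -> Action:
    """Pick the action with the highest utility under the given parameters."""
    return max(actions, key=lambda a: utility(params, a))


# Illustrative action set for a combat encounter (made-up numbers).
ACTIONS = [
    Action("attack", expected_gain=10.0, expected_loss=6.0, delay=0),
    Action("defend", expected_gain=2.0, expected_loss=1.0, delay=0),
    Action("charge", expected_gain=18.0, expected_loss=10.0, delay=3),
]

CAUTIOUS = PlayerParams(risk_aversion=2.0, greed=1.0, discount=0.9)
IMPATIENT = PlayerParams(risk_aversion=0.5, greed=1.5, discount=0.5)
PATIENT = PlayerParams(risk_aversion=0.5, greed=1.5, discount=0.99)

if __name__ == "__main__":
    for label, p in [("cautious", CAUTIOUS), ("impatient", IMPATIENT), ("patient", PATIENT)]:
        print(label, "->", choose_action(p, ACTIONS).name)
```

Under these assumed numbers, the same action set yields different choices for different parameter vectors, which is the kind of cross-game regularity a fingerprint would need to capture.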

4 Experiments

...

5 Results

Results information will go in here...

6 Discussion

Discussion information will go in here...

7 Figure/Table/Ref Competency

Above is a test figure 1 and below is a test table 1 and aligned equation environment (I prefer align to the \[\] environment). And here's a URL: http://ndseg.asee.org/application_instructions/summary_of_goals.

Edge  Order  Type
GD    1      Tree
DA    2      Tree
AB    3      Tree
BC    4      Tree
BD    5      Back
AC    6      Descendant
AF    7      Tree
FA    8      Back
FC    9      Cross
DC    10     Descendant
GE    11     Tree
EC    12     Cross
EG    13     Back

Table 1: Edges in a DFS forest.
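The edge types in Table 1 can be reproduced with a short depth-first search. The sketch below is illustrative only: the function name `classify_edges` and the adjacency lists in `GRAPH` are assumptions, with the successor order inferred from the Order column of the table rather than stated anywhere in the text.

```python
def classify_edges(graph, roots):
    """Classify every edge of a directed graph during a depth-first search.

    graph maps each node to an ordered list of successors.
    Returns (edge, order, type) rows in the order the edges are explored.
    """
    discovered = {}  # node -> discovery time
    finished = set()  # nodes whose exploration is complete
    rows = []
    clock = 0

    def visit(u):
        nonlocal clock
        clock += 1
        discovered[u] = clock
        for v in graph[u]:
            order = len(rows) + 1
            if v not in discovered:
                rows.append((u + v, order, "Tree"))
                visit(v)
            elif v not in finished:
                # v is still on the recursion stack, so it is an ancestor of u.
                rows.append((u + v, order, "Back"))
            elif discovered[u] < discovered[v]:
                # v was discovered after u and is already done: a descendant of u.
                rows.append((u + v, order, "Descendant"))
            else:
                rows.append((u + v, order, "Cross"))
        finished.add(u)

    for root in roots:
        if root not in discovered:
            visit(root)
    return rows


# Assumed adjacency lists, ordered so that exploration matches Table 1.
GRAPH = {
    "G": ["D", "E"],
    "D": ["A", "C"],
    "A": ["B", "C", "F"],
    "B": ["C", "D"],
    "C": [],
    "F": ["A", "C"],
    "E": ["C", "G"],
}

if __name__ == "__main__":
    for edge, order, kind in classify_edges(GRAPH, ["G"]):
        print(f"{edge:<5} {order:<6} {kind}")
```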

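An equation:

\begin{align}
\mathbf{w} &= (X^T X)^{-1} X^T \mathbf{t} \\
&= \left(
\begin{bmatrix} 1 & 1 & 1 & 1 & 1 \\ 3 & 6 & 7 & 8 & 1 \\ 9 & 9 & 7 & 6 & 0 \\ 2 & 1 & 7 & 4 & 8 \end{bmatrix}
\begin{bmatrix} 1 & 3 & 9 & 2 \\ 1 & 6 & 9 & 1 \\ 1 & 7 & 7 & 7 \\ 1 & 8 & 6 & 4 \\ 1 & 1 & 0 & 8 \end{bmatrix}
\right)^{-1}
\begin{bmatrix} 1 & 1 & 1 & 1 & 1 \\ 3 & 6 & 7 & 8 & 1 \\ 9 & 9 & 7 & 6 & 0 \\ 2 & 1 & 7 & 4 & 8 \end{bmatrix}
\mathbf{t} \\
&= \begin{bmatrix} 5 & 25 & 31 & 22 \\ 25 & 159 & 178 & 101 \\ 31 & 178 & 247 & 100 \\ 22 & 101 & 100 & 134 \end{bmatrix}^{-1}
\begin{bmatrix} 1 & 1 & 1 & 1 & 1 \\ 3 & 6 & 7 & 8 & 1 \\ 9 & 9 & 7 & 6 & 0 \\ 2 & 1 & 7 & 4 & 8 \end{bmatrix}
\mathbf{t} \\
&= \begin{bmatrix} 6.611 & 0.080 & -0.607 & -0.693 \\ 0.080 & 0.047 & -0.035 & -0.023 \\ -0.607 & -0.035 & 0.078 & 0.068 \\ -0.693 & -0.023 & 0.068 & 0.088 \end{bmatrix}
\begin{bmatrix} 1 & 1 & 1 & 1 & 1 \\ 3 & 6 & 7 & 8 & 1 \\ 9 & 9 & 7 & 6 & 0 \\ 2 & 1 & 7 & 4 & 8 \end{bmatrix}
\mathbf{t} \\
&= \begin{bmatrix} 0.002 & 0.935 & -1.929 & 0.837 & 1.147 \\ -0.140 & 0.024 & 0.003 & 0.154 & -0.057 \\ 0.126 & -0.047 & 0.170 & -0.147 & -0.098 \\ 0.026 & -0.131 & 0.238 & -0.117 & -0.012 \end{bmatrix}
\mathbf{t}
\end{align}

Here \mathbf{t} is the target vector, of which only the entries 10, 11, 3 are legible, and the resulting weights are printed as \mathbf{w} = (4.279, 0.309, 1.878, 0.866)^T; any signs on these weights are not legible.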
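The normal-equation solution w = (X^T X)^{-1} X^T t can be checked numerically. The sketch below uses only the Python standard library and exact rational arithmetic; the helper names (`transpose`, `matmul`, `inverse`, `least_squares`) are illustrative, and the target vector `T_DEMO` is a hypothetical stand-in built from assumed weights, since the full target vector is not legible in the text.

```python
from fractions import Fraction


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(row) for row in zip(*m)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def inverse(m):
    """Invert a square matrix exactly by Gauss-Jordan elimination."""
    n = len(m)
    aug = [[Fraction(v) for v in row] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        # Find a row with a nonzero pivot and move it into place.
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def least_squares(x, t):
    """Solve w = (X^T X)^{-1} X^T t for a design matrix x and targets t."""
    xt = transpose(x)
    projector = matmul(inverse(matmul(xt, x)), xt)
    return [row[0] for row in matmul(projector, [[v] for v in t])]


# Design matrix from the equation above (first column is the bias term).
X = [
    [1, 3, 9, 2],
    [1, 6, 9, 1],
    [1, 7, 7, 7],
    [1, 8, 6, 4],
    [1, 1, 0, 8],
]

# Hypothetical targets generated from assumed weights, for demonstration only.
W_DEMO = [1, 2, 0, -1]
T_DEMO = [sum(xi * wi for xi, wi in zip(row, W_DEMO)) for row in X]

if __name__ == "__main__":
    gram = matmul(transpose(X), X)
    print("X^T X =", gram)
    print("(X^T X)^-1 =", [[round(float(v), 3) for v in row] for row in inverse(gram)])
    print("recovered w =", least_squares(X, T_DEMO))
```

Because the demonstration targets lie exactly in the column space of X, the recovered weights equal the assumed ones; with noisy targets the same function would return the least-squares fit instead.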

References

[1] A. Smith, C. Lewis, K. Hullett, G. Smith, and A. Sullivan, "An inclusive taxonomy of player modeling," University of California, Santa Cruz, Tech. Rep. UCSC-SOE-11-13, 2011. [Online]. Available: http://sokath.com/main/files/amsmith-ucsc-soe-11-13.pdf.
[2] T. Könik, P. O'Rorke, D. Shapiro, D. Choi, N. Nejati, and P. Langley, "Skill transfer through goal-driven representation mapping," Cognitive Systems Research, vol. 10, no. 3, pp. 270–285, 2009. [Online]. Available: http://www.sciencedirect.com/science/article/pii/s1389041708000715.
[3] A. Drachen, A. Canossa, and G. Yannakakis, "Player modeling using self-organization in Tomb Raider: Underworld," pp. 1–8, 2009. [Online]. Available: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5286500.
[4] D. Thue, V. Bulitko, M. Spetch, and E. Wasylishen, "Interactive storytelling: A player modelling approach," pp. 43–48, 2007. [Online]. Available: http://www.aaai.org/papers/aiide/2007/aiide07-008.pdf.
[5] K. Gold, "Trigram Timmies and Bayesian Johnnies: Probabilistic Models of Personality in Dominion," 2011. [Online]. Available: http://www.aaai.org/ocs/index.php/aiide/aiide11/paper/viewfile/4072/4426.
[6] P. Spronck and F. den Teuling, "Player Modeling in Civilization IV," pp. 180–185, 2010. [Online]. Available: http://www.aaai.org/ocs/index.php/aiide/AIIDE10/paper/viewPDFInterstitial/2124/2565.