The Nottingham eprints service makes this work by researchers of the University of Nottingham available open access under the following conditions.

Similar documents
SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Using dialogue context to improve parsing performance in dialogue systems

A Case Study: News Classification Based on Term Frequency

Automating the E-learning Personalization

AQUA: An Ontology-Driven Question Answering System

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

CS Machine Learning

Linking Task: Identifying authors and book titles in verbose queries

Seminar - Organic Computing

BEETLE II: a system for tutoring and computational linguistics experimentation

Getting the Story Right: Making Computer-Generated Stories More Entertaining

Summary / Response. Karl Smith, Accelerations Educational Software. Page 1 of 8

Word Segmentation of Off-line Handwritten Documents

On-Line Data Analytics

Rule Learning With Negation: Issues Regarding Effectiveness

An OO Framework for building Intelligence and Learning properties in Software Agents

Using Moodle in ESOL Writing Classes

Developing True/False Test Sheet Generating System with Diagnosing Basic Cognitive Ability

GACE Computer Science Assessment Test at a Glance

Organizational Knowledge Distribution: An Experimental Evaluation

Chamilo 2.0: A Second Generation Open Source E-learning and Collaboration Platform

COMPUTER-ASSISTED INDEPENDENT STUDY IN MULTIVARIATE CALCULUS

A new Dataset of Telephone-Based Human-Human Call-Center Interaction with Emotional Evaluation

Learning and Transferring Relational Instance-Based Policies

LQVSumm: A Corpus of Linguistic Quality Violations in Multi-Document Summarization

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Parsing of part-of-speech tagged Assamese Texts

AN INTRODUCTION (2 ND ED.) (LONDON, BLOOMSBURY ACADEMIC PP. VI, 282)

The Round Earth Project. Collaborative VR for Elementary School Kids

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

Jacqueline C. Kowtko, Patti J. Price Speech Research Program, SRI International, Menlo Park, CA 94025

DYNAMIC ADAPTIVE HYPERMEDIA SYSTEMS FOR E-LEARNING

Using Virtual Manipulatives to Support Teaching and Learning Mathematics

Problems of practice-based Doctorates in Art and Design: a viewpoint from Finland

EDITORIAL: ICT SUPPORT FOR KNOWLEDGE MANAGEMENT IN CONSTRUCTION

Integrating Meta-Level and Domain-Level Knowledge for Task-Oriented Dialogue

Evolution of Symbolisation in Chimpanzees and Neural Nets

Metadata of the chapter that will be visualized in SpringerLink

Fragment Analysis and Test Case Generation using F- Measure for Adaptive Random Testing and Partitioned Block based Adaptive Random Testing

NCEO Technical Report 27

Activities, Exercises, Assignments Copyright 2009 Cem Kaner 1

A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique

International Conference on Current Trends in ELT

An Investigation into Team-Based Planning

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Learning Lesson Study Course

Cross-Lingual Text Categorization

Shockwheat. Statistics 1, Activity 1

Revisiting the role of prosody in early language acquisition. Megha Sundara UCLA Phonetics Lab

KIS MYP Humanities Research Journal

Using AMT & SNOMED CT-AU to support clinical research

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

CWIS 23,3. Nikolaos Avouris Human Computer Interaction Group, University of Patras, Patras, Greece

Rule Learning with Negation: Issues Regarding Effectiveness

Learning Methods for Fuzzy Systems

Liquid Narrative Group Technical Report Number

Chemistry Senior Seminar - Spring 2016

MAKING YOUR OWN ALEXA SKILL SHRIMAI PRABHUMOYE, ALAN W BLACK

To link to this article: PLEASE SCROLL DOWN FOR ARTICLE

PowerTeacher Gradebook User Guide PowerSchool Student Information System

Patterns for Adaptive Web-based Educational Systems

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Evaluation of Usage Patterns for Web-based Educational Systems using Web Mining

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Using Blackboard.com Software to Reach Beyond the Classroom: Intermediate

AUTHORING E-LEARNING CONTENT TRENDS AND SOLUTIONS

BLACKBOARD TRAINING PHASE 2 CREATE ASSESSMENT. Essential Tool Part 1 Rubrics, page 3-4. Assignment Tool Part 2 Assignments, page 5-10

TEKS Correlations Proclamation 2017

Reinforcement Learning by Comparing Immediate Reward

Guru: A Computer Tutor that Models Expert Human Tutors

Learning From the Past with Experiment Databases

Specification of the Verity Learning Companion and Self-Assessment Tool

Community-oriented Course Authoring to Support Topic-based Student Modeling

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

Session 2B From understanding perspectives to informing public policy the potential and challenges for Q findings to inform survey design

Common Core Exemplar for English Language Arts and Social Studies: GRADE 1

OPTIMIZATINON OF TRAINING SETS FOR HEBBIAN-LEARNING- BASED CLASSIFIERS

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq

Learning goal-oriented strategies in problem solving

Conversation Starters: Using Spatial Context to Initiate Dialogue in First Person Perspective Games

Detecting English-French Cognates Using Orthographic Edit Distance

Digital Fabrication and Aunt Sarah: Enabling Quadratic Explorations via Technology. Michael L. Connell University of Houston - Downtown

Unpacking a Standard: Making Dinner with Student Differences in Mind

Ordered Incremental Training with Genetic Algorithms

Degree Qualification Profiles Intellectual Skills

Research directions on Semantic Web and education

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

Speech Emotion Recognition Using Support Vector Machine

re An Interactive web based tool for sorting textbook images prior to adaptation to accessible format: Year 1 Final Report

Data Fusion Models in WSNs: Comparison and Analysis

Instructor: Mario D. Garrett, Ph.D. Phone: Office: Hepner Hall (HH) 100

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Characteristics of the Text Genre Realistic fi ction Text Structure

A Study on professors and learners perceptions of real-time Online Korean Studies Courses

cmp-lg/ Jan 1998

What is Initiative? R. Cohen, C. Allaby, C. Cumbaa, M. Fitzgerald, K. Ho, B. Hui, C. Latulipe, F. Lu, N. Moussa, D. Pooley, A. Qian and S.

Transcription:

Zhu, Wenjue and Chowanda, Andry and Valstar, Michel F. (2016) Topic switch models for dialogue management in virtual humans. In: 16th International Conference on Intelligent Virtual Agents (IVA 2016), 20-23 Sept, 2016, Los Angeles, California, USA. Access from the University of Nottingham repository: http://eprints.nottingham.ac.uk/35622/1/topic-switch-models.pdf Copyright and reuse: The Nottingham eprints service makes this work by researchers of the University of Nottingham available open access under the following conditions. This article is made available under the University of Nottingham End User licence and may be reused according to the conditions of the licence. For more details see: http://eprints.nottingham.ac.uk/end_user_agreement.pdf A note on versions: The version presented here may differ from the published version or from the version of record. If you wish to cite this item you are advised to consult the publisher s version. Please see the repository url above for details on accessing the published version and note that access may require a subscription. For more information, please contact eprints@nottingham.ac.uk

Topic Switch Models for Dialogue Management in Virtual Humans Wenjue Zhu 1, Andry Chowanda 1,2, Michel Valstar 1 1 School of Computer Science, The University of Nottingham, Nottingham, UK-GB {psywz2,psxac6,pszmv}@nottingham.ac.uk 2 School of Computer Science, Bina Nusantara University, Jakarta, ID Abstract. This paper presents a novel data-driven Topic Switch Model based on a cognitive representation of a limited set of topics that are currently in-focus, which determines what utterances are chosen next. The transition model was statistically learned from a large set of transcribed dyadic interactions. Results show that using our proposed model results in interactions that on average last 2.17 times longer compared to the same system without our model. Keywords: Social Relationship, Framework, Game-Agents, Interactions 1 Introduction Current Dialogue Management (DM) systems are not natural enough and cannot sustain a coherent conversation with humans [2]. One issue is that systems are not able to stay on topic and are incapable of following a train of thought. As most DM systems have no notion of the concept of a topic and generate their responses based only on a set of predefined rules that operate on the specific words or phrases retrieved from the last user input. To address this, we propose a novel data-driven Topic Switch Model (TSM), devise an algorithm for sensible topic switching and instantiate it in a software program that can imbue virtual humans with the capability of staying on topic or making sensible topic switches with the aim of achieving more coherent conversations with humans. Our TSM learns connections between topics, which allows for sensible topic switches, and learns connections between topics and utterances, which allows for the selection of sentences that match the current topic. The system is otherwise naive, in that it does not implement an agent s goals, or states such as social relations or an agent s emotion [3, 4]. However, it is entirely data driven and thus does not require crafting of any rule whatsoever. It is thus suggested that for full effectiveness the TSM be integrated into a more complex stateful model, perhaps with one TSM per state. We evaluate the efficacy of our system by comparing it to a version of our system without the TSM enabled. When tested on over 20 participants, we show that people communicate on average 2.17 times longer with the agent when the TSM is enabled. Corresponding Author

2 Wenjue Zhu, Andry Chowanda, Michel Valstar 2 Related work A number of different approaches to DM have been proposed and implemented to date. Plan-based DM makes use of a general planner which is responsible for identifying the goal and making a plan. The plan consists of predefined operations and is aimed to achieve the final goal [1]. The most common and simplest approach to DM is to represent the dialogue as a graph where its nodes represents the dialogue states. The nodes usually define the proposed action based on the input of previous node [6]. Another common approach uses the concept of information state. In this approach, the state of conversation is formally represented by some informational components and a set of rules is defined for the DM to update the state and decide on the corresponding action according to the current state, system input and the applicable rule(s) [7]. 3 The Topic Switch Model Topic switches occur constantly in one s brain, and are influenced by both internal factors (e.g. your own knowledge of topics and their relationships) and external factors, including what you hear and see. There may be several topics in a person s mind at a time, each taking up a portion of one s attention. Over time topics in the brain will be replaced by others because of the various influencing factors at any point of time. This is our abstract concept of a TSM. As we focus here on text-based dialogue systems, topic extraction is comprised of text preprocessing and topic retrieval. Two natural language processing techniques (i.e. stop-word removal and stemming) are chosen to pre-process the text input. With respect to topic retrieval, we maintain a lookup table of words and their corresponding topics. This list was manually created by the authors. Topic relations, that should be learned by virtual humans consist of three types of topic statistics. The topic frequency P f (t) is the prior probability of a topic occurring in an utterance. The second relation, concurrency probability P con (t 1, t 2,..., t n ), is the probability of two or more topics appearing in the same utterance. The third relation is the adjacency probability, P adj (t 1, t 2 ) representing the probability of topic t 2 occurring in a utterance if its previous utterance contains t 1. Topic Frequency: P f (t) = n t N T (1) Concurrency Possibility: P con (t 1, t 2,..., t n ) = n con(t 1, t 2,..., t n ) N s (2) Adjacency possibility: P adj (t 1, t 2 ) = n adj(t 1, t 2 ) N s where N s is the number of utterances and N s is the potential times of two topics appearing in two adjacent utterances. In this paper, the topic statistics were obtained from the SEMAINE database [5]. (3)

Topic Switch Models for Dialogue Management in Virtual Humans 3 Algorithm 1 Topic switch with external factors Input: all topics L T, stop words L sw, pairs of words and topics L p, topic statistics, sentence database, user input Initialisation: 1. Randomly select 5 topics from L T to be the topics in the brain, L 1 = [t 1, t 2, t 3, t 4, t 5]. 2. Topics of which the virtual human is thinking currently, L 2 = []. 3. Topics that are out of the brain, L 3 = L T L 1. Procedure: 1. Read the text input and extract the user topic t u. Find the topic t i in L 1 which has the largest adjacency possibility with t u. If no user topic is found, randomly select a topic t i in L 1 based on the topic frequencies. Add t i into L 2, L 2 = [t i]. 2. Generate a random number r between 0 and 1. Continuously try to find topics in (L 1 L 2) whose concurrency possibility with topics in L 2 is larger than r and add it into L 2 until no such topic is found. 3. Make a response by randomly selecting an utterance which contains all topics in L 2. 4. Randomly select a topic t out from (L 1 - L 2) to be swapped out based on the adjacency possibilities. Then L 1 = L 1 [t out] and L 3 = L 3 + [t out] The possibility of each topic being selected is P out(t) = ta L 2 (1 P adj (t a,t)) ta L2,tb (L1 L2) (1 P adj (t a,t b )). 5. Randomly select a topic t in from L 3 to be swapped in according to the adjacency possibilities. Then L 1 = L 1 + [t in] and L 3 = L 3 [t in]. The possibility of each topic in L 3 being chosen is P in(t) = ta L 2 (P adj (t a,t)) ta L2,tb L3 (P adj (t a,t b )). 6. Empty L 2 and go to step 1 until the end of the conversation. Table 1. Performance comparison between systems. System Weather Work Christmas Other Total TS/NTS Mean SD Min Max TS 3.7 4.05 5.45 4.15 17.35 2.17 NTS 1.8 2.15 1.8 2.25 8 TS 2.03 1.76 3.89 3.73 8.14 3.55 NTS 0.95 1.04 1.06 1.55 2.29 TS 1 2 1 1 11 1.10 NTS 2 2 1 2 10 TS 6 8 17 12 39 3.90 NTS 1 1 2 2 10 4 Evaluation Twenty participants were invited to interact with our proposed system with TSM implemented and with a baseline version without TSM implemented as a comparison. Participants were asked to start a conversation on a specific topic and stop when they thought the topic switch made by the DM was not sensible. As performance measure we counted the number of user turns before they stopped. Table 1 demonstrates the performance comparisons between both systems, giving mean, standard deviation, min and max number of turns interacted with either system for the different topics. The last column shows the relative

4 Wenjue Zhu, Andry Chowanda, Michel Valstar increase in interaction time, measured as the number of turns interacting with Topic Switch (TS) divided by the number of turns with No Topic Switch (NTS). Table 1 shows that people always communicated longer with the TS model, and on average people interacted 2.17 times longer with TS. Additionally, the performance of TS partially depended on the discussed topic. NTS on the other hand had about the same performance for all specified topics. 5 Discussion The results indicate that the system with TSM could make a more sensible response which was either staying on the current topic or switched to another related topic. Moreover, the system TS had slightly different performance on different topics. The main reason should be the DM had better knowledge of some topics than that of others because the knowledge it learned from real conversations cannot cover all topics. However, unreasonable topic switches may still occur in the TS system for various reasons, including multiple meanings of an individual word, unfamiliar topics and inaccurate topic extractions. Another drawback of our current implementation is that topics are a flat structure - there is no hierarchy or ontology. This means that choices about the grouping of topics had to be made. For example, sunny is part of the topic weather, but an alternative choice would have been splitting that topic into good weather and bad weather, or constructing a hierarchy. Acknowledgement The work by A. Chowanda and M. Valstar is partly funded by European Union s Horizon 2020 research and innovation programme under grant agreement No 645378, ARIA-VALUSPA. References 1. James F Allen and C Raymond Perrault. Analyzing intention in utterances. Artificial intelligence, 15(3):143 178, 1980. 2. Björn Bringert. Programming Language Techniques for Natural Language Applications. Department of Computer Science and Engineering, 2008. 3. Andry Chowanda, Peter Blanchfield, Martin Flintham, and Michel Valstar. Erisa: Building emotionally realistic social game-agents companions. In Intelligent Virtual Agents, pages 134 143. Springer International Publishing, 2014. 4. Andry Chowanda, Martin Flintham, Peter Blanchfield, and Michel Valstar. Playing with social and emotional game companions. In Intelligent Virtual Agents. 2016. 5. G. McKeown, M.F. Valstar, R. Cowie, M. Pantic, and M. Schröder. The semaine database: Annotated multimodal records of emotionally coloured conversations between a person and a limited agent. Trans. Affective Computing, 3(1):5 17, 2012. 6. Bahador Nooraei, Charles Rich, and C Sidner. A real-time architecture for embodied conversational agents: beyond turn-taking. ACHI, 14:381 388, 2014. 7. D.R. Traum and S. Larsson. The information state approach to dialogue management. In Current and new directions in discourse and dialogue. Springer, 2003.