Investigation of annotator's behaviour using eye-tracking data

Ryu Iida, Koh Mitsuda, Takenobu Tokunaga
Tokyo Institute of Technology, Japan
LAW VII & ID (August 9, 2013)

Research background
- Manual annotation: essential for ML-based approaches in various NLP tasks
- Shallow processing tasks: POS tagging, NP chunking
  - ML-based approaches have been largely successful
  - Surface information (e.g. word and POS) can be easily introduced as useful features
- Deeper processing tasks: coreference resolution, discourse parsing
  - Deeper linguistic knowledge has been integrated: WordNet, linguistic theories (e.g. Centering Theory)
  - There is still room for further improvement

Cognitive science approach based on annotator's behaviour
- Look into human behaviour during annotation
  - Elicit useful information for NLP tasks requiring deeper linguistic knowledge
- Focus on annotator eye gaze during annotation
  - Developments in eye-tracking technology
  - Eye gaze data has been widely used: psycholinguistics & problem solving (Duchowski, 2002)
  - Tomanek et al. (2010): utilised eye-tracking data to evaluate the degree of difficulty in annotating named entities

Aim
- Design an experimental setting for collecting annotator's behaviour (annotation events & eye gaze) during annotation
- Investigate annotator's behaviour to elicit useful information in an NLP task
  - Annotating predicate-argument relations in Japanese
  - Moderately difficult annotation task due to the existence of zero-anaphora
  - Meaningful eye movement may be observed

Outline
1. Motivation of analysing annotation behaviour
2. Task setting of annotating predicate-argument relations in Japanese and data collection including annotation behaviour
3. Manual investigation using collected data

Annotation task: annotation of Japanese predicate-argument relations
- Annotation task: annotating obligatory arguments (subj, obj, iobj) of predicates in a text
- Segments of predicates and candidate arguments are pre-annotated automatically
- Example:

  トムは  公園に  行った
  Tom-top  park-iobj  go/past
  (Tom went to a park.)

  (φガ)  そこで  ジョンに  会った
  φ-subj  there  John-obj  meet/past
  (φ (he) met John there.)

Annotation tool: modified version of Slate (Kaplan et al. 2012)
[Screenshot: annotation tool showing subj, obj and iobj links]

Recorded annotation events
- Seven event types are recorded, together with the time at which each event occurs and its related segments
- Fields recorded besides time: predid, argid, linkid, link type

  Event label         Description
  create_link_start   creating a link starts
  create_link_end     creating a link ends
  select_link         a link is selected
  delete_link         a link is deleted
  select_segment      a relation type is selected
  annotation_start    annotating a text starts
  annotation_end      annotating a text ends
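
A minimal sketch of how one logged event might be represented. Only the seven event labels come from the slide; the class name, field names and sample values are hypothetical and are not the schema used by Slate.

```python
from dataclasses import dataclass
from typing import Optional

# The seven event labels listed on the slide.
EVENT_TYPES = {
    "create_link_start", "create_link_end", "select_link", "delete_link",
    "select_segment", "annotation_start", "annotation_end",
}


@dataclass(frozen=True)
class AnnotationEvent:
    """One logged event (hypothetical layout, for illustration only)."""
    time_ms: int                     # time at which the event occurred
    label: str                       # one of EVENT_TYPES
    pred_id: Optional[str] = None    # related predicate segment, if any
    arg_id: Optional[str] = None     # related argument segment, if any
    link_id: Optional[str] = None    # related link, if any
    link_type: Optional[str] = None  # "ga", "o" or "ni", if any

    def __post_init__(self):
        # Reject labels that are not among the seven recorded event types.
        if self.label not in EVENT_TYPES:
            raise ValueError(f"unknown event label: {self.label}")


# Made-up sample log, not data from the experiment.
log = [
    AnnotationEvent(0, "annotation_start"),
    AnnotationEvent(5200, "create_link_start", pred_id="p1", arg_id="a3"),
    AnnotationEvent(5900, "create_link_end", pred_id="p1", arg_id="a3",
                    link_id="l1", link_type="ga"),
]
```

Keeping the segment fields optional reflects that not every event type relates to every kind of segment.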

Annotation environment
- Equipment
  - Eye-tracker: Tobii T60 (screen size: 1,280x1,024)
  - Chin rest
  - Keyboard: select link type: ga (subj), o (obj), ni (iobj)
  - Mouse: create link between a predicate and its argument

Experimental settings
- Recruited three annotators
  - Experience in annotating predicate-argument relations
- Data: 43 articles in BCCWJ PB-corpus (Maekawa et al. 2010)
  - Texts were truncated to about 1,000 characters to fit onto the screen and prevent scrolling

Annotation results by three human annotators

                                        case
                 total   selected   ga (subj)   o (obj)   ni (iobj)
  annotator A    3,353      1,776       1,170       383         223
  annotator B    3,764      1,430         904       298         228
  annotator C    3,462      1,795       1,105       421         269
  total         10,579      5,001       3,179     1,102         720

- Our analysis requires an annotator's fixation on the segments of both a predicate and its argument, so the instances available for analysis were reduced

Division of annotation process
- Divided into three stages (Russo & Leclerc, 1994)
- Timeline markers, in order: first fixation on target predicate, first fixation on linked argument, create_link_start
  - orientation: reads a given text and understands its context
  - evaluation: searches for an argument of a target predicate
  - verification: looks around the context in order to confirm the pred-arg relation
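
The division above can be sketched as a function that assigns fixation timestamps to the three stages. This is a sketch under a stated assumption: each timeline marker is taken to close the stage listed beneath it, so orientation ends at the first fixation on the target predicate, evaluation ends at the first fixation on the linked argument, and verification ends at create_link_start. The function name, the half-open boundaries and the sample timestamps are hypothetical.

```python
def split_stages(fixation_times, t_first_pred, t_first_arg, t_link_start):
    """Assign fixation timestamps to orientation / evaluation / verification.

    Assumption: a fixation exactly at a marker belongs to the stage that
    starts at that marker (half-open intervals).
    """
    if not t_first_pred <= t_first_arg <= t_link_start:
        raise ValueError("markers must be in chronological order")
    stages = {"orientation": [], "evaluation": [], "verification": []}
    for t in sorted(fixation_times):
        if t < t_first_pred:
            stages["orientation"].append(t)
        elif t < t_first_arg:
            stages["evaluation"].append(t)
        elif t < t_link_start:
            stages["verification"].append(t)
        # Fixations at or after create_link_start lie outside the three stages.
    return stages


# Made-up timestamps in milliseconds.
example = split_stages([100, 400, 900, 1500, 2100, 2600],
                       t_first_pred=400, t_first_arg=1500, t_link_start=2500)
```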

Division of annotation process
- Divided into three sub-processes (Russo & Leclerc, 1994): orientation, evaluation, verification
- Evaluation stage
  - Most informative for extracting useful features
  - Analysing annotator eye gaze during this stage could reveal useful information for predicate-argument analysis
  - Insufficient to regard only fixated arguments during this stage (the annotator captures an overview of the current problem during the orientation stage)

Division of annotation process
- Divided into three sub-processes (Russo & Leclerc, 1994): orientation, evaluation, verification
- Verification stage: target of our analysis
  - The probable argument has already been determined, and its validity is confirmed by investigating its competitors
  - Considered competitors are explicitly fixated during this stage
  - Possible to analyse annotator's behaviour during this stage based on eye gaze
- We concentrated on the analysis of the verification stage

Two viewpoints for investigation
1. Types of eye movement of the annotator in the verification stage
2. Distance between a target predicate and its argument, measured as character-based distance

1. Eye movement in verification stage
- Concentrated: after the first fixation on the argument annotated earlier, the fixations are concentrated onto it and the target predicate
- Distracted: fixates on the competitors
[Figure: Japanese example text overlaid with fixations, with the argument and the target predicate marked]
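
The two types above could be told apart with a rule like the following. It is a sketch under assumptions that the slide does not state: the input is the sequence of segment ids fixated during the verification stage, fixations outside any segment have already been dropped, and every fixated segment other than the target predicate and the linked argument counts as a competitor. The function name and ids are hypothetical.

```python
def classify_verification(fixated_segments, predicate_id, argument_id):
    """Label a verification stage as 'Concentrated' or 'Distracted'."""
    targets = {predicate_id, argument_id}
    # Any fixated segment outside the predicate/argument pair is treated
    # as a competitor (assumption).
    if all(segment in targets for segment in fixated_segments):
        return "Concentrated"
    return "Distracted"


# Made-up segment ids.
only_targets = classify_verification(["a3", "p1", "a3"], "p1", "a3")
with_competitor = classify_verification(["a3", "a7", "p1"], "p1", "a3")
```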

2. Distance between a predicate and its argument
- Hypothesis: annotator's behaviour depends on the distance between a predicate and its argument
- Each instance is classified as either Near or Far
- Boundary between Near and Far: 22, the average over all annotation instances
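
A minimal sketch of the Near/Far split, using the value 22 from the slide as the boundary. Measuring the distance between the character offsets of the two segments, and counting a distance of exactly 22 as Far, are assumptions; the slide does not say how the boundary case is handled. All names are hypothetical.

```python
NEAR_FAR_THRESHOLD = 22  # average over all annotation instances (from the slide)


def char_distance(pred_offset, arg_offset):
    """Character-based distance between a predicate and its argument."""
    return abs(pred_offset - arg_offset)


def distance_class(pred_offset, arg_offset, threshold=NEAR_FAR_THRESHOLD):
    """Return 'Near' below the threshold and 'Far' otherwise (assumption)."""
    if char_distance(pred_offset, arg_offset) < threshold:
        return "Near"
    return "Far"


# Made-up character offsets.
near_case = distance_class(pred_offset=40, arg_offset=31)   # distance 9
far_case = distance_class(pred_offset=120, arg_offset=35)   # distance 85
```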

Investigation from three aspects
1. Predicate-argument distance and argument case
2. Effect of pre-annotated links
3. Specificity of arguments and dispersal of fixations

1. Distance of predicate-argument relations and their case
- The annotator changes her/his behaviour with regard to the case of the argument

           ga (subj)      o (obj)        ni (iobj)
  Near     2,201 (0.44)   1,042 (0.34)   662 (0.22)
  Far        978 (0.90)      60 (0.05)    58 (0.05)
  total    3,179 (0.64)   1,102 (0.22)   720 (0.14)

- 90% of the Far class are ga arguments: ga arguments are often omitted to make ellipses
- o and ni arguments appear less frequently as Far instances because they are rarely omitted
- Each case requires individual, specific treatment in a ...

1. Distance of predicate-argument and their case (Cont'd)
- Does the Concentrated/Distracted distinction impact the Near/Far distinction?
- Table columns: Near-Concentrated, Near-Distracted, Far-Concentrated, Far-Distracted; rows: ga (subj), o (obj), ni (iobj)
- Reported values, in slide order: 0.40 0.60 0.47 0.53 0.92 0.08 0.90 0.10
- The Concentrated/Distracted distinction does not impact the distribution of the argument types
- Even if an argument appears far from its predicate, the verification is completed without seeing any competitors

2. Effect of pre-annotated links
- In the situation of annotating A for P, 6 links (SL) have already been annotated
- These links make the argument visually or cognitively salient in the annotator's short-term memory

Relationship between #already-existing links and #dwells on competitors
- Only Far instances
- Peaks around the intersection of instances with the fewest #links and dwells on competitors
  - Lower #links: mostly symmetrical relation
  - Higher #links: symmetry breaks
- Visual and cognitive salience reduces annotators' cognitive load, so correct arguments are confirmed efficiently
[Figure: distribution of instances over # existing links of annotated argument and # dwells on competitors]
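
The relationship above is a count of Far instances per pair of (#already-existing links, #dwells on competitors). A sketch of that tabulation follows; the dictionary keys and the toy instances are hypothetical and made up for illustration, not taken from the collected data.

```python
from collections import Counter


def link_dwell_histogram(instances):
    """Count Far instances per (#existing links, #dwells on competitors) cell."""
    return Counter(
        (inst["n_links"], inst["n_dwells"])
        for inst in instances
        if inst["distance_class"] == "Far"  # the slide considers Far instances only
    )


# Toy instances with invented values.
toy_instances = [
    {"distance_class": "Far", "n_links": 0, "n_dwells": 0},
    {"distance_class": "Far", "n_links": 0, "n_dwells": 0},
    {"distance_class": "Far", "n_links": 5, "n_dwells": 1},
    {"distance_class": "Near", "n_links": 0, "n_dwells": 0},
]
histogram = link_dwell_histogram(toy_instances)
```

The resulting counts can be laid out as a grid to look for the peak and the symmetry described on the slide.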

3. Relationship of specificity of arguments and dispersal of eye gaze
- Specific problem of our annotation setting
  - Only the head of an NP is pre-annotated as a segment in our annotation setting
  - e.g. Benkyo-suru koto ("to study" + "-ing": to study / studying): the annotation target is the head koto, while the NP is the whole phrase
- The head noun of an argument does not always carry enough information
  - Inspecting the whole NP, including its modifiers, is necessary to verify the validity of the NP as an argument

Empirical investigation about dispersal of eye gaze: head of NP
- Annotated arguments which have any NP modifiers are classified into:
  (a) fixations remain within the region of the argument NP
  (b) fixations go out of the region

                    Concentrated   Distracted
  (a) within NP            1,190          242
  (b) out of NP                -          839

- 22% of Distracted arguments with any modifiers (242 instances) remain within the NP region
- Need to treat candidate arguments differently depending on whether they have a modifier or not
- In addition to the head of the NP, we should introduce information on modifiers into ML algorithms as features
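
The 22% figure follows from the counts on this slide: 242 Distracted instances stay within the NP and 839 leave it. A short check, using hypothetical variable names:

```python
# Counts of Distracted arguments with modifiers, from the slide.
distracted_within_np = 242
distracted_out_of_np = 839

distracted_total = distracted_within_np + distracted_out_of_np
share_within_np = distracted_within_np / distracted_total

print(distracted_total)            # 1081
print(f"{share_within_np:.0%}")    # 22%
```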

Summary
- Aim: analysis of annotator's behaviour during her/his annotation for eliciting useful information for NLP tasks
- Conducted an experiment for collecting three annotators' eye gaze and annotation events during annotation of predicate-argument relations in Japanese texts
- Analysed from three aspects:
  - Relationship of predicate-argument distances and argument cases
  - Effect of already-existing links
  - Specificity of arguments and dispersal of eye gaze

Future work
- Further investigation of the collected data
  - Use of mining techniques for finding unknown but useful information may be advantageous
  - Employ mining techniques for finding useful gaze patterns for NLP tasks
- Current work: limited to the analysis of the verification stage of annotation
  - The orientation and evaluation stages include important clues for examining human behaviour during annotation