Semantic Parsing of Natural Language Input for Dialogue Systems. Jamie Frost Oxford University Computational Linguistics Group

Similar documents
Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Parsing of part-of-speech tagged Assamese Texts

Compositional Semantics

Some Principles of Automated Natural Language Information Extraction

CS 598 Natural Language Processing

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

Context Free Grammars. Many slides from Michael Collins

Using dialogue context to improve parsing performance in dialogue systems

Natural Language Processing. George Konidaris

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Applications of memory-based natural language processing

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

FUZZY EXPERT. Dr. Kasim M. Al-Aubidy. Philadelphia University. Computer Eng. Dept February 2002 University of Damascus-Syria

Grammars & Parsing, Part 1:

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Developing a TT-MCTAG for German with an RCG-based Parser

AQUA: An Ontology-Driven Question Answering System

Word Segmentation of Off-line Handwritten Documents

LTAG-spinal and the Treebank

The stages of event extraction

SEMAFOR: Frame Argument Resolution with Log-Linear Models

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

Control and Boundedness

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

Rule-based Expert Systems

Modeling user preferences and norms in context-aware systems

Prediction of Maximal Projection for Semantic Role Labeling

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

Spoken Language Parsing Using Phrase-Level Grammars and Trainable Classifiers

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Annotation Projection for Discourse Connectives

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

PRODUCT PLATFORM DESIGN: A GRAPH GRAMMAR APPROACH

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Teacher: Mlle PERCHE Maeva High School: Lycée Charles Poncet, Cluses (74) Level: Seconde i.e year old students

"f TOPIC =T COMP COMP... OBJ

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

2/15/13. POS Tagging Problem. Part-of-Speech Tagging. Example English Part-of-Speech Tagsets. More Details of the Problem. Typical Problem Cases

Proof Theory for Syntacticians

Natural Language Processing: Interpretation, Reasoning and Machine Learning

Conversation Starters: Using Spatial Context to Initiate Dialogue in First Person Perspective Games

Candidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.

Protocols for building an Organic Chemical Ontology

Type Theory and Universal Grammar

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

The Strong Minimalist Thesis and Bounded Optimality

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

An Interactive Intelligent Language Tutor Over The Internet

Analysis of Probabilistic Parsing in NLP

Biomedical Sciences (BC98)

elearning OVERVIEW GFA Consulting Group GmbH 1

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

Learning to Schedule Straight-Line Code

Construction Grammar. University of Jena.

Unit purpose and aim. Level: 3 Sub-level: Unit 315 Credit value: 6 Guided learning hours: 50

Interfacing Phonology with LFG

Specifying Logic Programs in Controlled Natural Language

Noisy SMS Machine Translation in Low-Density Languages

Designing Autonomous Robot Systems - Evaluation of the R3-COP Decision Support System Approach

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Android App Development for Beginners

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

arxiv: v1 [cs.cv] 10 May 2017

ECE-492 SENIOR ADVANCED DESIGN PROJECT

Introduction to Simulation

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

A Framework for Customizable Generation of Hypertext Presentations

Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

School of Innovative Technologies and Engineering

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY

Philosophy 301L: Early Modern Philosophy, Spring 2012

Accurate Unlexicalized Parsing for Modern Hebrew

Seminar - Organic Computing

The Discourse Anaphoric Properties of Connectives

Dreistadt: A language enabled MOO for language learning

Essay on importance of good friends. It can cause flooding of the countries or even continents..

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Probabilistic Latent Semantic Analysis

LEGO MINDSTORMS Education EV3 Coding Activities

The Smart/Empire TIPSTER IR System

A Version Space Approach to Learning Context-free Grammars

Language Model and Grammar Extraction Variation in Machine Translation

A Grammar for Battle Management Language

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

Constraining X-Bar: Theta Theory

The Interface between Phrasal and Functional Constraints

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Basic Syntax. Doug Arnold We review some basic grammatical ideas and terminology, and look at some common constructions in English.

Circuit Simulators: A Revolutionary E-Learning Platform

Transcription:

Semantic Parsing of Natural Language Input for Dialogue Systems Jamie Frost Oxford University Computational Linguistics Group

Video

The EUROPA Project Autonomous pedestrian assistant robot designed to operate in a city/town environment. Provides information to pedestrians and escorts them to their requested locations.

The EUROPA Project Laser Mapping Pedestrian Tracking Global Path Planning Linguistic Control Local Path Planning Location Recognition

The EUROPA Project Models of Discourse Evaluating different approaches to discourse modelling (POMDPs, Plan-Based, ISU, etc.) Building a framework that can handle anaphoric resolution, multiple utterances, multi-modal input, etc. Semantic Parsing Converting English to some semantic representation suitable for dialogue system. and back to English. Spatial Reasoning Building numerical models for aspects of spatial language. Generating expressions to identify objects or disambiguate their location.

Natural language output Natural language input Architecture DIALOGUE SYSTEM Natural Language Parser Semantic representation Natural Language Generator Event/Request Handler Search requests, etc. Replies, robot notifications, etc. Robot Intermediary Image Service Env DB Visualisation Dialogue Web Server User/system dialogue text MOOS Go to requests, tour requests... Localisation data, arrival notifications, etc. ipad

Where did you last see your cat madam?

By the tree in front of me, on the road and near the other tree by a house.

where

(Sentence from from TownInfo training set.) hi i'm looking for a bar but i don't have much money on me and the other thing is i'd like it to be in the south of town because i've a train to catch at the station is there anywhere suitable

How have discourse systems parsed language in the past? Approach 1: Keyword Spotting No encoding of input. Dialogue Manager responds directly to particular keywords. Example: automated telephone system. Advantages Predictable rigid behaviour. Simple to implement. Disadvantages Very limited representation of semantic content. Dialogue Manager coupled too tightly with raw source input.

Approach 2: Full Logic Based Representation

Approach 3: DA Taxonomy with Key-Value Pairs

Approach 3: DA Taxonomy with Key-Value Pairs Advantages Taxonomy captures natural couplings of speech acts in dialogue (e.g. request often followed by acknowledge, question by answer, etc.) Easy for a Dialogue Manager to see particular information of interest. Simple representation lends well to Machine Learning approaches for learning dialogue policy. Disadvantages Limited semantic encoding.

Our Approach Target semantic language represented as a Context Free Grammar. CFG can be automatically generated by our Dialogue Manager framework. Advantages Allows very expressive representation (e.g. English language definable with CFG) yet with a rigid tree like structure. Easy to extract subtrees representing data we re interested in.

S VP NP VP V NP JJ NP V NP blue I want JJ NNS NNS cabbages

S S NP[1] VP[2] NP[1] VP[2] VP VP V[1] NP[2] V[1] NP[2]

NP NP JJ[1] NNS[2] des NNS[2] JJ[1] JJ JJ blue NNS carrots bleues NNS carottes

Synchronous Context Free Grammar

S NP VP I V NP want JJ blue NNS carrots

S S NP VP NP VP I V NP Je V NP want JJ NNS veux des NNS JJ blue carrots carottes bleues

Examples

Examples

Example Rule

Dialogue Act Segmentation

Dealing with superfluous info

Challenges Considering all possible segmentations and allowing data to be superfluous leads to lots of possible translations. Could use Probabilistic SCFGs can give a measure of the strength of the translation. Requires training data to obtain probabilities associated with rules. But for simplicity, we use simple heuristics to choose the best tree i.e. the one that maximises the amount of parsed information.

Where does target grammar come from? use input IPAD SCFG( patternfile ) U ; use input ROBOT RAW R ; use output IPAD SCFG( patternfile );

Where does target grammar come from? enum DIALOGUEACT { acknowledge, clarify(prop), greet, informyes, informno, informdontknow, inform(prop), requestinfo(qud), requestinfoyn(prop), requestaction(task), } structure LOCEXPR { INT[?@] id, PART[?@] part, INT[?] classid, STR[?] class, LIST<PREPOSITION>[?] relations, STR[?] name, LIST<ATTRVAL>[?] attrs, BOOL[?] isvisible, BOOL[?] da, BOOL[?] multiple }; const REAL WALKINGDISTANCE = 150;

Problem? Non-isomorphic translations not easily represented by SCFG. i.e. Transformation of grammatical structure more complicated than renamings and swapping siblings. Synchronous Tree Substitution Grammars (STSGs) solve the problem, as they allow longer range dependencies.

Problem? STSG > SCFG Tree languages

Problem? STSG = SCFG String languages

Problem? We don t ultimately care whether we have the correct syntax tree of the source sentence. If our target grammar is unambiguous, we care only about the string (and indeed, our Dialogue Manager accepts parsed inputs in string form. Therefore SCFG is sufficient. But non-isomorphism property means that we ll likely have lots more rules.

Can we learn a SCFG? Can generate 3 different types of rules: : Z < X*1+ Y*2+, X*1+ Y*2+ > : Z < X*1+ Y*2+, Y[2] X[1] > : Z < a, b > Rules in this form are synchronous equivalent of.

Summary We can use a variety of different methods to parse input for the purposes of dialogue. Often a trade off between the level of semantic content we capture and the ease of processing it. Use of SCFGs has a number of advantages: Ties in well with Machine Translation theory. And therefore gives us a means by which we can potentially learn a SCFG using Machine Learning. Expressive representation (although can t for example represent logical operators very effectively, e.g.,,, ). Can be generated automatically based on the particular task domain. Attempted to build framework (HURDLE) that puts large emphasis on the ease for industry to develop complex systems as easily as possible, and without the need for too much specialist knowledge.

Any questions?