InIT Institute of Applied Information Technology Human Information Interaction ICT Accessibility Lab PROJECT: AUTOMATIC TRANSLATION FROM SIGN LANGUAGE

Similar documents
Deep search. Enhancing a search bar using machine learning. Ilgün Ilgün & Cedric Reichenbach

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Software Development: Programming Paradigms (SCQF level 8)

ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY DOWNLOAD EBOOK : ADVANCED MACHINE LEARNING WITH PYTHON BY JOHN HEARTY PDF

Information System Design and Development (Advanced Higher) Unit. level 7 (12 SCQF credit points)

Individual Component Checklist L I S T E N I N G. for use with ONE task ENGLISH VERSION

Python Machine Learning

CS Machine Learning

Human Emotion Recognition From Speech

Proposal of Pattern Recognition as a necessary and sufficient principle to Cognitive Science

LEGO MINDSTORMS Education EV3 Coding Activities

Axiom 2013 Team Description Paper

Introduction to Ensemble Learning Featuring Successes in the Netflix Prize Competition

Machine Learning and Data Mining. Ensembles of Learners. Prof. Alexander Ihler

Generative models and adversarial training

Multiple Intelligence Teaching Strategy Response Groups

Lecture 1: Machine Learning Basics

CROSS COUNTRY CERTIFICATION STANDARDS

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Candidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.

Computerized Adaptive Psychological Testing A Personalisation Perspective

Unit purpose and aim. Level: 3 Sub-level: Unit 315 Credit value: 6 Guided learning hours: 50

Topic: Making A Colorado Brochure Grade : 4 to adult An integrated lesson plan covering three sessions of approximately 50 minutes each.

A MULTI-AGENT SYSTEM FOR A DISTANCE SUPPORT IN EDUCATIONAL ROBOTICS

Appendix L: Online Testing Highlights and Script

Intelligent Agents. Chapter 2. Chapter 2 1

Australian Journal of Basic and Applied Sciences

DIGITAL GAMING & INTERACTIVE MEDIA BACHELOR S DEGREE. Junior Year. Summer (Bridge Quarter) Fall Winter Spring GAME Credits.

Functional Maths Skills Check E3/L x

Evidence for Reliability, Validity and Learning Effectiveness

COVER SHEET. This is the author version of article published as:

Paper Reference. Edexcel GCSE Mathematics (Linear) 1380 Paper 1 (Non-Calculator) Foundation Tier. Monday 6 June 2011 Afternoon Time: 1 hour 30 minutes

(Sub)Gradient Descent

Laboratorio di Intelligenza Artificiale e Robotica

ASSISTIVE COMMUNICATION

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

Rule Learning With Negation: Issues Regarding Effectiveness

IAT 888: Metacreation Machines endowed with creative behavior. Philippe Pasquier Office 565 (floor 14)

THE ROLE OF TOOL AND TEACHER MEDIATIONS IN THE CONSTRUCTION OF MEANINGS FOR REFLECTION

Self Study Report Computer Science

Cambridge NATIONALS. Creative imedia Level 1/2. UNIT R081 - Pre-Production Skills DELIVERY GUIDE

Modeling user preferences and norms in context-aware systems

Data Fusion Models in WSNs: Comparison and Analysis

Using SAM Central With iread

Education the telstra BLuEPRint

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Literature and the Language Arts Experiencing Literature

THE WEB 2.0 AS A PLATFORM FOR THE ACQUISITION OF SKILLS, IMPROVE ACADEMIC PERFORMANCE AND DESIGNER CAREER PROMOTION IN THE UNIVERSITY

Learning Methods in Multilingual Speech Recognition

CEFR Overall Illustrative English Proficiency Scales

Spring 2016 Stony Brook University Instructor: Dr. Paul Fodor

ProFusion2 Sensor Data Fusion for Multiple Active Safety Applications

Courses in English. Application Development Technology. Artificial Intelligence. 2017/18 Spring Semester. Database access

Relationships Between Motivation And Student Performance In A Technology-Rich Classroom Environment

S T A T 251 C o u r s e S y l l a b u s I n t r o d u c t i o n t o p r o b a b i l i t y

Robot manipulations and development of spatial imagery

Learning Methods for Fuzzy Systems

Course Law Enforcement II. Unit I Careers in Law Enforcement

University of Waterloo School of Accountancy. AFM 102: Introductory Management Accounting. Fall Term 2004: Section 4

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Mathematics subject curriculum

Notes on The Sciences of the Artificial Adapted from a shorter document written for course (Deciding What to Design) 1

TRANSFER LEARNING OF WEAKLY LABELLED AUDIO. Aleksandr Diment, Tuomas Virtanen

Radius STEM Readiness TM

Speak with Confidence The Art of Developing Presentations & Impromptu Speaking

Welcome to. ECML/PKDD 2004 Community meeting

Innovative Methods for Teaching Engineering Courses

Business Analytics and Information Tech COURSE NUMBER: 33:136:494 COURSE TITLE: Data Mining and Business Intelligence

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Prentice Hall Literature: Timeless Voices, Timeless Themes Gold 2000 Correlated to Nebraska Reading/Writing Standards, (Grade 9)

Seminar - Organic Computing

Testing A Moving Target: How Do We Test Machine Learning Systems? Peter Varhol Technology Strategy Research, USA

Europeana Creative. Bringing Cultural Heritage Institutions and Creative Industries Europeana Day, April 11, 2014 Zagreb

Massachusetts Institute of Technology Tel: Massachusetts Avenue Room 32-D558 MA 02139

FY16 UW-Parkside Institutional IT Plan Report

Parent Information Welcome to the San Diego State University Community Reading Clinic

Spanish III Class Description

The University of Amsterdam s Concept Detection System at ImageCLEF 2011

CS 100: Principles of Computing

Designing Educational Computer Games to Enhance Teaching and Learning

REVIEW OF CONNECTED SPEECH

Rule Learning with Negation: Issues Regarding Effectiveness

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS APPLIED MECHANICS MET 2025

STT 231 Test 1. Fill in the Letter of Your Choice to Each Question in the Scantron. Each question is worth 2 point.

Development of an IT Curriculum. Dr. Jochen Koubek Humboldt-Universität zu Berlin Technische Universität Berlin 2008

Chapter 9 Banked gap-filling

Measurement. When Smaller Is Better. Activity:

IMPROVING SPEAKING SKILL OF THE TENTH GRADE STUDENTS OF SMK 17 AGUSTUS 1945 MUNCAR THROUGH DIRECT PRACTICE WITH THE NATIVE SPEAKER

Backwards Numbers: A Study of Place Value. Catherine Perez

Agents and environments. Intelligent Agents. Reminders. Vacuum-cleaner world. Outline. A vacuum-cleaner agent. Chapter 2 Actuators

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

ACTIVITY: Comparing Combination Locks

ReinForest: Multi-Domain Dialogue Management Using Hierarchical Policies and Knowledge Ontology

Merry-Go-Round. Science and Technology Grade 4: Understanding Structures and Mechanisms Pulleys and Gears. Language Grades 4-5: Oral Communication

Computer Science 141: Computing Hardware Course Information Fall 2012

The Evolution of Random Phenomena

Transcription:

InIT Institute of Applied Information Technology Human Information Interaction ICT Accessibility Lab PROJECT: AUTOMATIC TRANSLATION FROM SIGN LANGUAGE Andri Reichenbacher Conference Barrier-free Communication: Methods and Products ZHAW Event, 14th September 2017 1

Overview Introduction Motivation, Problem Definition State of Art Research, Practicality Analysis Demo, Results Conclusion Application Area, Technical Progress 2

Motivation Project Idea Relief for deaf and hard of hearing by automatic translation from sign language to text or audio Evaluation project in 2013 with helping and usage of the available and different sensors for recognition of gestures Proposals Useful as complement to barrier-free communication in sign language Usage of modern technologies and research methods for the development of communication platform for people with hearing impairment Innovative approaches for image processing, machine learning and deep learning 3

Problem Definition What is Sign Language? Body position = Association of Hands + Arms + Torso + Mimic Gestures mean language expression for thinking in flow images as movie or flip book, not primitive single image Visually perception by associative production of body position in three-dimensional space, that is some composited informations can reflect at once In contrast spoken language is a linear, sequential production of words by voice, hence notion for one-dimensional space Therefore this own grammatical structure in 3D and totally different to spoken language Sight contact always essential, otherwise break in communication 4

Research What is State of the Art? Artificial Intelligence IBM Deep Blue defeated world chess champion Was an important fundament for development Machine Learning Is always still current used and well established Is considered as transition to Deep Learning Uses different efficient algorithms for example as Adaptive Boosting and Random Forests Deep Learning Is at present and has trend towards Big Data, Smart Data, Data Science etc. Is a specialized form of Machine Learning Uses different kind of neural networks such as Convolutional Neural Network 5

Research Machine Learning (ML) Look at a cat, a dog or a parrot Learning for many images of animals as object to identify them over time Three objects are divided into classes as error free as possible Each object has relevant features of image as edges, corners, pointed ears, tails etc. ML requires manual feature extraction from images Features are used to create a model that categorizes the objects in the image 6

Research Deep Learning (DL) Is generally more complex to get reliable results Eliminate manual feature extraction Can automatically and directly learn relevant features in data Performs «end-to-end-learning» in principle Key advantage of DL Continue often to improve the accuracy as the amount of data increases 7

Practicality Comparison of Machine and Deep Learnings Conditions for decision between ML and DL Pro ML Is suitable especially for a small amount of data to train Can achieve a short training time Is enough to use an efficient CPU Is possible to define own features Pro DL Requires a very large amount of data (thousands of images) to train Needs a long training time Needs less time to analyze all images Requires a high-performance GPU to rapidly process image data 8

Analysis Which Methods? Current usage of Kinect cam with 3D depth-sensor from Microsoft Current usage of tool Visual Gesture Builder (VGB) from Microsoft Integral part of algorithms such AdaBoost and Random Forest Reasons for image processing algorithms with VGB Minimum effort for record clips, tagged clips, without programming, non-engineering task etc. Which Gestures for training data? Base idea for three different meaning of gestures But these motions relatively similar at first sight for example as Car, Thursday and Milk 9

Demo Data-driven process of creating a gesture detector using VGB 10

Demo How Gesture Recognition for Testing Data? Needs at least programming for own application and is an engineering task Therefore a good direct comparison between Steering and Car as gesture Steering Actions 3 discrete states: SteerLeft, SteerRight, KeepStraight 1 continuous state: SteerProgress Actions by discrete states for change of direction Action by continuous state for change of angle as more or less rotation Used both AdaBoost and Random Forest Car Detections 2 discrete states: CarHandUpLeft, CarHandUpRight no Detections for start state and end state Additional number of sequence: repeated twice Used only AdaBoost 11

Demo Gesture detection in application Correct display of a gesture data set of test examples Following illustration shows gestures for Car, Thursday and Milk 12

Results Result of Gesture Recognition Each gesture data contains nearly 5 similar clips by the same person Evaluation for results is subjective because training data is too little at the moment Accuracy Mostly True Positives successful deteced (without specification of perecentage) Rare False Positives detected Latency Relatively very little to none, but speed of movement of gesture should be fair Difficulties Examples for gestures as Ship and Plow are nearly identical because hand detection has only simple hand position as open, close and lasso Problem with triggering of lower confidence value at change of discrete state from false to true in the tagging frames 13

Conclusion Takeaways Use Visual Gesture Builder Results speak for themselves: rapidly productivity with tagging data and non-engineering task Invest in quality assurance for tagging gesture data Tagging plays an important role in good results and increased accuracy Improve Accuracy by Using enough positive and negative training examples Of a wide variety of different signing persons Application Area Study course for Applied Linguistics in the Institute for Translation and Interpreting Knownledge transfer to research and teaching in the Institute for Information Technology Technical Progress Possible usage of prototype with approach to Deep Learning Better performance for complex grammatically structure of sign language 14

Questions 15