Towards a Human Language Project for Multilingual Europe


Towards a Human Language Project for Multilingual Europe
AI and Interpretation
Georg Rehm
German Research Center for Artificial Intelligence (DFKI) GmbH, Language Technology Lab, Berlin, Germany
META-NET, General Secretary
georg.rehm@dfki.de

SCIC Universities Conference (19/20 April 2018)

Artificial Intelligence, Data Intelligence
Current breakthroughs based on Machine Learning ("Deep Learning")
Also still in use: symbolic, rule-based methods and systems
Huge data sets + powerful algorithms + extremely fast hardware
Enormous potential for disruptions in all sectors and areas

Translation and Interpretation
Since approx. 2015, with breakthroughs in neural technologies, Machine Translation has been getting better and better.
All areas of AI look for super-human performance, but language is fundamentally different and much more complex.
Neural AI approaches cannot understand language; they process it according to huge underlying data sets.
In many use cases, mistakes can be tolerated.
But: translation and interpretation are often mission-critical! Mistakes can have serious consequences (politics, medicine).

Speech Translation
Example: Lecture Translator
  University lectures are automatically transcribed and translated, in near-real time, into several languages
  Students can follow the translation through a web interface
Example: Presentation Translator
  Presenter can have the speech automatically translated
  Translations are displayed as subtitles
Example: Call Translator
  Internet telephony provider offers automatic voice translation

Issues and Limitations
The three example applications work surprisingly well for general-domain language and input.
But:
  They are far from being perfect.
  They aren't robust.
  They cannot cope with unforeseen situations.
  They cannot understand language as humans do.
  They are not (yet?) suited for conference interpretation.
Limitations as regards their fields of application.
Interpretation is often mission-critical.
Human interpreters won't be replaced anytime soon.

https://slator.com/features/ai-interpreter-fail-at-china-summit-sparks-debate-about-future-of-profession/

LT Current Developments
LT in Europe: World class research, strong SME base, thousands of LSPs; immense fragmentation; need for coordination.
Need for High-Quality LT: translation, interpretation, MDSM etc.
The European Language Challenge cannot be, and must not be, abandoned or outsourced!
Need for Language Technology, made in Europe, for Europe!
STOA Workshop in the EP (January 2017): "Language equality in the digital age: towards a Human Language Project"

Human Language Project
Goal: Deep Natural Language Understanding by 2030
Vision: EU FET Flagship Project (10+ years)
Broad coverage, high quality, high precision
Create approaches, algorithms, data sets, resources
Across modalities: text, text types, speech, video etc.
Artificial Intelligence including cognition, perception, vision, cross-modal, cross-platform, cross-culture etc.
Linguistics, Language Technology, Machine Learning

Summary & Conclusions
AI is disrupting all industries including translation and, increasingly, also interpretation.
But: perfect, robust, precise language technologies (incl. written/spoken MT and interpretation) are still far away.
Linguists are increasingly needed; new profiles are emerging.
The machine will support human experts and help them become more efficient; it will not replace them.
The Human Language Project is still a vision. Its goal: develop new breakthroughs in Language Technology.

Recommendation
SCIC Speech Repository: 4,000 speeches (3,000 public + 1,000 private)
Extremely interesting data set and language resource for Language Technology researchers!
Many R&D groups currently work on TED talk data sets
Recommendation: establish bridges between SCIC and research groups for spoken language translation
Help build the next generation of AI tools for interpreters
AI tools that are tailored to the needs and wishes, topics and domains of conference interpreters in the EC/EP

Thank you!
Dr. Georg Rehm, DFKI Berlin
georg.rehm@dfki.de
http://de.linkedin.com/in/georgrehm
https://www.slideshare.net/georgrehm

Strategic Research and Innovation Agenda: Language Technologies for Multilingual Europe. Towards a Human Language Project. SRIA Editorial Team, Version 1.0, December 2017.

Multilingualism is at the heart of the European idea
24 EU languages, all have the same status
Dozens of regional and minority languages as well as languages of immigrants and trade partners
Many economic and social challenges:
  The Digital Single Market needs to be multilingual
  Cross-border, cross-lingual, cross-cultural communication

(published in 2013)
(31 volumes; published in 2012)
60 research centres in 34 countries (founded in 2010)
Chair of Executive Board: Jan Hajic (CUNI)
Deputies: J. van Genabith (DFKI), A. Vasiljevs (Tilde)
General Secretary: Georg Rehm (DFKI)
Multilingual Europe Technology Alliance: 826 members in 67 countries
T4ME (META-NET), CESAR, META-NORD, METANET4U

Basque, Bulgarian*, Catalan, Croatian*, Czech*, Danish*, Dutch*, English*, Estonian*, Finnish*, French*, Galician, German*, Greek*, Hungarian*, Icelandic, Irish*, Italian*, Latvian*, Lithuanian*, Maltese*, Norwegian, Polish*, Portuguese*, Romanian*, Serbian, Slovak*, Slovene*, Spanish*, Swedish*, Welsh
* Official EU language
http://www.meta-net.eu/whitepapers

Language technology support by area (levels: excellent, good, moderate, fragmentary, weak or no support through LT)

Speech
  excellent: none
  good: English
  moderate: Czech, Dutch, Finnish, French, German, Italian, Portuguese, Spanish
  fragmentary: Basque, Bulgarian, Catalan, Danish, Estonian, Galician, Greek, Hungarian, Irish, Norwegian, Polish, Serbian, Slovak, Slovene, Swedish
  weak or no support through LT: Croatian, Icelandic, Latvian, Lithuanian, Maltese, Romanian, Welsh

MT
  excellent: none
  good: English
  moderate: French, Spanish
  fragmentary: Catalan, Dutch, German, Hungarian, Italian, Polish, Romanian
  weak or no support through LT: Basque, Bulgarian, Croatian, Czech, Danish, Estonian, Finnish, Galician, Greek, Icelandic, Irish, Latvian, Lithuanian, Maltese, Norwegian, Portuguese, Serbian, Slovak, Slovene, Swedish, Welsh

Resources
  excellent: none
  good: English
  moderate: Czech, Dutch, French, German, Hungarian, Italian, Polish, Spanish, Swedish
  fragmentary: Basque, Bulgarian, Catalan, Croatian, Danish, Estonian, Finnish, Galician, Greek, Norwegian, Portuguese, Romanian, Serbian, Slovak, Slovene
  weak or no support through LT: Icelandic, Irish, Latvian, Lithuanian, Maltese, Welsh

Text Analytics
  excellent: none
  good: English
  moderate: Dutch, French, German, Italian, Spanish
  fragmentary: Basque, Bulgarian, Catalan, Czech, Danish, Finnish, Galician, Greek, Hungarian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovene, Swedish
  weak or no support through LT: Croatian, Estonian, Icelandic, Irish, Latvian, Lithuanian, Maltese, Serbian, Welsh

Source: META-NET White Paper Series: Europe's Languages in the Digital Age. Springer, Heidelberg, New York, Dordrecht, London, September 2012. Georg Rehm and Hans Uszkoreit (series editors)

We carried out the study in 2010/2012. While support for many languages has improved in the meantime, the overall picture remains mostly the same.