A Future with Language Technology

Similar documents
Modeling user preferences and norms in context-aware systems

On Human Computer Interaction, HCI. Dr. Saif al Zahir Electrical and Computer Engineering Department UBC

Language Center. Course Catalog

Mathematics Program Assessment Plan

Robert S. Unnasch, Ph.D.

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

General study plan for third-cycle programmes in Sociology

STAFF DEVELOPMENT in SPECIAL EDUCATION

Thesis-Proposal Outline/Template

Inquiry Learning Methodologies and the Disposition to Energy Systems Problem Solving

Contents. Foreword... 5

WE GAVE A LAWYER BASIC MATH SKILLS, AND YOU WON T BELIEVE WHAT HAPPENED NEXT

The Curriculum in Primary Schools

Cognitive Thinking Style Sample Report

Course Outline for Honors Spanish II Mrs. Sharon Koller

The Verbmobil Semantic Database. Humboldt{Univ. zu Berlin. Computerlinguistik. Abstract

MSW POLICY, PLANNING & ADMINISTRATION (PP&A) CONCENTRATION

AUTHORITATIVE SOURCES ADULT AND COMMUNITY LEARNING LEARNING PROGRAMMES

The Common European Framework of Reference for Languages p. 58 to p. 82

Calculators in a Middle School Mathematics Classroom: Helpful or Harmful?

Cross Language Information Retrieval

Objectives. Chapter 2: The Representation of Knowledge. Expert Systems: Principles and Programming, Fourth Edition

Modified Systematic Approach to Answering Questions J A M I L A H A L S A I D A N, M S C.

Guidelines for Project I Delivery and Assessment Department of Industrial and Mechanical Engineering Lebanese American University

English Language and Applied Linguistics. Module Descriptions 2017/18

ELP in whole-school use. Case study Norway. Anita Nyberg

5 th Grade Language Arts Curriculum Map

$0/5&/5 '"$*-*5"503 %"5" "/"-:45 */4536$5*0/"- 5&$)/0-0(: 41&$*"-*45 EVALUATION INSTRUMENT. &valuation *nstrument adopted +VOF

The leaky translation process

Perception of Lecturer on Intercultural Competence and Culture Teaching Time (Case Study)

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Procedia - Social and Behavioral Sciences 146 ( 2014 )

Researcher Development Assessment A: Knowledge and intellectual abilities

Linking Task: Identifying authors and book titles in verbose queries

November 2012 MUET (800)

School Inspection in Hesse/Germany

ACCREDITATION STANDARDS

An Interactive Intelligent Language Tutor Over The Internet

Software Maintenance

KENTUCKY FRAMEWORK FOR TEACHING

Study Abroad Housing and Cultural Intelligence: Does Housing Influence the Gaining of Cultural Intelligence?

Information for Candidates

Guidelines for Writing an Internship Report

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

Setting the Scene: ECVET and ECTS the two transfer (and accumulation) systems for education and training

TEACHING QUALITY: SKILLS. Directive Teaching Quality Standard Applicable to the Provision of Basic Education in Alberta

San José State University Department of Psychology PSYC , Human Learning, Spring 2017

CROSS COUNTRY CERTIFICATION STANDARDS

Coast Academies Writing Framework Step 4. 1 of 7

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

TEACHING AND EXAMINATION REGULATIONS PART B: programme-specific section MASTER S PROGRAMME IN LOGIC

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Evidence for Reliability, Validity and Learning Effectiveness

Key concepts for the insider-researcher

Doctoral GUIDELINES FOR GRADUATE STUDY

2015 Academic Program Review. School of Natural Resources University of Nebraska Lincoln

Effective practices of peer mentors in an undergraduate writing intensive course

Language Acquisition Chart

How People Learn Physics

Bachelor of Arts. Intercultural German Studies. Language in intercultural contexts

Counseling 150. EOPS Student Readiness and Success

Multidisciplinary Engineering Systems 2 nd and 3rd Year College-Wide Courses

Biomedical Sciences (BC98)

Seminar - Organic Computing

PSY 1010, General Psychology Course Syllabus. Course Description. Course etextbook. Course Learning Outcomes. Credits.

CHAPTER IV RESEARCH FINDING AND DISCUSSION

Initial teacher training in vocational subjects

ANGLAIS LANGUE SECONDE

Knowledge based expert systems D H A N A N J A Y K A L B A N D E

Syllabus: MKT Online Marketing (MKT3202) / MKT Introduction into Online Technologies for Marketing Professionals (MKT3205)

MISSISSIPPI OCCUPATIONAL DIPLOMA EMPLOYMENT ENGLISH I: NINTH, TENTH, ELEVENTH AND TWELFTH GRADES

Maintaining Resilience in Teaching: Navigating Common Core and More Site-based Participant Syllabus

What the National Curriculum requires in reading at Y5 and Y6

USER ADAPTATION IN E-LEARNING ENVIRONMENTS

Curriculum Policy. November Independent Boarding and Day School for Boys and Girls. Royal Hospital School. ISI reference.

HOLISTIC LESSON PLAN Nov. 15, 2010 Course: CHC2D (Grade 10, Academic History)

Value of Athletics in Higher Education March Prepared by Edward J. Ray, President Oregon State University

CROSS-LANGUAGE INFORMATION RETRIEVAL USING PARAFAC2

Essentials of Ability Testing. Joni Lakin Assistant Professor Educational Foundations, Leadership, and Technology

Math Pathways Task Force Recommendations February Background

The KAM project: Mathematics in vocational subjects*

WORK OF LEADERS GROUP REPORT

*Net Perceptions, Inc West 78th Street Suite 300 Minneapolis, MN

Individual Interdisciplinary Doctoral Program Faculty/Student HANDBOOK

Applications of memory-based natural language processing

Number of students enrolled in the program in Fall, 2011: 20. Faculty member completing template: Molly Dugan (Date: 1/26/2012)

Assignment 1: Predicting Amazon Review Ratings

Hueber Worterbuch Learner's Dictionary: Deutsch Als Fremdsprache / German-English / English-German Deutsch- Englisch / Englisch-Deutsch By Olaf

Alpha provides an overall measure of the internal reliability of the test. The Coefficient Alphas for the STEP are:

Teachers response to unexplained answers

Participate in expanded conversations and respond appropriately to a variety of conversational prompts

I. INTRODUCTION. for conducting the research, the problems in teaching vocabulary, and the suitable

GUIDE TO EVALUATING DISTANCE EDUCATION AND CORRESPONDENCE EDUCATION

A new Dataset of Telephone-Based Human-Human Call-Center Interaction with Emotional Evaluation

2020 Strategic Plan for Diversity and Inclusive Excellence. Six Terrains

Gifted/Challenge Program Descriptions Summer 2016

CS 598 Natural Language Processing

1. Answer the questions below on the Lesson Planning Response Document.

Jahrgangsstufentest ENGLISCH. an bayerischen Realschulen. Termin: Donnerstag, 8. Oktober Bearbeitungszeit: 45 Minuten.

Kelli Allen. Vicki Nieter. Jeanna Scheve. Foreword by Gregory J. Kaiser

Transcription:

Karin Haenelt A Future with Language Technology Human language technology is expected to permeate all areas of our life in the future (HOPE, 2002). Progress in computational linguistics and human language technology will not only lead to an omnipresence of language processing chips in clothes, offices, houses, shops and other locations, it also opens new perspectives in politics, society, economics, or education. In order to sound out estimations and expectations a short online-questioning (Haenelt, 2002) has been run with language processing professionals, students and educated people from other professions. Since the questioning was run for a very short time only and the participants acceded more or less randomly, this study does not claim to be representative. The questioning focused on perspectives that are opened by computational linguistics and human language technology. Four examples from politics, society, education and science shall be provided in this article. 1 Politics: PeaceMaker The most focused question with respect to politics was, whether the participants could imagine a machine which mediates in the conflicts between peoples. No one doubted the technical feasibility of such a machine. This probably results from the fact, that in these days even in the field of human language technology partial and stepwise improved solutions are becoming similarly accepted as in other technological fields. So solutions are not necessarily expected to work fully automatically and without any human participation, but rather systems can be imagined which could structure a discussion, collect the goals of different peoples, support the comparison of these goals, access background information such as laws and other cultural and historical conventions, and could support the finding of solutions. All participants ranked such a machine as a highly desirable goal. But sadly enough, most contributors doubted the political will for using such a machine. 2 Society: Everyday life with language technology Imagine your car can explain the repair it needs. This is the type of question that is typical in these days for characterizing the potential of language processing (cf.

2 Karin Haenelt HOPE, 2002). All participants could imagine this, and all participants found this highly desirable. More and more language technology products appear in the market. Some are accepted readily imperfect as some of them are, since there is no alternative (such as for instance search engines), others compete with alternative possibilities (such as dictation systems vs. keyboards). After a first phase of aiming at fully automatic high quality processing as the (more or less) only imaginable marketable state of the art in the sixties of the last century (ALPAC, 1966) human language technology has gained some experience in which gears and clutch pedals are accepted by the market, and some dialogue with the market and training has developed some experience of customers. Examples of products which can be offered with restricted language processing capabilities are - voice control of everyday machines (operating of telephones, ordering of cinema tickets), - programming of everyday life systems: video recorder, telephone systems, house technology - dictation systems - dialogue systems, (telephone) information systems, call centres - text processing with spelling and grammar correction - translation systems - (internet) search engines Currently, however, many of these systems are more or less systems for experts, because their application requires some expertise for filling the gap between the processing provided and the processing needed. Examples of such expertise are - speaker adaptation, dictionary updates, grammar updates, formal knowledge modelling, or providing examples and training of rules, - knowledge of the technology for estimating the usability and for making optimal use of systems: - knowledge of information retrieval methods in order to formulate an optimal search query, - knowledge of parsing / generation / translation technology in order to determine or even formulate texts which are best suited for machine translation or to understand processing errors and perform pre- or post-editing efficiently) - realistic estimation of error rates and reliability in order to determine possible application areas. All participants of the questioning appreciated the support provided by the products which are available now and which make many tasks easier, but they also wished a better command of language of the systems in order to be able to concentrate on their communicative goals, rather than to have to deal with matters of the system. Clearly,

3 A Future with Language Technology the goal is the development of more advanced and more comfortable and broader applicable systems. The more progress will be made towards successfully managing everyday language, the more human language technology will become an everyday life technology. 3 Education: Integrating linguistic and mathematical gifts All participants agreed that interdisciplinary work will become more and more important in future, and that more training in cross-method thinking will be required in education. Computational linguistics and language technology already comprise approaches from linguistics, from mathematics, from engineering and other disciplines. Based on this interdisciplinary experience this science and technology can offer support to forward interdisciplinary thinking in school education: They can contribute educational topics and methods which provide an excellent opportunity to overcome a split of pupils into those which are linguistically gifted and those which are mathematically gifted in favour of pupils which are well-educated. For a better understanding and integration of linguistics and mathematics it could be useful to cross the boundaries and to teach mathematics with examples from language technology and linguistics with examples from mathematics. Applications like part-ofspeech-tagging with Hidden Markov Models or text retrieval with vector models are well suited for giving linguistically advanced pupils a deeper understanding of probability theory and vector algebra, and mathematically advanced pupils a deeper understanding of parts of speech, sentence structures, and text and content constitution. Integrating such applications into teaching also allows for combining example-based learning and theory-based learning, and thus also broadens the basis for understanding the subject. In addition, the applications are very good examples of the applicability of linguistics and mathematics beyond class tests and exam papers, and for developing methods by crossing the boundaries of individual disciplines. The learning process can further be supported by applying the knowledge to the development of small programs. There are applications for which basic versions can be implemented with fairly simple programs (cf. Haenelt, 2001). In the online discussion not only computational linguists but also chemists, lawyers, and mathematicians unanimously agreed that they would appreciate examples from language technology in mathematics lessons.

4 Karin Haenelt 4 Science: Understanding the understanding the great challenge Currently human language is the most simple and flexible means of knowledge and information handling. It allows for the constitution of information, for its presentation, memorization and for its communication. It can be very economically adapted to specific communication purposes. Linguistic communication succeeds on the basis of a certain breadth and depth of variation and vagueness. Language enables humans to think and thus to access and integrate heterogeneous and incomplete information as well as to develop knowledge and information in the progress of time. The ability to model these properties and functions will be essential for the development of more flexible natural language systems. Although much progress has been made, currently there is no consistent comprehensive technology for the development of corresponding systems, and even research on individual components has not been completed. Therefore research towards this goal is still a complex undertaking. It begins with exploring and modelling language mechanisms which serve to constitute, to communicate and to understand concepts. It includes pragmatic conditions and cognitive processes, and at an even higher level of complexity it also ranges to physical and chemical processes in the brain. With respect to time perspectives participants think that more progress will be made in the area of semantic processing within the near future. With respect to cognitive processes one participant wrote: In fifty years we will be even more astonished how complex, unique and estimable humans are, since we still will not have succeeded in understanding cognitive processes, not to mention to imitate them. Another participant answered with respect to biotechnological perspectives: Currently we know at most one thousandth of how the brain works. If we aim at an electronic system which also communicates with the brain (for medical purposes for instance), we additionally need a computer technology in nm- and pm-dimensions. The person believes that it will take at least another hundred years to investigate the processes in the brain. The person currently regards the development of a speaking computer fitted with some intelligence and emotions which can manoeuvre a spaceship as more feasible. This, however, should have existed in 2001 already according to the film Odyssey 2001. As we have seen in the last years, human language technology has made much progress, and the participants believed that progress has also been made with respect to an estimation of the tasks to be solved, of their complexity and their application breadth, in the research community as well as in industrial development and in the market and with funding authorities. They believe that this will lead to a realistic estimation and better allocation of the resources needed for research and development in this area. Language technology has been estimated a multi-billion dollar industry (Spiegel, 1999). This is the more promising considering that we are only at the beginning of its scientific, technologic and economic development.

5 A Future with Language Technology References ALPAC (1966): Language and Machines. Computers in Translation and Linguistics. (ALPAC report, 1966). National Academy of Sciences. Haenelt, Karin (2002): Mit Computerlinguistik und Sprachtechnologie in die Zukunft. Eine Einladung zum Querdenken und Mitreden. Diskussion. Fragebogen. Literatur. Links. - <http://kontext.fraunhofer.de/haenelt/hlt/cl-futur2002/> Haenelt, Karin (2001): Beispielprogramm zur Berechnung der Ähnlichkeit zwischen Anfrage- und Dokumentvektoren nach der Cosinus-Formel 19.12.2001. - <http://kontext.fraunhofer.de/haenelt/kurs/folien/irvektorberechnung.html> HOPE (2002): Das europäische Projekt zur Förderung sprachtechnologischer Anwendungen: HOPE/EUROMAP 3 <www.bit.ac.at/4_fue_5rp_2tp_euromap_info.htm> Spiegel (1999): Der Spiegel. Nr. 15, 12.04.1999, S. 124-127.