Error Correcting Romaji-kana Conversion for Japanese Language Education

Size: px
Start display at page:

Download "Error Correcting Romaji-kana Conversion for Japanese Language Education"

Transcription

1 SNS 10% Error Correcting Romaji-kana Conversion for Japanese Language Education Seiji Kasahara, 1 Mamoru Komachi, 1 Masaaki Nagata 2 and Yuuji Matsumoto 1 We present an approach to help Japanese editors on language learning SNS correct learners sentences written in roman characters by converting them into kana Our system detects foreign words and converts only Japanese words even if it contains spelling errors. Experimental results show that our system achieves about 10 points higher conversion accuracy than one of traditional input methods. Error analysis reveals tendency of errors made by learners. For example, learners tend to be confused by vowels and make errors caused by nature of their native language. 1 Nara Institute of Science and Technology 2 NTT NTT Communication Science Laboratories SNS c 2011 Information Processing Society of Japan

2 2 3 4 SNS Lang ) n-gram?) 2) 3) Lang-8 SNS Lang , , % 763,971 10,000 1 Lang-8 Lang OK desu hanasemasu hanashimasu mada made 9 ha no 10 amerikajin americagen amerika america jin gen Lang-8 ha wa wo o he e c 2011 Information Processing Society of Japan

3 1 Onaka ga itai desu! Onaka ga itai desu! 2 suki ni narimasu. suki ni narimasu.perfect! 3 Isogashikatta. Isogashikatta. 4 gakko wa omoshiroi desu. gakko wa omoshiroi desu. 5 Tokyo ni irutoki, Meiji-jingu mo ni ikimashita. Tokyo ni irutoki, Meiji-jingu ni mo ikimashita. 6 Noh ni mimashita. Nihonjin no tomodachi ga Noh wo misetekuremashita. 7 Konnichiwa! OK desu 8 nihongo ga sukoshi hanashimasu demo made jouzu ja arimasen. nihongo ga sukoshi hanasemasu demo mada jouzu ja arimasen. 9 Chichi no atama ga ii desu. Chichi ha atama ga ii desu. 10 watashi wa americagen desu. watashi wa amerikajin desu. 1 Lang WordNet IPADic CaboCha , uni-gram IPADic n-gram n 1 packu 163 kau pakku chikau 4) n-gram 5-gram 1991 kakasi 1 kakasi c 2011 Information Processing Society of Japan

4 2 yorushiku onegia shimasu. yoroshiku onegai shimasu. Muscle musical wo mietai. Muscle musical wo mitai. Muscle musical Gorofu ga daisuki desu gorufu ga daisuki desu Lang-8 SRILM Witten-Bell 5.3 ca, ci, cu, ce, co ka, shi, ku, se, ko m n kinyuu n Recall = N t N w, P recision = N t N e Nt Nw Ne 6.2 Anthy Anthy Anthy 6.3 Lang-8 Lang % Anthy 74.5% % % c 2011 Information Processing Society of Japan

5 1 domou doumo 2 Yorushiko onegai shimasu yoroshiku onegai shimasu 3 Merrii kurisamasu, mina-san merii kurisumasu minasan 4 domo arigato guzaimasu doumo arigatou gozaimasu 5 nihongo ga scoshi wakarimasu s nihongo ga sukoshi wakarimasu 6 hajimimashtei sh hajimemashite 7 donna eigaosaiking mimashitaka donna eiga wo saikin mimashitaka 8 Horandajin desu orandajin desu 9 Nihon go wa totemo musugashi desu nihon go wa totemo muzukashii desu 5 1 Soshite, kurama wo durivu wo shimasu, Soshite, kuruma wo doraibu wo shimasu 2 boku wa nagai ichi-nichi no renshou o shimasu boku wa nagai ichi nichi no renshuu o shimasu 3 Terebi gamu wo asobitai desu terebi geemu wo asobitai desu 6 shuutmatsu t shuumatsu do-yoobi doyoubi packu c pakku 4 durivu doraibu 3 prutugarogo p porutogarugo 3 musugashi muzukashii % % 78.1% ?? renhuu renshou renshou n-gram muzukashii musugashi 5 c 2011 Information Processing Society of Japan

6 denwabangou denwa bangou Meiji-jingu meiji jinguu nouryokushiken nouryoku shiken doumo domou 5 6 su shi 7 n ng 8 9 5) nouryokushiken nouryoku shiken IPADic nouryokushiken Lang-8 1) Zheng Chen and Kai-Fu Lee. A New Statistical Approach to Chinese Pinyin Input. In Proceedings of ACL, pp , ) Yo Ehara and Kumiko Tanaka-Ishii. Multilingual Text Entry using Automatic Language Detection. In Proceedings of IJCNLP, pp , ) Tomoya Mizumoto, Mamoru Komachi, Masaaki Nagata, and Yuji Matsumoto. Mining Revision Log of Language Learning SNS for Automated Japanese Error Correction of Second Language Learners. In Proceedings of IJCNLP, ) Naoaki Okazaki and Jun ichi Tsujii. Simple and Efficient Algorithm for Approximate Dictionary Matching. In Proceedings of COLING, pp , ) Kumiko Tanaka-Ishii, Yusuke Inutsuka, and Masato Takeichi. Japanese input system with digits Can Japanese be input only with consonants? In Proceedings of HLT, pp , ) Yabin Zheng, Chen Li, and Maosong Sun. CHIME: An Efficient Error-Tolerant Chinese Pinyin Input Method. In Proceedings of IJCAI, pp , ). N-gram., Vol.40, No.6, pp , SNS 10 6 c 2011 Information Processing Society of Japan

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches Yu-Chun Wang Chun-Kai Wu Richard Tzong-Han Tsai Department of Computer Science

More information

Kwansei Gakuin University Rep

Kwansei Gakuin University Rep Kwansei Gakuin University Rep Title Author(s) Politeness Strategies in Japanese H and Japanese in strategic planning Obana, Yasuko, 尾鼻, 靖子 Citation 言語と文化, 12: 39-53 Issue Date 2009-02-20 URL http://hdl.handle.net/10236/1653

More information

A Named Entity Recognition Method using Rules Acquired from Unlabeled Data

A Named Entity Recognition Method using Rules Acquired from Unlabeled Data A Named Entity Recognition Method using Rules Acquired from Unlabeled Data Tomoya Iwakura Fujitsu Laboratories Ltd. 1-1, Kamikodanaka 4-chome, Nakahara-ku, Kawasaki 211-8588, Japan iwakura.tomoya@jp.fujitsu.com

More information

arxiv: v1 [cs.cl] 2 Apr 2017

arxiv: v1 [cs.cl] 2 Apr 2017 Word-Alignment-Based Segment-Level Machine Translation Evaluation using Word Embeddings Junki Matsuo and Mamoru Komachi Graduate School of System Design, Tokyo Metropolitan University, Japan matsuo-junki@ed.tmu.ac.jp,

More information

Trend Survey on Japanese Natural Language Processing Studies over the Last Decade

Trend Survey on Japanese Natural Language Processing Studies over the Last Decade Trend Survey on Japanese Natural Language Processing Studies over the Last Decade Masaki Murata, Koji Ichii, Qing Ma,, Tamotsu Shirado, Toshiyuki Kanamaru,, and Hitoshi Isahara National Institute of Information

More information

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks

POS tagging of Chinese Buddhist texts using Recurrent Neural Networks POS tagging of Chinese Buddhist texts using Recurrent Neural Networks Longlu Qin Department of East Asian Languages and Cultures longlu@stanford.edu Abstract Chinese POS tagging, as one of the most important

More information

My Japanese Coach: Lesson I, Basic Words

My Japanese Coach: Lesson I, Basic Words My Japanese Coach: Lesson I, Basic Words Lesson One: Basic Words Hi! I m Haruka! It s nice to meet you. I m here to teach you Japanese. So let s get right into it! Here is a list of words in Japanese.

More information

Automatic English-Chinese name transliteration for development of multilingual resources

Automatic English-Chinese name transliteration for development of multilingual resources Automatic English-Chinese name transliteration for development of multilingual resources Stephen Wan and Cornelia Maria Verspoor Microsoft Research Institute Macquarie University Sydney NSW 2109, Australia

More information

Add -reru to the negative base, that is to the "-a" syllable of any Godan Verb. e.g. becomes becomes

Add -reru to the negative base, that is to the -a syllable of any Godan Verb. e.g. becomes becomes The "Passive." Formation i) Ichidan Verbs: Add -rareru to the negative base, e.g. remove from, add inflection to thus, ii. Godan Verbs: Add -reru to the negative base, that is to the "-a" syllable of any

More information

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY Chen, Hsin-Hsi Department of Computer Science and Information Engineering National Taiwan University Taipei, Taiwan E-mail: hh_chen@csie.ntu.edu.tw Abstract

More information

3 Character-based KJ Translation

3 Character-based KJ Translation NICT at WAT 2015 Chenchen Ding, Masao Utiyama, Eiichiro Sumita Multilingual Translation Laboratory National Institute of Information and Communications Technology 3-5 Hikaridai, Seikacho, Sorakugun, Kyoto,

More information

arxiv:cs/ v2 [cs.cl] 7 Jul 1999

arxiv:cs/ v2 [cs.cl] 7 Jul 1999 Cross-Language Information Retrieval for Technical Documents Atsushi Fujii and Tetsuya Ishikawa University of Library and Information Science 1-2 Kasuga Tsukuba 35-855, JAPAN {fujii,ishikawa}@ulis.ac.jp

More information

Implementing the Syntax of Japanese Numeral Classifiers

Implementing the Syntax of Japanese Numeral Classifiers Implementing the Syntax of Japanese Numeral Classifiers Emily M. Bender 1 and Melanie Siegel 2 1 University of Washington, Department of Linguistics, Box 354340, Seattle WA 98195-4340 ebender@u.washington.edu

More information

user s utterance speech recognizer content word N-best candidates CMw (content (semantic attribute) accept confirm reject fill semantic slots

user s utterance speech recognizer content word N-best candidates CMw (content (semantic attribute) accept confirm reject fill semantic slots Flexible Mixed-Initiative Dialogue Management using Concept-Level Condence Measures of Speech Recognizer Output Kazunori Komatani and Tatsuya Kawahara Graduate School of Informatics, Kyoto University Kyoto

More information

Investigation on Mandarin Broadcast News Speech Recognition

Investigation on Mandarin Broadcast News Speech Recognition Investigation on Mandarin Broadcast News Speech Recognition Mei-Yuh Hwang 1, Xin Lei 1, Wen Wang 2, Takahiro Shinozaki 1 1 Univ. of Washington, Dept. of Electrical Engineering, Seattle, WA 98195 USA 2

More information

Abbreviated text input. The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters.

Abbreviated text input. The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Abbreviated text input The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters. Citation Published Version Accessed Citable Link Terms

More information

The Ups and Downs of Preposition Error Detection in ESL Writing

The Ups and Downs of Preposition Error Detection in ESL Writing The Ups and Downs of Preposition Error Detection in ESL Writing Joel R. Tetreault Educational Testing Service 660 Rosedale Road Princeton, NJ, USA JTetreault@ets.org Martin Chodorow Hunter College of CUNY

More information

Words come in categories

Words come in categories Nouns Words come in categories D: A grammatical category is a class of expressions which share a common set of grammatical properties (a.k.a. word class or part of speech). Words come in categories Open

More information

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar Chung-Chi Huang Mei-Hua Chen Shih-Ting Huang Jason S. Chang Institute of Information Systems and Applications, National Tsing Hua University,

More information

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.**

**Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** **Note: this is slightly different from the original (mainly in format). I would be happy to send you a hard copy.** REANALYZING THE JAPANESE CODA NASAL IN OPTIMALITY THEORY 1 KATSURA AOYAMA University

More information

Using a Native Language Reference Grammar as a Language Learning Tool

Using a Native Language Reference Grammar as a Language Learning Tool Using a Native Language Reference Grammar as a Language Learning Tool Stacey I. Oberly University of Arizona & American Indian Language Development Institute Introduction This article is a case study in

More information

Evaluation of a Simultaneous Interpretation System and Analysis of Speech Log for User Experience Assessment

Evaluation of a Simultaneous Interpretation System and Analysis of Speech Log for User Experience Assessment Evaluation of a Simultaneous Interpretation System and Analysis of Speech Log for User Experience Assessment Akiko Sakamoto, Kazuhiko Abe, Kazuo Sumita and Satoshi Kamatani Knowledge Media Laboratory,

More information

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology

Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano. Graduate School of Information Science, Nara Institute of Science & Technology ISCA Archive SUBJECTIVE EVALUATION FOR HMM-BASED SPEECH-TO-LIP MOVEMENT SYNTHESIS Eli Yamamoto, Satoshi Nakamura, Kiyohiro Shikano Graduate School of Information Science, Nara Institute of Science & Technology

More information

UNIVERS, Macrocosm and Microcosm in Western Art. The picture is the Battle of

UNIVERS, Macrocosm and Microcosm in Western Art. The picture is the Battle of DESIGN THEORY AND HISTORY OF MODERN JAPAN Haruhiko Fujita 1. Introduction It is my honor to have an opportunity to give a lecture at the Residenza di Studi Superiori for the Institute of Advanced Studies

More information

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017

What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017 What Can Neural Networks Teach us about Language? Graham Neubig a2-dlearn 11/18/2017 Supervised Training of Neural Networks for Language Training Data Training Model this is an example the cat went to

More information

JYPE Spring Semester Course Description

JYPE Spring Semester Course Description JYPE 2016-2017 Spring Semester Course Description Tohoku University Institute for Excellence in Higher Education Contents Page Course Title 1 Japanese 1, 2, 3, 4 10 Japanese Culture B 12 Japanese Culture

More information

30 Sociocultural theory and the zone of proximal development

30 Sociocultural theory and the zone of proximal development 30 Sociocultural theory and the zone of proximal development Amy Snyder Ohta 30.1 Introduction There has been increasing interest in perspectives on second language acquisition that are grounded in discourse,

More information

LINGUIST List

LINGUIST List Page 1 of 6 LINGUIST List 20.1356 Thu Apr 09 2009 Review: Sociolinguistics: Kaske (2007) Editor for this issue: Randall Eggert This LINGUIST List issue is a review of a book published

More information

Disambiguation of Thai Personal Name from Online News Articles

Disambiguation of Thai Personal Name from Online News Articles Disambiguation of Thai Personal Name from Online News Articles Phaisarn Sutheebanjard Graduate School of Information Technology Siam University Bangkok, Thailand mr.phaisarn@gmail.com Abstract Since online

More information

Learning Methods in Multilingual Speech Recognition

Learning Methods in Multilingual Speech Recognition Learning Methods in Multilingual Speech Recognition Hui Lin Department of Electrical Engineering University of Washington Seattle, WA 98125 linhui@u.washington.edu Li Deng, Jasha Droppo, Dong Yu, and Alex

More information

Speech Emotion Recognition Using Support Vector Machine

Speech Emotion Recognition Using Support Vector Machine Speech Emotion Recognition Using Support Vector Machine Yixiong Pan, Peipei Shen and Liping Shen Department of Computer Technology Shanghai JiaoTong University, Shanghai, China panyixiong@sjtu.edu.cn,

More information

Mandarin Lexical Tone Recognition: The Gating Paradigm

Mandarin Lexical Tone Recognition: The Gating Paradigm Kansas Working Papers in Linguistics, Vol. 0 (008), p. 8 Abstract Mandarin Lexical Tone Recognition: The Gating Paradigm Yuwen Lai and Jie Zhang University of Kansas Research on spoken word recognition

More information

Information Session 13 & 19 August 2015

Information Session 13 & 19 August 2015 Information Session 13 & 19 August 2015 Mr Johnie Goh Office of Global Education & Mobility Increase career prospects Immerse in another culture Complement your language studies in NTU Earn AUs during

More information

Combining a Chinese Thesaurus with a Chinese Dictionary

Combining a Chinese Thesaurus with a Chinese Dictionary Combining a Chinese Thesaurus with a Chinese Dictionary Ji Donghong Kent Ridge Digital Labs 21 Heng Mui Keng Terrace Singapore, 119613 dhji @krdl.org.sg Gong Junping Department of Computer Science Ohio

More information

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments Vijayshri Ramkrishna Ingale PG Student, Department of Computer Engineering JSPM s Imperial College of Engineering &

More information

FEIRONG YUAN, PH.D. Updated: April 15, 2016

FEIRONG YUAN, PH.D. Updated: April 15, 2016 FEIRONG YUAN, PH.D. Assistant Professor The University of Texas at Arlington College of Business Department of Management Box 19467 701 S. West Street, Suite 226 Arlington, TX 76019-0467 Phone: 817-272-3863

More information

Information Retrieval

Information Retrieval Information Retrieval Suan Lee - Information Retrieval - 02 The Term Vocabulary & Postings Lists 1 02 The Term Vocabulary & Postings Lists - Information Retrieval - 02 The Term Vocabulary & Postings Lists

More information

Anatomy and Physiology. Astronomy. Boomilever. Bungee Drop

Anatomy and Physiology. Astronomy. Boomilever. Bungee Drop Anatomy and Physiology 2nd 28 MN Mounds View H.S. 3rd 5 NC William G. Enloe H.S. 4th 20 TX Seven Lakes H.S. 5th 29 NJ West Windsor Plainsboro South 6th 6 NC Raleigh Charter H.S. Astronomy 1st 4 CA Mira

More information

Overview of the 3rd Workshop on Asian Translation

Overview of the 3rd Workshop on Asian Translation Overview of the 3rd Workshop on Asian Translation Toshiaki Nakazawa Chenchen Ding and Hideya Mino Japan Science and National Institute of Technology Agency Information and nakazawa@pa.jst.jp Communications

More information

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape

Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Lip reading: Japanese vowel recognition by tracking temporal changes of lip shape Koshi Odagiri 1, and Yoichi Muraoka 1 1 Graduate School of Fundamental/Computer Science and Engineering, Waseda University,

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

DEPARTMENT OF JAPANESE LANGUAGE AND STUDIES

DEPARTMENT OF JAPANESE LANGUAGE AND STUDIES FCC Curriculum 98 DEPARTMENT OF JAPANESE LANGUAGE AND STUDIES The Department of Japanese Language and Studies has two majors: Japanese Linguistics and Teaching Methods Japanese Studies Students entering

More information

Survey on parsing three dependency representations for English

Survey on parsing three dependency representations for English Survey on parsing three dependency representations for English Angelina Ivanova Stephan Oepen Lilja Øvrelid University of Oslo, Department of Informatics { angelii oe liljao }@ifi.uio.no Abstract In this

More information

THE ACQUISITION OF ARGUMENT ELLIPSIS IN JAPANESE: A PRELIMINARY STUDY* Koji Sugisaki Mie University

THE ACQUISITION OF ARGUMENT ELLIPSIS IN JAPANESE: A PRELIMINARY STUDY* Koji Sugisaki Mie University THE ACQUISITION OF ARGUMENT ELLIPSIS IN JAPANESE: A PRELIMINARY STUDY* Koji Sugisaki Mie University 1. Introduction Japanese is a language that allows productive use of null arguments in finite clauses.

More information

A heuristic framework for pivot-based bilingual dictionary induction

A heuristic framework for pivot-based bilingual dictionary induction 2013 International Conference on Culture and Computing A heuristic framework for pivot-based bilingual dictionary induction Mairidan Wushouer, Toru Ishida, Donghui Lin Department of Social Informatics,

More information

Rule Learning with Negation: Issues Regarding Effectiveness

Rule Learning with Negation: Issues Regarding Effectiveness Rule Learning with Negation: Issues Regarding Effectiveness Stephanie Chua, Frans Coenen, and Grant Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX

More information

Problems of the Arabic OCR: New Attitudes

Problems of the Arabic OCR: New Attitudes Problems of the Arabic OCR: New Attitudes Prof. O.Redkin, Dr. O.Bernikova Department of Asian and African Studies, St. Petersburg State University, St Petersburg, Russia Abstract - This paper reviews existing

More information

Basic German: CD/Book Package (LL(R) Complete Basic Courses) By Living Language

Basic German: CD/Book Package (LL(R) Complete Basic Courses) By Living Language Basic German: CD/Book Package (LL(R) Complete Basic Courses) By Living Language If searching for the book by Living Language Basic German: CD/Book Package (LL(R) Complete Basic Courses) in pdf format,

More information

2012 Ph.D. University of Maryland, College Park (UMD). Physics. (December, anticipated)

2012 Ph.D. University of Maryland, College Park (UMD). Physics. (December, anticipated) Michael M. Hull CV pg 1 Hull, Michael M. Current Contact Information University of Maryland, Department of Physics 082 Regents Drive College Park, MD 20742-4111 (301) 938-9233; mhull12@umd.edu; http://umdperg.pbworks.com/mike-hull

More information

Why Is the Chinese Curriculum Difficult for Immigrants Children from Southeast Asia

Why Is the Chinese Curriculum Difficult for Immigrants Children from Southeast Asia Why Is the Chinese Curriculum Difficult for Immigrants Children from Southeast Asia Chiu-Jung Chen 1,* 1 Department of E-Learning, Design and Management, National Chia-yi University, Taiwan *Correspondence:

More information

Al Cornish Head, Library Systems Washington State University Libraries Pullman, WA

Al Cornish Head, Library Systems Washington State University Libraries Pullman, WA Al Cornish Head, Library Systems Washington State University Libraries Pullman, WA 99164-5610 e-mail: cornish@wsu.edu web home: http://www.wsu.edu/~cornish EDUCATION LOUISIANA STATE UNIVERSITY, BATON ROUGE,

More information

Distant Supervised Relation Extraction with Wikipedia and Freebase

Distant Supervised Relation Extraction with Wikipedia and Freebase Distant Supervised Relation Extraction with Wikipedia and Freebase Marcel Ackermann TU Darmstadt ackermann@tk.informatik.tu-darmstadt.de Abstract In this paper we discuss a new approach to extract relational

More information

What the National Curriculum requires in reading at Y5 and Y6

What the National Curriculum requires in reading at Y5 and Y6 What the National Curriculum requires in reading at Y5 and Y6 Word reading apply their growing knowledge of root words, prefixes and suffixes (morphology and etymology), as listed in Appendix 1 of the

More information

Noisy Channel Models for Corrupted Chinese Text Restoration and GB-to-Big5 Conversion

Noisy Channel Models for Corrupted Chinese Text Restoration and GB-to-Big5 Conversion Computational Linguistics and Chinese Language Processing vol. 3, no. 2, August 1998, pp. 79-92 79 Computational Linguistics Society of R.O.C. Noisy Channel Models for Corrupted Chinese Text Restoration

More information

English (from Chinese) (Language Learners) By Daniele Bourdaise

English (from Chinese) (Language Learners) By Daniele Bourdaise English (from Chinese) (Language Learners) By Daniele Bourdaise If you are searched for the book by Daniele Bourdaise English (from Chinese) (Language Learners) in pdf format, then you have come on to

More information

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words, A Language-Independent, Data-Oriented Architecture for Grapheme-to-Phoneme Conversion Walter Daelemans and Antal van den Bosch Proceedings ESCA-IEEE speech synthesis conference, New York, September 1994

More information

TextGraphs: Graph-based algorithms for Natural Language Processing

TextGraphs: Graph-based algorithms for Natural Language Processing HLT-NAACL 06 TextGraphs: Graph-based algorithms for Natural Language Processing Proceedings of the Workshop Production and Manufacturing by Omnipress Inc. 2600 Anderson Street Madison, WI 53704 c 2006

More information

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays

Longest Common Subsequence: A Method for Automatic Evaluation of Handwritten Essays IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661,p-ISSN: 2278-8727, Volume 17, Issue 6, Ver. IV (Nov Dec. 2015), PP 01-07 www.iosrjournals.org Longest Common Subsequence: A Method for

More information

Improving the Quality of MT Output using Novel Name Entity Translation Scheme

Improving the Quality of MT Output using Novel Name Entity Translation Scheme Improving the Quality of MT Output using Novel Name Entity Translation Scheme Deepti Bhalla Department of Computer Science Banasthali University Rajasthan, India deeptibhalla0600@gmail.com Nisheeth Joshi

More information

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks

Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks Devendra Singh Chaplot, Eunhee Rhim, and Jihie Kim Samsung Electronics Co., Ltd. Seoul, South Korea {dev.chaplot,eunhee.rhim,jihie.kim}@samsung.com

More information

Artwork and Drama Activities Using Literature with High School Students

Artwork and Drama Activities Using Literature with High School Students Artwork and Drama Activities Using Literature with High School Students Vicky Ann Richings Kwansei Gakuin University Richings@kwansei.ac.jp Masateru Nishimuro Kwansei Gakuin Senior High School mnishimuro@kwansei.ac.jp

More information

Predicting Students Performance with SimStudent: Learning Cognitive Skills from Observation

Predicting Students Performance with SimStudent: Learning Cognitive Skills from Observation School of Computer Science Human-Computer Interaction Institute Carnegie Mellon University Year 2007 Predicting Students Performance with SimStudent: Learning Cognitive Skills from Observation Noboru Matsuda

More information

METHODS FOR EXTRACTING AND CLASSIFYING PAIRS OF COGNATES AND FALSE FRIENDS

METHODS FOR EXTRACTING AND CLASSIFYING PAIRS OF COGNATES AND FALSE FRIENDS METHODS FOR EXTRACTING AND CLASSIFYING PAIRS OF COGNATES AND FALSE FRIENDS Ruslan Mitkov (R.Mitkov@wlv.ac.uk) University of Wolverhampton ViktorPekar (v.pekar@wlv.ac.uk) University of Wolverhampton Dimitar

More information

The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation

The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation AUTHORS AND AFFILIATIONS MSR: Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova,

More information

Multiobjective Optimization for Biomedical Named Entity Recognition and Classification

Multiobjective Optimization for Biomedical Named Entity Recognition and Classification Available online at www.sciencedirect.com Procedia Technology 6 (2012 ) 206 213 2nd International Conference on Communication, Computing & Security (ICCCS-2012) Multiobjective Optimization for Biomedical

More information

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING SISOM & ACOUSTICS 2015, Bucharest 21-22 May THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING MarilenaăLAZ R 1, Diana MILITARU 2 1 Military Equipment and Technologies Research Agency, Bucharest,

More information

Memory-based grammatical error correction

Memory-based grammatical error correction Memory-based grammatical error correction Antal van den Bosch Peter Berck Radboud University Nijmegen Tilburg University P.O. Box 9103 P.O. Box 90153 NL-6500 HD Nijmegen, The Netherlands NL-5000 LE Tilburg,

More information

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ;

EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10. Instructor: Kang G. Shin, 4605 CSE, ; EECS 571 PRINCIPLES OF REAL-TIME COMPUTING Fall 10 Instructor: Kang G. Shin, 4605 CSE, 763-0391; kgshin@umich.edu Number of credit hours: 4 Class meeting time and room: Regular classes: MW 10:30am noon

More information

Grade 3: Module 2B: Unit 3: Lesson 10 Reviewing Conventions and Editing Peers Work

Grade 3: Module 2B: Unit 3: Lesson 10 Reviewing Conventions and Editing Peers Work Grade 3: Module 2B: Unit 3: Lesson 10 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. Exempt third-party content is indicated by the footer: (name

More information

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17. Semi-supervised methods of text processing, and an application to medical concept extraction Yacine Jernite Text-as-Data series September 17. 2015 What do we want from text? 1. Extract information 2. Link

More information

Multilingual Document Clustering: an Heuristic Approach Based on Cognate Named Entities

Multilingual Document Clustering: an Heuristic Approach Based on Cognate Named Entities Multilingual Document Clustering: an Heuristic Approach Based on Cognate Named Entities Soto Montalvo GAVAB Group URJC Raquel Martínez NLP&IR Group UNED Arantza Casillas Dpt. EE UPV-EHU Víctor Fresno GAVAB

More information

FY year and 3-year Cohort Default Rates by State and Level and Control of Institution

FY year and 3-year Cohort Default Rates by State and Level and Control of Institution Student Aid Policy Analysis FY2007 2-year and 3-year Cohort Default Rates by State and Level and Control of Institution Mark Kantrowitz Publisher of FinAid.org and FastWeb.com January 5, 2010 EXECUTIVE

More information

Remarks on Classifiers and Nominal Structure in East Asian

Remarks on Classifiers and Nominal Structure in East Asian Remarks on Classifiers and Nominal Structure in East Asian C.-T. James Huang and Masao Ochi LANGUAGE AND LINGUISTICS MONOGRAPH SERIES 54 Peaches and Plums Edited by C.-T. James Huang and Feng-hsi Liu Institute

More information

The taming of the data:

The taming of the data: The taming of the data: Using text mining in building a corpus for diachronic analysis Stefania Degaetano-Ortlieb, Hannah Kermes, Ashraf Khamis, Jörg Knappen, Noam Ordan and Elke Teich Background Big data

More information

STELLA TING-TOOMEY CURRICULUM VITAE

STELLA TING-TOOMEY CURRICULUM VITAE STELLA TING-TOOMEY CURRICULUM VITAE OFFICE: C.V. DATE: Department of Human Communication Studies California State University at Fullerton sting@fullerton.edu August 20, 2010 USA Tel: (657) 278-3691 EDUCATION

More information

Detecting Student Emotions in Computer-Enabled Classrooms

Detecting Student Emotions in Computer-Enabled Classrooms Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Detecting Student Emotions in Computer-Enabled Classrooms Nigel Bosch, Sidney K. D Mello University

More information

Sari locative noun classes Contents

Sari locative noun classes Contents Bep Langhout 46th Colloquium on African Languages and Linguistics SIL Cameroon 29 August - 31 August 2016 bep_langhout@sil.org Sari locative noun classes Contents (1) Introduction (2) Noun classes and

More information

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011

Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011 Detecting Wikipedia Vandalism using Machine Learning Notebook for PAN at CLEF 2011 Cristian-Alexandru Drăgușanu, Marina Cufliuc, Adrian Iftene UAIC: Faculty of Computer Science, Alexandru Ioan Cuza University,

More information

Humboldt-Universität zu Berlin

Humboldt-Universität zu Berlin Humboldt-Universität zu Berlin Department of Informatics Computer Science Education / Computer Science and Society Seminar Educational Data Mining Organisation Place: RUD 25, 3.101 Date: Wednesdays, 15:15

More information

French Dictionary: 1000 French Words Illustrated By Evelyn Goldsmith

French Dictionary: 1000 French Words Illustrated By Evelyn Goldsmith French Dictionary: 1000 French Words Illustrated By Evelyn Goldsmith If searching for the ebook French Dictionary: 1000 French Words Illustrated by Evelyn Goldsmith in pdf format, then you've come to correct

More information

Student Admissions, Outcomes, and Other Data

Student Admissions, Outcomes, and Other Data Student Admissions, Outcomes, and Other Data Data on Incoming Class UNL Clinical Psychology Training Program (CPTP) August Academic Year of Entry 7 8 9 Number of Applicants 9 7 8 8 8 Number Interviewed

More information

INTRODUCTION. 512 J. Acoust. Soc. Am. 105 (1), January /99/105(1)/512/10/$ Acoustical Society of America 512

INTRODUCTION. 512 J. Acoust. Soc. Am. 105 (1), January /99/105(1)/512/10/$ Acoustical Society of America 512 Language identification with suprasegmental cues: A study based on speech resynthesis Franck Ramus and Jacques Mehler Laboratoire de Sciences Cognitives et Psycholinguistique (EHESS/CNRS), 54 boulevard

More information

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1) Houghton Mifflin Reading Correlation to the Standards for English Language Arts (Grade1) 8.3 JOHNNY APPLESEED Biography TARGET SKILLS: 8.3 Johnny Appleseed Phonemic Awareness Phonics Comprehension Vocabulary

More information

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation SLSP-2016 October 11-12 Natalia Tomashenko 1,2,3 natalia.tomashenko@univ-lemans.fr Yuri Khokhlov 3 khokhlov@speechpro.com Yannick

More information

Rule Learning With Negation: Issues Regarding Effectiveness

Rule Learning With Negation: Issues Regarding Effectiveness Rule Learning With Negation: Issues Regarding Effectiveness S. Chua, F. Coenen, G. Malcolm University of Liverpool Department of Computer Science, Ashton Building, Ashton Street, L69 3BX Liverpool, United

More information

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio

Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio SCSUG Student Symposium 2016 Analyzing sentiments in tweets for Tesla Model 3 using SAS Enterprise Miner and SAS Sentiment Analysis Studio Praneth Guggilla, Tejaswi Jha, Goutam Chakraborty, Oklahoma State

More information

Identification of Opinion Leaders Using Text Mining Technique in Virtual Community

Identification of Opinion Leaders Using Text Mining Technique in Virtual Community Identification of Opinion Leaders Using Text Mining Technique in Virtual Community Chihli Hung Department of Information Management Chung Yuan Christian University Taiwan 32023, R.O.C. chihli@cycu.edu.tw

More information

Introduction, Organization Overview of NLP, Main Issues

Introduction, Organization Overview of NLP, Main Issues HG2051 Language and the Computer Computational Linguistics with Python Introduction, Organization Overview of NLP, Main Issues Francis Bond Division of Linguistics and Multilingual Studies http://www3.ntu.edu.sg/home/fcbond/

More information

Using the Web as a Bilingual Dictionary

Using the Web as a Bilingual Dictionary Using the Web as a Bilingual Dictionary Masaaki NAGATA NTT Cyber Space Laboratories 1-1 Hikarinooka, Yokoshuka-shi Kanagawa, 239-0847 Japan nagata@nttnly.isl.ntt.co.jp Teruka SAITO Chiba University 1-33

More information

Test Blueprint. Grade 3 Reading English Standards of Learning

Test Blueprint. Grade 3 Reading English Standards of Learning Test Blueprint Grade 3 Reading 2010 English Standards of Learning This revised test blueprint will be effective beginning with the spring 2017 test administration. Notice to Reader In accordance with the

More information

Let s think about how to multiply and divide fractions by fractions!

Let s think about how to multiply and divide fractions by fractions! Let s think about how to multiply and divide fractions by fractions! June 25, 2007 (Monday) Takehaya Attached Elementary School, Tokyo Gakugei University Grade 6, Class # 1 (21 boys, 20 girls) Instructor:

More information

Direct and Indirect Passives in East Asian. C.-T. James Huang Harvard University

Direct and Indirect Passives in East Asian. C.-T. James Huang Harvard University Direct and Indirect Passives in East Asian C.-T. James Huang Harvard University 8.20-22.2002 I. Direct and Indirect Passives (1) Direct (as in 2a) Passive Inclusive (as in 2b) Indirect Exclusive (Adversative,

More information

The NICT Translation System for IWSLT 2012

The NICT Translation System for IWSLT 2012 The NICT Translation System for IWSLT 2012 Andrew Finch Ohnmar Htun Eiichiro Sumita Multilingual Translation Group MASTAR Project National Institute of Information and Communications Technology Kyoto,

More information

Busuu The Mobile App. Review by Musa Nushi & Homa Jenabzadeh, Introduction. 30 TESL Reporter 49 (2), pp

Busuu The Mobile App. Review by Musa Nushi & Homa Jenabzadeh, Introduction. 30 TESL Reporter 49 (2), pp 30 TESL Reporter 49 (2), pp. 30 38 Busuu The Mobile App Review by Musa Nushi & Homa Jenabzadeh, Shahid Beheshti University, Tehran, Iran Introduction Technological innovations are changing the second language

More information

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny By the End of Year 8 All Essential words lists 1-7 290 words Commonly Misspelt Words-55 working out more complex, irregular, and/or ambiguous words by using strategies such as inferring the unknown from

More information

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming

Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming Data Mining VI 205 Rule discovery in Web-based educational systems using Grammar-Based Genetic Programming C. Romero, S. Ventura, C. Hervás & P. González Universidad de Córdoba, Campus Universitario de

More information

The role of the first language in foreign language learning. Paul Nation. The role of the first language in foreign language learning

The role of the first language in foreign language learning. Paul Nation. The role of the first language in foreign language learning 1 Article Title The role of the first language in foreign language learning Author Paul Nation Bio: Paul Nation teaches in the School of Linguistics and Applied Language Studies at Victoria University

More information

Residual Stacking of RNNs for Neural Machine Translation

Residual Stacking of RNNs for Neural Machine Translation Residual Stacking of RNNs for Neural Machine Translation Raphael Shu The University of Tokyo shu@nlab.ci.i.u-tokyo.ac.jp Akiva Miura Nara Institute of Science and Technology miura.akiba.lr9@is.naist.jp

More information

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se

More information

The Effects of Social Anxiety on English Language Learning in Japan R. A. Brown

The Effects of Social Anxiety on English Language Learning in Japan R. A. Brown The Effects of Social Anxiety on English Language Learning in Japan R. A. Brown Unlike typical North Americans (Alicke, 1985; Alicke, Klotz, Breitenbecher, Yurak, & Vredenburg, 1995) Japanese tend to be

More information