Fifth Workshop on Asian Language Resources (ALR-05) and First Symposium on


IJCNLP-05

Fifth Workshop on Asian Language Resources (ALR-05)
and
First Symposium on Asian Language Resources Network (ALRN)

Proceedings of the Workshop

14 October 2005
Jeju Island, Korea

2005 Asian Federation of Natural Language Processing

This workshop and symposium are supported by Special Coordination Funds for Promoting Science and Technology, Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan.

PREFACE

It is increasingly recognized that language resources, together with corpus-based, stochastic, and learning approaches, play an important role in Natural Language Processing (NLP) research. There have been several reports of success in constructing and using corpora along many dimensions. Reorganizing existing resources effectively into a unified framework and establishing guidelines for corpus development have become increasingly important, since both help in sharing resources and in coping with cross-language problems.

Motivated by this background, the 5th Workshop on Asian Language Resources (ALR) and the 1st Symposium on Asian Language Resources Network (ALRN) are organized under the auspices of the Asian Language Resources Committee of the Asian Federation of Natural Language Processing (AFNLP), in conjunction with IJCNLP 2005. The purposes of the workshop and symposium are as follows:

(1) To investigate the situation of Asian language resources, and to compile a catalog of the results of this investigation;
(2) To investigate and discuss the problems related to standards and specifications for creating various kinds of language resources;
(3) To promote communication between developers and users of various language resources, in order to close the gap between language resources and practical applications;
(4) To launch a roadmap for Asian language resources.

ALR-05 accepted 10 regular papers. We are confident that the papers selected for presentation are informative and offer much potential for further research. We hope to meet active researchers from around the world who work on Asian languages, and to promote research on linguistic resources and related fields. We are sure that the workshop and symposium will contribute fruitfully to the construction of a unifying architecture and mechanism for the development, management, and sharing of Asian language resources.

Our workshop and symposium would not have succeeded without the hard work of the program committee. We would also like to express our sincere thanks to the IJCNLP-05 organizing committee and the secretariat for their arrangements. Finally, we hope that all participants benefit greatly from the workshop and symposium and enjoy themselves.

Bo Xu (chair)
Chu-Ren Huang (co-chair)
Takenobu Tokunaga (co-chair)
Jun Zhao (co-chair)

PROGRAMME COMMITTEE

Bo Xu (chair)                  Chinese Academy of Sciences
Chu-Ren Huang (co-chair)       Academia Sinica
Takenobu Tokunaga (co-chair)   Tokyo Institute of Technology
Jun Zhao (co-chair)            Chinese Academy of Sciences
Nicoletta Calzolari            Istituto di Linguistica Computazionale del CNR
Baobao Chang                   Peking University
Shuichi Itahashi               National Institute of Advanced Industrial Science and Technology
Donghong Ji                    Institute for Infocomm Research, Singapore
Kiyong Lee                     Korea University
Qin Lu                         Polytechnic University of Hong Kong
Nguyen Thi Minh Huyen          Hanoi University of Sciences
Hae-Chang Rim                  Korea University
Kiyoaki Shirai                 Japan Advanced Institute of Science and Technology
Nashunwuritu                   Inner Mongolia University
Virach Sornlertlamvanich       Thai Computational Linguistics Laboratory, NICT
Maosong Sun                    Tsinghua University
Jane Tsay                      Chung-Cheng University
Hsiao-chuan Wang               Tsing Hua University
Elizabeth Zeitoun              Academia Sinica

PROGRAMME

Friday, October 14, 2005

Time   Event / Authors

 8:30  Registration
 9:00  Opening
 9:10  Keynote Speech
       Nicoletta Calzolari
 9:50  Break
10:00  Domain Knowledge Engineering Based on Encyclopedias and the Web Text
       Sui, Z., Cui, G., Ding, W., Zhang, Q.
10:20  Evaluation of a Japanese CFG Derived from a Syntactically Annotated Corpus with Respect to Dependency Measures
       Noro, T., Koike, C., Hashimoto, T., Tokunaga, T., Tanaka, H.
10:40  Corpus-oriented Acquisition of Chinese Grammar
       Zhang, Y., Kashioka, H.
11:00  Break
11:20  The Standard of Chinese Corpus Metadata
       He, T., Xu, X.
11:40  An Integrated Framework for Archiving, Processing and Developing Learning Materials for an Endangered Aboriginal Language in Taiwan
       Yang, M., Rau, D. V.
12:00  Construction of Structurally Annotated Spoken Dialogue Corpus
       Kato, S., Matsubara, S., Yamaguchi, Y., Kawaguchi, N.
12:20  Lunch
13:40  Cross-lingual Conversion of Lexical Semantic Relations: Building Parallel Wordnets
       Huang, C., Su, I., Hong, J., Li, X.
14:00  Taiwan Child Language Corpus: Data Collection and Annotation
       Tsay, J. S.
14:20  Question Classification using Multiple Classifiers
       Li, X., Huang, X., Wu, L.
14:40  Harvesting the Bitexts of the Laws of Hong Kong From the Web
       Kit, C., Liu, X., Sin, K., Webster, J.J.
15:00  Break
15:10  Symposium: Asian language resources: Infrastructure towards a multilingual language processing environment in Asia
16:50  Closing

Table of Contents

Preface .... i
Programme Committee .... ii
Programme .... iii

Domain Knowledge Engineering Based on Encyclopedias and the Web Text
    Zhifang Sui, Gaoying Cui, Wansong Ding and Qinlong Zhang .... 1

Evaluation of a Japanese CFG Derived from a Syntactically Annotated Corpus with Respect to Dependency Measures
    Tomaya Noro, Chimato Koike, Taiichi Hashimoto, Takenobu Tokunaga and Hozumi Tanaka .... 9

Corpus-oriented Acquisition of Chinese Grammar
    Yan Zhang and Hideki Kashioka .... 17

The Standard of Chinese Corpus Metadata
    Tingting He and Xiaoqi Xu .... 24

An Integrated Framework for Archiving, Processing and Developing Learning Materials for an Endangered Aboriginal Language in Taiwan
    Meng-Chien Yang and D. Victoria Rau .... 32

Construction of Structurally Annotated Spoken Dialogue Corpus
    Shingo Kato, Shigeki Matsubara, Yukiko Yamaguchi and Nobuo Kawaguchi .... 40

Cross-lingual Conversion of Lexical Semantic Relations: Building Parallel Wordnets
    Chu-Ren Huang, I-Li Su, Jia-Fei Hong and Xiang-Bing Li .... 48

Taiwan Child Language Corpus: Data Collection and Annotation
    Jane S. Tsay .... 56

Question Classification using Multiple Classifiers
    Xin Li, Xuan-jing Huang and Li-de Wu .... 64

Harvesting the Bitexts of the Laws of Hong Kong From the Web
    Chunyu Kit, Xiaoyue Liu, KingKui Sin and Jonathan J. Webster .... 71

Author Index .... 79