Converting a Bilingual Dictionary into a Bilingual Knowledge Bank based on the Synchronous SSTC

Similar documents
Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

11/29/2010. Statistical Parsing. Statistical Parsing. Simple PCFG for ATIS English. Syntactic Disambiguation

Developing a TT-MCTAG for German with an RCG-based Parser

Linking Task: Identifying authors and book titles in verbose queries

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Chinese Language Parsing with Maximum-Entropy-Inspired Parser

Chunk Parsing for Base Noun Phrases using Regular Expressions. Let s first let the variable s0 be the sentence tree of the first sentence.

An Interactive Intelligent Language Tutor Over The Internet

Accurate Unlexicalized Parsing for Modern Hebrew

Cross Language Information Retrieval

A Case Study: News Classification Based on Term Frequency

The Interface between Phrasal and Functional Constraints

Basic Parsing with Context-Free Grammars. Some slides adapted from Julia Hirschberg and Dan Jurafsky 1

Context Free Grammars. Many slides from Michael Collins

Prediction of Maximal Projection for Semantic Role Labeling

Annotation Projection for Discourse Connectives

Exploiting Phrasal Lexica and Additional Morpho-syntactic Language Resources for Statistical Machine Translation with Scarce Training Data

Abstractions and the Brain

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

Semi-supervised methods of text processing, and an application to medical concept extraction. Yacine Jernite Text-as-Data series September 17.

Disambiguation of Thai Personal Name from Online News Articles

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

A heuristic framework for pivot-based bilingual dictionary induction

CS 598 Natural Language Processing

Parsing of part-of-speech tagged Assamese Texts

Ensemble Technique Utilization for Indonesian Dependency Parser

EdIt: A Broad-Coverage Grammar Checker Using Pattern Grammar

Towards a MWE-driven A* parsing with LTAGs [WG2,WG3]

Translating Collocations for Use in Bilingual Lexicons

UNIVERSITY OF OSLO Department of Informatics. Dialog Act Recognition using Dependency Features. Master s thesis. Sindre Wetjen

AQUA: An Ontology-Driven Question Answering System

DEVELOPMENT OF A MULTILINGUAL PARALLEL CORPUS AND A PART-OF-SPEECH TAGGER FOR AFRIKAANS

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

On-Line Data Analytics

Grammars & Parsing, Part 1:

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

University of Alberta. Large-Scale Semi-Supervised Learning for Natural Language Processing. Shane Bergsma

Constructing Parallel Corpus from Movie Subtitles

An Efficient Implementation of a New POP Model

LTAG-spinal and the Treebank

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

Some Principles of Automated Natural Language Information Extraction

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

BYLINE [Heng Ji, Computer Science Department, New York University,

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

Towards a Machine-Learning Architecture for Lexical Functional Grammar Parsing. Grzegorz Chrupa la

Proof Theory for Syntacticians

Twitter Sentiment Classification on Sanders Data using Hybrid Approach

Radius STEM Readiness TM

Learning Methods in Multilingual Speech Recognition

The stages of event extraction

Natural Language Processing. George Konidaris

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

Applications of memory-based natural language processing

SEMAFOR: Frame Argument Resolution with Log-Linear Models

arxiv: v1 [cs.cl] 2 Apr 2017

Cross-Lingual Dependency Parsing with Universal Dependencies and Predicted PoS Labels

Multimedia Application Effective Support of Education

MULTILINGUAL INFORMATION ACCESS IN DIGITAL LIBRARY

Learning Structural Correspondences Across Different Linguistic Domains with Synchronous Neural Language Models

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

METHODS FOR EXTRACTING AND CLASSIFYING PAIRS OF COGNATES AND FALSE FRIENDS

THE VERB ARGUMENT BROWSER

Learning Computational Grammars

NCU IISR English-Korean and English-Chinese Named Entity Transliteration Using Different Grapheme Segmentation Approaches

THE ROLE OF DECISION TREES IN NATURAL LANGUAGE PROCESSING

A relational approach to translation

Modeling Attachment Decisions with a Probabilistic Parser: The Case of Head Final Structures

A Framework for Customizable Generation of Hypertext Presentations

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Machine Learning from Garden Path Sentences: The Application of Computational Linguistics

Hyperedge Replacement and Nonprojective Dependency Structures

Word Sense Disambiguation

Ontologies vs. classification systems

A Neural Network GUI Tested on Text-To-Phoneme Mapping

1. Introduction. 2. The OMBI database editor

Measuring the relative compositionality of verb-noun (V-N) collocations by integrating features

OVERVIEW & CLASSIFICATION OF WEB-BASED EDUCATION (SYSTEMS, TOOLS & PRACTICES)

PH.D. IN COMPUTER SCIENCE PROGRAM (POST M.S.)

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011

Citation for published version (APA): Veenstra, M. J. A. (1998). Formalizing the minimalist program Groningen: s.n.

Detecting English-French Cognates Using Orthographic Edit Distance

The Smart/Empire TIPSTER IR System

Product Feature-based Ratings foropinionsummarization of E-Commerce Feedback Comments

A Coding System for Dynamic Topic Analysis: A Computer-Mediated Discourse Analysis Technique

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Noisy SMS Machine Translation in Low-Density Languages

Role of Pausing in Text-to-Speech Synthesis for Simultaneous Interpretation

Visual CP Representation of Knowledge

Three New Probabilistic Models. Jason M. Eisner. CIS Department, University of Pennsylvania. 200 S. 33rd St., Philadelphia, PA , USA

GACE Computer Science Assessment Test at a Glance

The Discourse Anaphoric Properties of Connectives

Deploying Agile Practices in Organizations: A Case Study

A General Class of Noncontext Free Grammars Generating Context Free Languages

Achim Stein: Diachronic Corpora Aston Corpus Summer School 2011

Analysis of Probabilistic Parsing in NLP

Efficient Normal-Form Parsing for Combinatory Categorial Grammar

Multilingual Sentiment and Subjectivity Analysis

Transcription:

Converting a Bilingual Dictionary into a Bilingual Knowledge Bank based on the ynchronous TC Tang Enya Kong, Mosleh H. Al-Adhaileh Computer Aided Translation Unit chool of Computer ciences Universiti ains sia 11800 PENANG, MALAYIA {enyakong, mosleh}@cs.usm.my Abstract In this paper, we would like to present an approach to construct a huge Bilingual Knowledge Bank (BKB) from an bilingual dictionary based on the idea of synchronous tructured tring- Correspondence (TC). The TC is a general structure that can associate an arbitrary tree structure to string in a language as desired by the annotator to be the interpretation structure of the string, and more importantly is the facility to specify the correspondence between the string and the associated tree which can be nonprojective. With this structure, we are able to match linguistic units at different inter levels of the structure (i.e. define the correspondence between substrings in the sentence, nodes in the tree, subtrees in the tree and sub-correspondences in the TC). This flexibility makes synchronous TC very well suited for the construction of a Bilingual Knowledge Bank we need for the - MT application. Keywords tructured tring- Correspondence (TC), ynchronous TC, Bilingual Knowledge Bank (BKB), EBMT. Introduction Recently, much effort was devoted to the compilation of the bilingual corpora for the purpose of machine translation. There is a strong argument that a bilingual corpus, when appropriately structured, can largely replace conventional dictionaries and grammar rules in machine translation. With this objective in mind, we propose, in this paper, an approach to construct a Bilingual Knowledge Bank (BKB) from a bilingual corpora consisting of translation pairs extracted from a given bilingual dictionary. In our approach, we introduce a flexible annotation schema called synchronous tructured tring- Correspondence (TC), which will be used as the basic structure to annotate translation pairs in the bilingual knowledge bank. The TC is a general structure that can associate an arbitrary tree structure to string in a language as desired by the annotator to be the interpretation structure of the string, and more importantly is the facility to specify the correspondence between the string and the associated tree which can be non-projective (Boitet & Zaharin, 1988). The flexibility in the mapping from source to target languages, using synchronous TC, makes possible to state direct correspondences without a mediating interlingual representation. By doing this, we are able to match linguistic units at different inter levels of the structure (i.e. define the correspondence between substrings in the sentence, nodes in the tree, subtrees in the tree and sub-correspondences in the TC). This flexibility makes synchronous TC very well suited for the construction of a Bilingual Knowledge Bank we need for the - MT application. In this paper, we will propose an approach to construct a huge BKB by incorporating some of the existing tools in the annotation process. First, word alignment tools that have been proven their efficiency on other pairs of languages such as Melamed (1997; 1999; 2000) will be adapted to perform - word alignment. Each sentence in the aligned translation pairs will then be annotated with part of speech (PO) and phrase structure tree produced by the Apple Pie Parser (APP) for. The annotated sentences will then be compiled into an TC structure. Next, the TC structure of each sentence will be generated based on the corresponding TC structure and the alignment mapping. Finally, the resultant pair of and TCs will be edited semi-automatically to obtain a synchronous TC, which is the basic element of BKB. Bitext Mapping and Word Alignment In our proposed approach, - translation pairs, which are extracted from a bilingual dictionary, are the main source of data. The first step in establishing useful information from these translation pairs is to find corresponding words and terms in them (i.e. bitext mapping and word alignment). To achieve this, bitext alignment tools that have been proven their efficiency on other pairs of languages will be adapted to perform - language pair. Here, IMR (mooth Injective Map Recognizer), a generic pattern recognition algorithm is used to identify word alignment between a translation pair. IMR exploits the correlation between the lengths of mutual translations. Like the char-align (Church, 1993), IMR infers bitext maps from likely points of correspondence between the two texts, points that are plotted in a two-dimensional space of possibilities. Unlike other methods, IMR greedily searches for only a small chain of correspondence points at a time. For more details on IMR algorithm, see (Melamed, 1997; 1999). Melamed (2000) presented some models of translation equivalence among words, which can automatically produce dictionary-sized translation lexicons with over 99% accuracy. These models can be used to perform word alignment on our translation pairs. Figure 1 gives an example to illustrate the output from the word alignment process.

Translation Pair IMR The basic idea of example - based parsing is very simple. Idea asas bagi penghuraian berasaskan contoh adalah mudah. Word alignment Translexicon The basic idea of example - based parsing is very simple Idea asas bagi penghuraian berasaskan contoh adalah mudah Figure 1: Example outputs of the Alignment processes. The Construction of BKB based on ynchronous TC In Example-Based Machine Translation system (ato, 1991), the use of Bilingual Knowledge Bank (BKB) containing the bilingual parallel texts encoded with correspondences between the source and the target sentences is quite popular in implementing such EBMT systems. entences in the BKB are normally annotated with their constituency or dependency structures (adler & Vendelmans, 1990); which in turn allow the correspondences to be established at the structural level. Here, to facilitate such structural annotation, we use the tructured tring- Correspondence (TC) to annotate the examples in our BKB. Furthermore, the TC structure can easily be extended to keep multiple levels of linguistic information, if they are considered important to enhance the performance of the machine translation system. For instance, in our case here, each node representing a word in the annotated tree structure is tagged with part of speech (PO). In this section, we shall first introduce the concept of TC. It followed by the description of a bitext synchronous parsing technique used to generate both the and TCs for a given aligned translation pair. Finally, we show how the resultant pair of and TCs can be edited semi-automatically to obtain a synchronous TC which is the basic element of BKB. tructured tring- Correspondence (TC) The TC is a general structure that can associate an arbitrary tree structure to string in a language as desired by the annotator to be the interpretation structure of the string, and more importantly is the facility to specify the correspondence between the string and the associated tree which can be non-projective (Boitet & Zaharin, 1988). These features are very much desired in the design of an annotation scheme, in particular for the treatment of linguistic phenomena, which are non-standard, e.g. crossed dependencies (Tang & Zaharin, 1995). ( 1-2+4-5 /0-5) (3-4/2-4) tring John picks the ball up 0-1 1-2 1-2 2-3 3-4 4-5 (1-2+4-5/ 0-5 ) (3-4/2-4) tring John John picks picks the the ball ball up up 0-1 1-2 2-3 3-4 4-5 (1-2+4-5/0-5) ( 3-4 /2-4) tring John picks the ball up 0-1 1-2 2-3 3-4 4-5 1-2+4-5 picks (1-2+4-5/0-5) 0-5 up 4-5 0-1 1-2 2-3 3-4 4-5 3-4 ball (3-4/ 2-4 ) 2-4 tring the ball John picks the ball up 0-1 1-2 2-3 3-4 3-4 4-5 Figure 2: An TC recording the sentence John picks the ball up and its dependency tree together with the correspondences between substrings of the sentence and subtrees of the tree.

In the TC, the correspondence between the sentence on one hand, and its representation tree on the other hand, is defined in terms of finer sub-correspondences between substrings of the sentence and subtrees of the tree. uch correspondence is made of two interrelated correspondences, one between nodes and substrings, and the other between subtrees and substrings, (the substrings being possibly discontinuous in both cases). It can be treated as an extended chart structure (Kay, 1973; 1980), which is capable of handling non-projective correspondences between the string and its representation tree. The notation used in TC to denote a correspondence consists of a pair of intervals X/Y attached to each node in the tree, where X(NODE) denotes the interval containing the substring that corresponds to the node, and Y(TREE) denotes the interval containing the substring that corresponds to the subtree having the node as root (Boitet & Zaharin, 1988). Figure 2 illustrates the sentence John picks the ball up with its corresponding TC. It contains a nonprojective correspondence. An interval is assigned to each word in the sentence, i.e. (0-1) for John, (1-2) for picks, (2-3) for the", (3-4) for ball and (4-5) for up. A substring in the sentence that corresponds to a node in the representation tree is denoted by assigning the interval of the substring to NODE of the node, e.g. the node picks up with NODE intervals (1-2+4-5) corresponds to the words picks and "up" in the string with the similar intervals, the node ball with NODE interval (3-4) corresponds to the word ball in the string with the similar interval. The correspondence between subtrees and substrings are denoted by the interval assigned to the TREE of each node, e.g. the subtree rooted at node picks up with TREE interval (0-5) corresponds to the whole sentence John picks the ball up, the subtree rooted at node ball with TREE interval (2-4) corresponds to the phrase the ball in the string. ynchronous Parsing Technique Here we describe how to construct the TC for the sentence by mean of a synchronous parsing technique. The basic idea is to automatically generate the TC for the sentence through the use of existing parser. As no parser is currently available for, we propose a synchronous parsing technique to parse the sentence based on the sentence parse tree together with the alignment result obtained from the alignment algorithms as described earlier. The merit of this proposed technique is to use the output of the parser in one language (e.g. ), which can achieve a good result to parse another language (e.g. ). The following steps describe the synchronous parsing process: The basic idea of example - based parsing is very simple Idea asas bagi penghuraian berasaskan contoh adalah mudah (The alignment between a pair of and sentences obtained from the alignment step) - sentence parsing: After the text is being aligned at different levels (i.e. phrase, word), each sentence is passed to a parser. Any available parser may be used to parse the sentence. In our case, we choose the Apple Pie Parser (APP) (ekine, 1996) according to the availability. The parsing result of APP is a partial phrase structure tree with simple noun phrases being treated as a single node in the parse tree. The parse tree of the example sentence is as given below. ( ( (L The basic idea) (PP of (L example -based parsing))) ( is (ADJP very simple))) - sentence TC construction: In order to obtain the sentence TC structure, we need to compute the string-tree correspondences (Tang, 1994) between the sentence and the parse tree as represented by the TC structure illustrated in Figure 3 below. (Ø/0-3) The basic idea (0-3/0-3) tring (Ø/0-8) of (3-4/3-4) (Ø/0-11) (Ø/3-8) (Ø/4-8) is Example-based parsing (4-8/4-8) (Ø/8-11) (Ø/9-11) Very simple (9-11/9-11) 0the 1basic 2idea 3of 4example 5-6based 7parsing 8is 9 very 10simple 11 Figure 3: An TC for the sentence the basic idea of example-based parsing is very simple. - Lexical transfer: In this process, a duplicate copy of the TC created above is generated to be the basic structure for TC. First, the sentence is replaced by the sentence. It followed by the replacement of all word in the TC structure by its corresponding word obtained from the alignment step. In the case of a node containing more than one word, the words will be rearranged according to their order in the sentence. Note that the node represented by an word which has no equivalent will be deleted. imilarly, word in the node representing a phrase which has no equivalent will also be deleted. Figure 4 illustrates the TC structure for the sentence after lexical transfer.

(Ø/0-2) Idea asas bagi (0-2/0-2) tring (Ø/0-7) (Ø/0-9) (Ø/2-7) (Ø/3-7) adalah (7-8/7-8) Penghuraian berasaskan - contoh (3-7/3-7) (Ø/7-9) (Ø/8-9) mudah 0Idea 1 asas 2 bagi 3 penghuraian 4 berasaskan 5 6 contoh 7 adalah 8 mudah 9 Figure 4: An TC construction for the sentence idea asas bagi penghuraian berasaskan-contoh adalah mudah after lexical transfer. ynchronization of TC In this process, the resultant pair of and TCs will be edited semi-automatically to obtain a synchronous TC which is the basic element of BKB. Based on the notations used in the TC, the translation units between the and the TCs can be constructed in terms of TREE pairs (for phrases) and NODE pairs (for words) (Tang, 1996). For instance, as illustrated by the synchronous TC given in Figure 5, the fact that "very simple" is translated to "mudah" is expressed by (9-11,8-9) under the index NODE of the translation units. Whereas, the fact that "is very simple" is translated to "adalah mudah" is expressed by (8-11,7-9) under the index TREE of the translation units. Note that this approach is quite similar to the synchronous -Adjoining Grammar presented in (hieber & chabes, 1990). The main difference between our approach and the synchronous TAG is the flexibility provided by the TC in the treatment of some linguistic phenomena, which are non-standard (Tang & Zaharin, 1995). This flexibility provided by the TC is very much desired in establishing translation units between source and target substrings, which is possibly discontinuous in both cases. In case the representation of synchronous TCs generated need further editing, a synchronous TC editor as illustrated in Figure 6 can be used to perform the necessary amendment. Figure 7 gives an overall picture of the processes involved in the construction of a BKB from a given bilingual dictionary. E ENGLIH (Ø/0-11) M MALAY (Ø/0-9) (Ø/0-8) (Ø/8-11) (Ø/0-7) (Ø/7-9) (Ø/0-3) (Ø/3-8) is (Ø/9-11) (Ø/0-2) (Ø/2-7) adalah (7-8/7-8) (Ø/8-9) The basic idea (0-3/0-3) of (3-4/3-4) (Ø/4-8) Example-based parsing (4-8/4-8) Very simple (9-11/9-11) 0the 1basic 2idea 3of 4example 5-6based 7parsing 8is 9 very 10simple 11 Idea asas bagi (0-2/0-2) (Ø/3-7) Penghuraian berasaskan - contoh (3-7/3-7) mudah 0Idea 1 asas 2 bagi 3 penghuraian 4 berasaskan 5 6 contoh 7 adalah 8 mudah 9 Translation Units Index node Index tree {(0-3),(0-2)} {(3-4),(2-3)} {(4-8),(3-7)} {(8-9),(7-8)} {(9-11),(8-9)} {(0-3),(0-2)} {(3-4),(2-3)} {(4-8),(3-7)} {(3-8),(2-7)} {(0-8),(0-7)} {(8-9),(7-8)} {(9-11),(8-9)} {(8-11),(7-9)} {(0-11),(0-9)} Figure 5: Example synchronous TC for the sentence the basic idea of examplebased parsing is very simple and the sentence idea asas bagi penghuraian berasaskan-contoh adalah mudah together with their translation units.

File Edit Correspondences Windows (Ø/0-11) (Ø/0-9) (Ø/0-8) (Ø/8-11) (Ø/0-7) (Ø/7-9) (Ø/0-3) (Ø/3-8) is (Ø/9-11) (Ø/0-2) (Ø/2-7) adalah (7-8/7-8) (Ø/8-9) The basic idea (0-3/0-3) of (3-4/3-4) (Ø/4-8) Example-based parsing (4-8/4-8) Very simple (9-11/9-11) Idea asas (0-2/0-2) bagi (Ø/3-7) penghuraian berasaskan-contoh (3-7/3-7) mudah 0the 1 basic 2 idea 3 of 4 example 5 6 based 7 parsing 8 is 9 very 10 simple 11 0Idea 1 asas 2 bagi 3 penghuraian 4 berasaskan 5 6 contoh 7 adalah 8 mudah 9 Figure 6: The synchronous TC editor. Bilingual dictionary Lexicon Phrase level Parsing & PO Tagging for the sentence Translation examples Alignment Process word level Apple Pie Parser ( (. (..(..))) Example-Based MT TC Editor ( ( (..(..))) Compile the APP output into TC for the sentence ynchronous TC BKB EDITING Example-Based Parser LEARNING from past Experience Build the TC for sentence based on the TC for the sentence using the alignment mapping Figure 7: The construction of the BKB from a bilingual dictionary based on the synchronous TC.

Conclusion In this paper, we described an approach to construct a Bilingual Knowledge bank (BKB) from a given bilingual dictionary. We introduced a flexible annotation schema called synchronous tructured tring- Correspondence (TC), which has been used to annotate translation examples in the BKB. The flexibility in the mapping from to sentences, using synchronous TC, makes possible to state direct correspondences without a mediating interlingual representation. By doing this, we are able to match linguistic units at different inter levels of the structure (i.e. define the correspondence between substrings in the sentence, nodes in the tree, subtrees in the tree and subcorrespondences in the TC). We also have proposed a synchronous parsing technique to parse the sentence based on the sentence parse tree together with the alignment result obtained from the alignment algorithms. A graphic editor for the synchronous TC (complete with syntax verification) has been implemented. o far the BKB constructed from the bilingual dictionary (i.e. Kamus Inggeris Melayu Dewan (KIMD)) contains 30,000 translation pairs. Finally the constructed BKB (see Figure 7) can be used as an example-base for the EBMT (Al-Adhaileh & Tang, 1999). From the BKB, we can also derive an example-base parser for which is very much needed for language processing (Al-Adhaileh & Tang, 1998). adler, V. and Vendelmans, R. (1990). Pilot implementation of a bilingual knowledge bank. In Proceedings of COLING-90, 3, Helsinki, Fenland. ato,. (1991). Example-Based Machine Translation. Ph.D. thesis, Kyoto University, Japan. ekine,. (1996). Apple Pie Parser. http:// cs.nyu.edu/ cs/ projects/ proteus/ app/. hieber,.m. and chabes, Y. (1990). ynchronous - Adjoining Grammars. In Proceedings of COLING-90, 3, Helsinki, Fenland. Tang E.K. (1994), Natural Language Analysis In Machine Translation (MT) Based On The tring- Correspondence Grammar (TCG), Dissertation submitted in fulfillment of the Ph.D., Universiti ains sia, Penang, sia. Tang,E.K.(1996).Interactive Disambiguation in Multilevel Parallel Texts Alignment towards the construction of a Bilingual Knowledge Bank. In Proceedings of MIDDIM-96, Post-COLING seminar on Interactive Disambiguation, Ch. Boitet (ed), pp. 101-106. Tang, E.K. and Zaharin, Y. (1995). Handling Crossed Dependencies with the TCG. In Proceedings of NLPR 95, eoul, Korea. References Al-Adhaileh, M.H. and Tang, E.K. (1998). A Flexible Example-Based Parser Based on the TC. In Proceedings. of COLING-ACL'98, Vol. I, Montreal, Canada. Al-Adhaileh, M.H. and Tang, E.K. (1999). Example- Based Machine Translation Based on the ynchronous TC Annotation chema. In Proceedings of MT-VII (Machine Translation UMMIT VII). ingapore. Boitet, C. and Zaharin, Y. (1988). Representation trees and string-tree correspondences. In Proceedings of COLING-88, Budapest. Hungary. Church, K. (1993). Char_align: a program for aligning parallel texts at the character level. In Proceedings of ACL93, Ohio. Kay, M. (1973). The MIND system. In R. Rustin (Eds), Natural Language Processing. New York: Algorithmics Press. Kay, M. (1980). Algorithm schemata and data structures in syntactic processing. CL-80-12, Xerox Corporation. Reprinted in RNLP. Melamed, I.D. (1997). A portable algorithm for mapping bitext correspondence. In Proceedings of ACL35/EACL8. Melamed I.D. (1999). Bitext Maps and Alignment via Pattern Recognition, Computational Linguistics 25(1), 107-130, March. Melamed, I.D. (2000). Models of Translational Equivalence among Words, Computational Linguistics 26(2), 221-249, June.