A First-Pass Approach for Evaluating Machine Translation Systems

Size: px
Start display at page:

Download "A First-Pass Approach for Evaluating Machine Translation Systems"

Transcription

1 [Proceedings of the Evaluators Forum, April 21st 24th, 1991, Les Rasses, Vaud, Switzerland; ed. Kirsten Falkedal (Geneva: ISSCO).] A First-Pass Approach for Evaluating Machine Translation Systems Pamela W. Jordan Mitre Corporation C 3 I Artificial Intelligence Center 1 Introduction The MITRE Corp. 1 is in the midst of surveying and evaluating machine translation systems across the U.S. and, to a lesser extent, in Europe and Japan. The intent of the study is to recommend software purchases and R&D support that would meet the near-term, mid-term and long-term requirements of the users. Initially we identified over 20 machine translation efforts in the U.S. alone that we should investigate. Since it is too costly to do in-depth evaluations of so many MT efforts, we decided to gather just enough information to narrow down the possibilities. Once the best-fits according to our user s requirements are identified, deeper evaluations can be done on this smaller set. Planning and conducting the in-depth evaluations will take place at a later time. What is described here is our filter approach for narrowing down the possibilities and our assessment of its success to date. The evaluation plan we formulated was to first predict what the most stringent requirements would be since, in our case, the requirements analysis time-frame overlapped with that of the survey. These requirements would determine what information was needed for the evaluation. Once the requirements analysis was complete, then the MT systems and research projects would be evaluated according to the projected near-term, mid-term and longterm needs of the users. The evaluation data collected would be prioritized and weighted based on these requirements so that all the relevant data could be combined to determine how well each system met the user's needs. The requirements upon which we based our evaluation criteria were whether the MT systems and research projects provided the necessary functionality, whether the vendor s or research group s parent organization was stable enough financially so that we could reasonably expect them to continue their 1 MITRE is an independent, not-for-profit organization that provides technical assistance, systems engineering, and acquisition support to U.S. government agencies 1

2 work and support the user, whether the system would be a good fit for the user s current and future concept of operations, whether it could be upgraded and maintained at reasonable costs, and whether it performed well enough to increase translation throughput. Figure 1 shows the mapping of these five broad requirements categories to the evaluation criteria we selected. Figure 1: Mapping User Requirements to Evaluation Criteria 2

3 In describing the evaluation plan, I first discuss some of the current techniques for evaluating general software systems and natural language analysis systems, both of which are applicable to the MT evaluation problem, and I discuss current techniques and ideas for evaluating MT systems. Included in this discussion of evaluation techniques are the shortcomings of these approaches and an explanation of why it is so difficult to evaluate MT software. Next I describe our approach to the evaluation. This includes a discussion of the information to be collected for each of the evaluation criteria, as well as how this information is expected to contribute to the evaluation of the software for the near-term, mid-term and long-term requirements. 2 Evaluation Techniques There has been a recent increase of interest in evaluating natural language systems, whether they are for translation, database stuffing or any number of other possible NL-related applications. This surge of interest is due to the growing popular opinion that natural language research has progressed enough to start transitioning some of it into applications software. Although the interest in evaluating MT software dates back to at least the ALPAC report (ALP66), the recent interest in evaluating other NL applications has sparked new discussion, techniques and insights for MT evaluation as well. The approach one takes when evaluating software systems (in general) is two-fold: (1) evaluation of the accuracy of the input/output pairs; and (2) evaluation of the architecture of the system and the data flow between the system components. The former (external) view of software evaluation is equivalent to what NL researchers call black-box evaluation, and the latter (internal) view is referred to as glass-box evaluation (PF90). Black-box evaluation covers engineering issues such as reliability, productivity, learnability and likability (user friendliness). Glass-box evaluation also considers reliability (at the component-level) as well as maintainability, improvability, extendibility, compatibility and portability. (The engineering issues listed here are discussed in (GF88)). NL applications have some unique problems that must be accounted for when doing black-box and glass-box evaluations. The biggest problem with evaluating NL applications is eliminating the subjectivity that, to date, has proven unavoidable due to the nature of natural language itself. Standard software engineering techniques for software evaluation apply to these NL applications, but these standard techniques must be enhanced to deal with the multiple correct answers that frequently occur with natural language. It is not clear what constitutes a correct answer especially when dealing with translations. Because of this, judging the correctness of the output for MT requires a degree of subjectivity. The output features to judge according to 3

4 Carbonell, et al. (CCG81), are semantic invariance (preservation of meaning), pragmatic invariance (preservation of purpose), structural invariance (preservation of syntax), lexical invariance (preservation of word senses) and spatial invariance (preservation of text format). Preserving the meaning of the source text is one of the predominant evaluation criteria used today in the MT research community. Black-box evaluation, in the case of MT, tends to focus on evaluating the translation-quality of the output. Essentially it is an attempt to measure the acceptability of the translation to users. To produce the most objective measure possible, a standard test suite of input/output pairs should be established for judging whether the system is performing correctly or not and whether it will be cost effective. In light of the above discussion, this is a very costly undertaking and has yet to be satisfactorily accomplished in any evaluation of an MT system. Another difficulty in developing black-box test suites is caused by the number of dimensions along which MT developers must limit their systems. These systems can be thought of as shells that are customized to apply to a particular domain, language pair, and type of text. To demonstrate these systems, the developers and researchers typically customize them to meet the needs of their largest projected market or to test the validity of a research hypothesis. The test suites must also be limited along the same dimensions, but then there is no common range among the systems. Because of this lack of commonality, some systems will need to be customized for the chosen ranges of the test suite no matter what ranges are selected. There is not much incentive to customize for evaluation purposes since a customer is typically expected to pay for customizations. The glass-box approach attempts to evaluate the system's internal processing strategies to measure how well the system does something. According to the ideas for evaluating NLP systems (PF90), this type of evaluation should include a determination of the system s linguistic coverage, and an examination of the linguistic theories used to handle the linguistic phenomena. Determining the linguistic coverage means testing what linguistic phenomena are handled and to what degree. The examination of the linguistic theories used includes how closely these theories were followed in the implementation and noting what modifications had to be made to the theories. In addition, one should look at the performance of the system s various modules. The evaluation of each of these modules should be treated as individual black-box evaluations. Again, test suites are often proposed as a way to determine a system s linguistic coverage. The difficulty here is the interaction between different linguistic phenomena (KF90). When creating the test suites, one must attempt to eliminate the interactions and test the smallest possible number of phenomena at one time. However, minimizing the interactions is difficult 4

5 and the test inputs grow quickly. To bound the problem, the test-suite developers must know what linguistic phenomena are of greatest importance to the users and be well-versed in linguistics and the languages of interest (KF90). Test suites have also been proposed by (KF90) as a way to test the improvability of an MT system. Improvability tests assume that either the evaluator is working closely with the developer or that the evaluator is able to modify the system himself. The caveats mentioned earlier on bounding the problem, apply here as well. 3 Approach to Evaluation Although we support the basic idea of black-box and glass-box evaluation that is being pursued for NLP systems, this survey was of such a short time frame that test suites could not be built nor could customized tests be performed. Our approach to evaluation (given the time restrictions) was to interview developers, researchers and current users of MT, attend MT demonstrations, survey the literature for additional details about the software, collect (for further evaluation) sample inputs and outputs for each language the software handles, and (whenever possible) arrange for evaluation copies of the software to be given to potential users. A detailed questionnaire was developed to guide information collection during interviews, demonstrations and while reading the literature. Two questionnaires were developed; one directed at users and one directed at researchers and vendors. Both are found in Appendix 4. In the next five subsections, I discuss the information we collected as it relates to the five requirements categories and our rationale for collecting this information. In addition, I will discuss how this information is expected to contribute to determining how well a particular system meets the requirements for each of the three time-frames. 3.1 Functional Capabilities As mentioned earlier, to realistically tackle the language translation problem, developers of MT systems inherently impose limits along several dimensions. These dimensions include the domain, number of languages translated and types of text handled. This information is important in judging whether a system will cost effectively handle a particular user-group's near-term requirements. In addition, for near-term considerations, it is useful to know the sort of user expected in terms of skill level and the types of tasks for which the system was designed. This information allows us to judge how 5

6 well a system will fit into the user-group s operational needs. All of this information was gathered directly from the system developers. 3.2 Product Stability An important factor in deciding whether to invest in research or in a product, is projecting whether the supporting organization is going to be available in the long-term to support their product or complete their research. One consideration in projecting the longevity of a group is the market reaction to any products they have produced. Another important factor in predicting the success of the group is whether they are qualified to do the work. Background of Organization To get an idea of the group s MT credentials we wanted to know the group s technological heritage, the amount of research they planned to do and whether they planned to transition any of their research work. We were also concerned about the stability of the MT work and the stability of the infrastructure supporting this work. We needed to be reasonably satisfied that the group would be able to provide technical support in the long-term and that they could follow through in transitioning any research efforts. And finally, we wanted to know their goals and schedules for machine translation research and development. System Availability & Market Reactions Referring back to cost effectiveness, another factor is the actual cost of the system and how much it typically costs to customize the system for languages, domains and text styles. We collected this information by asking the system developers to supply us with their cost data. We also wanted to know whether the system had been popular with those needing translation services. To this end we asked how many systems were in use and for information about their customers so that we could contact them. 3.3 Concept of Operations Fit How well a system will fit in with the user s concept of operations depends on the concept of operations anticipated by the developers and on the operational requirements of the MT system. Concept of Operations As mentioned in the functional capabilities section, we needed to know the anticipated external flow of the translation process. To judge how well a system would fit into the user-group's operational needs, we asked the developers what tools they provided for getting the text into the system, and for manipulating the source and target languages. We 6

7 asked to see demonstrations of these tools so that we could judge whether they appeared easy to use or if some specialized system knowledge was required to successfully use them. Also, we needed to know what was required of the user whenever the system encountered text it could not translate. The concept of operations information was gathered by questioning the developers and from seeing demonstrations of the system. System Implementation To match against user requirements for operational environments, we had to know what the system s hardware and software requirements were. Preferably the hardware and software would be something the user already had available. Otherwise, the missing hardware or software should not be prohibitively expensive. 3.4 Ease of Upgrading and Maintaining To ascertain the ease with which the software could be maintained and upgraded, we examined how it was implemented and the theoretical foundations upon which it was based. Because we could not build test suites that measure improvability (as described in section 4.1) or conduct formal tests at that time, this part of the evaluation is entirely subjective. As mentioned earlier, King and Falkedal reported an approach in COLING 90 (KF90) for an evaluation based on test suites that will be considered in the future when more time and resources can be allocated. System Implementation We needed to judge how well the system was implemented to assure that maintenance and system enhancements would not cause problems later in the system s life-cycle. A well-implemented MT system should be designed as a shell, so that it can be readily customized for new domains, language pairs and text styles. In addition there should be tools to aid in customizing and modifying these areas. These tools were noted along with the targeted users for these tools. Demonstrations of these tools were requested as part of the criteria under concept of operations. Knowing what tools are available and who could best use them gave us an idea of whether the developer s services or other specialized services would be required to customize and later adjust the system. In addition to customization tools, the system should allow for core language knowledge to be separated from the domain dependent knowledge. For example, the core words of a language should not have to be re-entered whenever the system is customized for a new domain. The developers were asked if they had allowed for such modularity. Another factor in determining the quality of a system's implementation was whether the software could be easily maintained and integrated with other software. To determine how easily the software could be maintained, 7

8 we asked whether the software was modular and whether good documentation existed that accurately described each of the I/O modules. To determine if the system could possibly be integrated with other software, we asked if a programmer s interface had been developed and documented for the system. Theoretical Foundations Knowing something about the underlying theoretics of a system gives us some indication of the facility with which it could be extended to handle new languages, domains and text styles. Past experience in software engineering tells us that ad-hoc systems not based on some coherent theoretical foundation are difficult to extend and maintain. Additionally, one of the objectives of the project is to identify research that looks promising for meeting long-term needs. Again, knowing the theoretical foundations of the research is a major factor in predicting whether the work is promising. 3.5 Increased Throughput Judging whether an MT system performs well enough to increase throughput is a complex problem that depends not only on the speed with which the system produces a translation but also on the quality of the translation. To rate the quality of the output we must consider the types of text and domains the user needs translated, and the translation quality that will be acceptable to him. We knew that many different text styles, languages and domains were of interest to the users. So until their needs could be prioritized we considered what language phenomena out of all possible phenomena the MT systems and research addressed. Another factor in judging the system performance was whether the system would produce an output of high enough quality that the user would be inclined to post-edit it instead of doing his own translation directly from the source text and ignoring the MT output. Performance We needed to determine whether the system operated efficiently enough and that the output quality was high enough to provide cost effective translations. To make this determination, we asked the developers for any performance measures they may have collected. These numbers did not allow us to make truly objective comparisons since they were made under different test environments. Because of this we asked the developers whether the performance figures were based on real text and whether the measures were based on use by system developers or translators who were not associated with the development effort. Knowing this information allowed us to compensate for the performance measures among different systems. The performance figures we asked for were: the time spent pre-editing and post-editing, the speed with which the raw machine translation is pro- 8

9 duced and whether any customers have measured a change in productivity. Associated with the speed for producing raw translations, we asked how the handling of unexpected text affected this performance. In addition to these kinds of performance measures, we asked for any measures that have been made for how long it takes to customize or enhance the grammar, lexicon and domain. Competence To get an idea of a system s linguistic coverage as part of judging its competence, we used a fairly comprehensive checklist of linguistic and textual phenomena which can be found in Appendix 4. The idea was to ask the developers which of these phenomena their system handles. We did not expect anyone to have achieved handling of everything on the list. Also we collected data on linguistic phenomena that are currently of low priority to the users but we expect this data to be useful in predicting whether the quality of a system s approach can be improved. At this point we want to be able to gauge how wide a system's linguistic coverage is. Presumably a wider coverage means that the system could produce a higher-quality output. A major factor in judging a system s competence is whether the output is acceptable to the users. We have three types of MT users: those who need to scan material to estimate its relevance, those who want to know the content of the material and those who want publication-quality output. To determine whether a system s MT output will be acceptable to at least one of these three user groups, we will evaluate the English output of operational systems (see section 2 for discussion on using test suites for acceptability measures). The source texts used for this purpose include those that the developer typically provides as samples and an independently selected text that corresponds to the system s language pairs, domains and text styles. Whenever possible, the machine translation of the user-provided source text was performed in our presence. By being present during the machine translation, we know what modifications had to be made in order to get the output that we will be evaluating for acceptability. The modifications that were made (e.g. preediting, post-editing, lexical changes and additions) will give us more insight into the linguistic coverage of the system. Two separate types of fidelity tests are needed to evaluate the acceptability of the output to all three types of users. The fidelity tests determine whether the meaning of the text has been retained in the translation (semantic invariance). Users who want to know just the subject area and users who need just the content of the text, are primarily concerned with semantic invariance; anything beyond this is a secondary consideration. On the other hand, users who want publication-quality output are concerned about stylistic and grammatical well-formedness as well as semantic invariance. For the latter two user groups, if incorrectly ordered words (along with other typical 9

10 errors) do not inhibit the user s understanding, then that user will be satisfied with the quality of the output. For example, most people would agree that the following sentence is still comprehensible despite the word ordering error. *Incorrectly words ordered do not inhibit the reader s understanding. The first fidelity test will be to examine the raw MT output and state the subject matter of the text without referring to the original source material. This test predicts whether the MT output will be acceptable to those who are scanning for texts in particular subject areas. In the second fidelity test, we will compare the raw MT output to the original source material and rate how well the meaning of the original text was preserved. When rating the semantic invariance of the MT output, we follow a scale to keep the ratings consistent across languages and domains. We will be using Nagao s seven point scale for judging accuracy or fidelity (N + 85) (also like Van Slype s (vs82) measures of information transfer). Nagao s seven point scale follows: 1. Content of input sentence faithfully conveyed to output sentence. Translated sentence clear to native speaker and no rewriting needed. 2. Content of input sentence faithfully conveyed to output sentence and can be clearly understood by native speaker but some rewriting needed. Can be corrected by native speaker without referring to original. 3. Content of input sentence faithfully conveyed in output sentence but some changes needed in word order. 4. Content of input sentence generally conveyed faithfully in output but problems with things like relationships between phrases and expressions and with tense, voice, plurals and positions of adverbs. 5. Content not adequately conveyed. Some expressions are missing and there are problems with relationships between clauses, between phrases and clauses or between sentence elements. 6. Content not conveyed. Clauses and phrases missing. 7. Content not conveyed at all. Output not proper sentence; subjects and predicates missing. In noun phrases, main noun is missing or clause or phrase acting as a verb and modifying a noun missing. 10

11 This scale is for individual sentences, so the ratings for each sentence will be combined by taking the average. If the rating of the article's fidelity is in the range 1-4 then it is suitable for someone who wants to know just the content of the text or who wants to post-edit it. In contrast, a rating in the range of 1-5 would be suitable for scanning purposes. An option we considered was to use multi-lingual domain experts (instead of ourselves) to perform the fidelity tests described above. An additional measure of acceptability was considered for those users who need publication-quality output. In this case, style and grammaticality are as important as semantic invariance. However, no one expects publicationquality translations to be automatically generated by an MT system without some form of human assistance. This assistance could come in the form of pre-editing, post-editing or interactive dialogs that occur during the translation process to help resolve difficulties as they are encountered. So the question in this case is whether the combined efforts of the post-editor and the MT system are more productive than the translator (and to be fair, any other MAT tools of his own choosing) without the MT system. To determine which is more productive, timings would need to be made for the two configurations in a well-controlled environment. For example, to make the measure objective, the translator and the post-editor should not be the same person and a statistically significant number of measures would have to be made. Given the time constraints of the project, this type of measure will not be done but it could be undertaken during a more detailed evaluation. For now, we are relying on any productivity measures users might be able to provide. 4 Status of the Evaluation At this writing, we have surveyed most of the work we planned to cover and have collected the evaluation data described here. The requirements analysis is completed and we are in the process of weighting and prioritizing the evaluation data on the basis of what we have learned about the requirements. At the beginning of the survey, we were initially concerned that the subjects of the MT interviews would not be willing to spend as much time as it took to go through our lengthy questionnaire. Fortunately, an overwhelming majority of the groups were extremely cooperative. As we continue with the evaluation, we find that the data we collected have been adequate for doing a first-pass evaluation. The biggest difficulty we have encountered, so far, has been in finding out enough about the user's requirements to make an evaluation possible. Users cannot easily tell us what makes an MT output acceptable to them and this is one of the key elements in evaluating an MT system for our users. 11

12 Acknowledgements I wish to acknowledge the we s I referred to throughout the paper. The other members of the survey and evaluation team are Dr. John W. Benoit, Adrienne J. Kleiboemer and George L. Marling, and our MT consultant is Dr. Bonnie J. Dorr from the University of Maryland. 12

13 APPENDIX: QUESTIONNAIRES 1 User Interviews User Group Name: Address: System or Product used: MT Requirements Languages and directionality needed and which needs met: Domains needed and which needs met: Text styles needed and which needs met: (S&T legal news article titles & IR sentences telegraphic) Any other requirements met: Any other requirements not met: Interaction style (batch interactive) Target user (translator non-translator) Concept of Operations Hardware platforms: Memory requirements: System extensibility, tools and targeted users for: Correcting grammar: Adding new languages: Adding/correcting lexicon: System modifiability Modularity of software: Modularity of lexicon: Modularity of grammar: I/O for modules well-documented? Hooks to integrate with other software tools (i.e. programmer intf.)? What is the external flow of the translation process (user viewpoint): 13

14 How is source text input? Are there tools to import machine-readable text (describe them)? Describe Pre-editing capabilities: Describe Post-editing capabilities : Describe from the user viewpoint how unexpected text is handled (e.g. unable to translate): Describe user-friendliness of tools (amount of training required): Is there any consistency-checking for modifications? Are there software hooks that allow the system to be integrated with other software? System Performance Performance figures based on real text? Performance figures based on use by translators or system developers? Average time spent pre-editing: Speed with which produce raw translation (prefer words/hour): Average time spent post-editing: Increase in user productivity: Time required to modify grammar: Time required to modify lexicon: Effect of unexpected text on system performance: Benchmarks against translators of varying skill levels: Time spent customizing and type of person responsible: Time spent correcting and type of person responsible: 14

15 Time spent integrating with other software and type of person responsible: System Competence Give subjective estimate of translation quality: Collect samples of source input and associated target output: Size of lexicon: Structure of lexicon, describe: Handling of synonymy: Handling of hyponomy: Handling of abbreviations and acronyms: Handling of numbers: Size of grammar: Checklist for Linguistic and Textual Phenomena Handled Lexical Ambiguity Lexical Selection Lexical Divergences Categorial Divergences Conflational Divergences Syntactic Distinctions Head Initial vs. Head Final Null subject vs. overt subject Free inversion vs. static positioning Free word order vs. configurational Limited movement vs. long distance movement Thematic divergences Structural divergences Quantifier Scoping Conjunction Scoping Negation Scoping Modifier attachment Prepositional phrases adjectives adverbs noun-noun compounds (idioms) Proper Nouns Auxiliary verbs Levels of embededness allowed 15

16 Anaphora (inter and intra-sentential, bindings) Pronominal Prosentential Proverbial Proactional Proadjectival Temporal Locative Elliptic Metonomy Case Markings Tenses Progressive Present Past Future Aspect Honorifics Instantaneous vs. Over time Moods Interrogative (intentionally left out tags and echoes) wh-questions alternative questions Declarative Imperative Subjunctive Comparative constructions Relative Clauses (gap filling) Restrictive Relative Clauses Reduced Restrictive Relative Clauses Non-restrictive Relative Clauses Text typography footnotes & footnote references headings and paragraph markings word boundaries line boundaries punctuation quotations parenthetical insertions tables figures formulae units of measure (technical use kgms but non-technical don't) 16

17 2 Vendor/Researcher Interviews Company Name: Company address: Marketing contact name and phone: Technical contact name and phone: System or Product Name: General Capabilities Languages currently supported and directionality: Domains covered: Text styles covered: (S&T legal news article titles & IR sentences telegraphic) Interaction style (batch interactive) Target user (translator non-translator) Target translation quality of output text: System Implementation Hardware platforms: Memory requirements: Software implementation languages: Availability of source code: System extensibility, tools and targeted users for: Correcting grammar: Adding new languages: Adding/correcting lexicon: 17

18 System modifiability Modularity of software: Modularity of lexicon: Modularity of grammar: I/O for modules well-documented? Hooks to integrate with other software tools (i.e. programmer intf.)? Concept of Operations Describe the anticipated external flow of the translation process (user viewpoint): How is source text input? Are there tools to import machine-readable text (describe them)? Describe Pre-editing capabilities: Describe Post-editing capabilities : Describe from the user viewpoint how unexpected text is handled (e.g. unable to translate): Describe user-friendliness of tools (amount of training required): Is there any consistency-checking for modifications? Are there software hooks that allow the system to be integrated with other software? System Performance Performance figures based on real text? Performance figures based on use by translators or system developers? Average time spent pre-editing: Average speed with which produce raw translation (prefer words/hour): Average time spent post-editing: Average increase in user productivity: 18

19 Average time required to modify grammar: Average time required to modify lexicon: Effect of unexpected text on system performance: Benchmarks against translators of varying skill levels: System Availability and Market Reactions Cost: Describe customization services offered: Average cost for customizing New language: New domain: New text style: Number of systems in use: Number of companies which have purchased: Description of current customers: Description of projected customers: Is the Company stable? System Competence Give subjective estimate of translation quality: Collect samples of source input and associated target output: Size of lexicon: Structure of lexicon, describe: Handling of synonymy: Handling of hyponomy: Handling of abbreviations and acronyms: Handling of numbers: 19

20 Size of grammar: Checklist for Linguistic and Textual Phenomena Handled Lexical Ambiguity Lexical Selection Lexical Divergences Categorial Divergences Conflational Divergences Syntactic Distinctions Head Initial vs. Head Final Null subject vs. overt subject Free inversion vs. static positioning Free word order vs. configurational Limited movement vs. long distance movement Thematic divergences Structural divergences Quantifier Scoping Conjunction Scoping Negation Scoping Modifier attachment Prepositional phrases adjectives adverbs noun-noun compounds (idioms) Proper Nouns Auxiliary verbs Levels of embededness allowed Anaphora (inter and intra-sentential, bindings) Pronominal Prosentential Proverbial Proactional Proadjectival Temporal Locative Elliptic Metonomy Case Markings Tenses Progressive Present Past Future 20

21 Aspect Honorifics Instantaneous vs. Over time Moods Interrogative (intentionally left out tags and echoes) wh- questions alternative questions Declarative Imperative Subjunctive Comparative constructions Relative Clauses (gap filling) Restrictive Relative Clauses Reduced Restrictive Relative Clauses Non-restrictive Relative Clauses Text typography footnotes & footnote references headings and paragraph markings word boundaries line boundaries punctuation quotations parenthetical insertions tables figures formulae units of measure (technical use kgms but non-technical don't) Descriptions of Theoretical Foundations MT approach (direct transfer interlingual example-based) Morphology, describe: Morphological approach used (e.g. KIMMO) including changes and assumptions: Heuristics used: Representational mechanisms: Syntax, describe: Syntactic Theory used (GB LFG CUG GPSG HPSG TG other) including changes, assumptions and heuristics: Grammar design (corpus-based standard grammar): 21

22 Parser used including changes, assumptions and heuristics: Generator used including changes, assumptions and heuristics: Representational mechanisms: Semantics, describe: Semantic Theory used for lexicon, world knowledge and domain knowledge including changes, assumptions and heuristics: Representational mechanisms: Discourse, describe: Discourse approach used including changes, assumptions and heuristics: Representational mechanisms: Describe Architecture of the MT components (how do the above components interact?): Describe approach to handling unexpected text: Describe approach for consistency-checking: List relevant technical literature that describes the theoretical foundations: Of the linguistic phenomena handled, how is it done? Company Background and Information What is the technological heritage of the group? What is the corporate structure and how does the MT group fit in this structure (is the support for MT serious)? What are the goals and schedules for: Commercial MT Software? MT Research? 22

23 References ALPAC. Language and machines: Computers in translation and linguistics. Technical report, Division of Behavioral Sciences, National Academy of Sciences, National Research Council, Washington, DC, A Report by the Automatic Language Processing Advisory Committee. Winfield S. Bennett and Jonathan Slocum. The LRC machine translation system. Computational Linguistics, 11: , April-September Jaime G. Carbonell, Richard E. Cullingford, and Anatole V. Gershman. Steps toward knowledge-based machine translation. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-3(4), Tom Gilb and Susannah Finzi. Principles of software engineering management. Addison-Wesley Pub. Co., Reading, Mass., Pierre Isabelle and Laurent Bourbeau. TAUM - AVIATION: Its technical features and some experimental results. Computational Linguistics, 11:24-26, January-March M. King and K. Falkedal. Using test suites in evaluation of machine translation systems. In COLING 90, volume 2, pages , M. King. A practical guide to the evaluation of machine translation systems. Technical report, ISSCO, Intermediate Report to Suissetra. J. Lehrberger and Laurent Bourbeau. Machine Translation: Linguistic Characteristics of Machine Translation Systems and General Methodology for Evaluation. Benjamins, Makato Nagao et al. The Japanese government project for machine translation. Computational Linguistics, 11, April-September M. Palmer and T. Finin. Workshop on the evaluation of natural language processing systems. Computational Linguistics, 16(3), Muriel Vasconcellos and Marjorie Leon. SPANAM and ENGSPAN: Machine translation at the Pan American Health Organization. Computational Linguistics, 11:135, April-September G. van Slype. Conception d une méthodologie générale d évaluation de la traduction automatique. Multilingua, 1(4): ,

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments

Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Specification and Evaluation of Machine Translation Toy Systems - Criteria for laboratory assignments Cristina Vertan, Walther v. Hahn University of Hamburg, Natural Language Systems Division Hamburg,

More information

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque Approaches to control phenomena handout 6 5.4 Obligatory control and morphological case: Icelandic and Basque Icelandinc quirky case (displaying properties of both structural and inherent case: lexically

More information

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Tyler Perrachione LING 451-0 Proseminar in Sound Structure Prof. A. Bradlow 17 March 2006 Intra-talker Variation: Audience Design Factors Affecting Lexical Selections Abstract Although the acoustic and

More information

An Interactive Intelligent Language Tutor Over The Internet

An Interactive Intelligent Language Tutor Over The Internet An Interactive Intelligent Language Tutor Over The Internet Trude Heift Linguistics Department and Language Learning Centre Simon Fraser University, B.C. Canada V5A1S6 E-mail: heift@sfu.ca Abstract: This

More information

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many Schmidt 1 Eric Schmidt Prof. Suzanne Flynn Linguistic Study of Bilingualism December 13, 2013 A Minimalist Approach to Code-Switching In the field of linguistics, the topic of bilingualism is a broad one.

More information

Software Maintenance

Software Maintenance 1 What is Software Maintenance? Software Maintenance is a very broad activity that includes error corrections, enhancements of capabilities, deletion of obsolete capabilities, and optimization. 2 Categories

More information

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering

Document number: 2013/ Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering Document number: 2013/0006139 Programs Committee 6/2014 (July) Agenda Item 42.0 Bachelor of Engineering with Honours in Software Engineering Program Learning Outcomes Threshold Learning Outcomes for Engineering

More information

Parsing of part-of-speech tagged Assamese Texts

Parsing of part-of-speech tagged Assamese Texts IJCSI International Journal of Computer Science Issues, Vol. 6, No. 1, 2009 ISSN (Online): 1694-0784 ISSN (Print): 1694-0814 28 Parsing of part-of-speech tagged Assamese Texts Mirzanur Rahman 1, Sufal

More information

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s))

PAGE(S) WHERE TAUGHT If sub mission ins not a book, cite appropriate location(s)) Ohio Academic Content Standards Grade Level Indicators (Grade 11) A. ACQUISITION OF VOCABULARY Students acquire vocabulary through exposure to language-rich situations, such as reading books and other

More information

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF) Hans Christian 1 ; Mikhael Pramodana Agus 2 ; Derwin Suhartono 3 1,2,3 Computer Science Department,

More information

CEFR Overall Illustrative English Proficiency Scales

CEFR Overall Illustrative English Proficiency Scales CEFR Overall Illustrative English Proficiency s CEFR CEFR OVERALL ORAL PRODUCTION Has a good command of idiomatic expressions and colloquialisms with awareness of connotative levels of meaning. Can convey

More information

The College Board Redesigned SAT Grade 12

The College Board Redesigned SAT Grade 12 A Correlation of, 2017 To the Redesigned SAT Introduction This document demonstrates how myperspectives English Language Arts meets the Reading, Writing and Language and Essay Domains of Redesigned SAT.

More information

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing.

The presence of interpretable but ungrammatical sentences corresponds to mismatches between interpretive and productive parsing. Lecture 4: OT Syntax Sources: Kager 1999, Section 8; Legendre et al. 1998; Grimshaw 1997; Barbosa et al. 1998, Introduction; Bresnan 1998; Fanselow et al. 1999; Gibson & Broihier 1998. OT is not a theory

More information

Advanced Grammar in Use

Advanced Grammar in Use Advanced Grammar in Use A self-study reference and practice book for advanced learners of English Third Edition with answers and CD-ROM cambridge university press cambridge, new york, melbourne, madrid,

More information

Heritage Korean Stage 6 Syllabus Preliminary and HSC Courses

Heritage Korean Stage 6 Syllabus Preliminary and HSC Courses Heritage Korean Stage 6 Syllabus Preliminary and HSC Courses 2010 Board of Studies NSW for and on behalf of the Crown in right of the State of New South Wales This document contains Material prepared by

More information

Linguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis

Linguistic Variation across Sports Category of Press Reportage from British Newspapers: a Diachronic Multidimensional Analysis International Journal of Arts Humanities and Social Sciences (IJAHSS) Volume 1 Issue 1 ǁ August 216. www.ijahss.com Linguistic Variation across Sports Category of Press Reportage from British Newspapers:

More information

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions. to as a linguistic theory to to a member of the family of linguistic frameworks that are called generative grammars a grammar which is formalized to a high degree and thus makes exact predictions about

More information

Candidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level.

Candidates must achieve a grade of at least C2 level in each examination in order to achieve the overall qualification at C2 Level. The Test of Interactive English, C2 Level Qualification Structure The Test of Interactive English consists of two units: Unit Name English English Each Unit is assessed via a separate examination, set,

More information

AQUA: An Ontology-Driven Question Answering System

AQUA: An Ontology-Driven Question Answering System AQUA: An Ontology-Driven Question Answering System Maria Vargas-Vera, Enrico Motta and John Domingue Knowledge Media Institute (KMI) The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom.

More information

Minimalism is the name of the predominant approach in generative linguistics today. It was first

Minimalism is the name of the predominant approach in generative linguistics today. It was first Minimalism Minimalism is the name of the predominant approach in generative linguistics today. It was first introduced by Chomsky in his work The Minimalist Program (1995) and has seen several developments

More information

Researcher Development Assessment A: Knowledge and intellectual abilities

Researcher Development Assessment A: Knowledge and intellectual abilities Researcher Development Assessment A: Knowledge and intellectual abilities Domain A: Knowledge and intellectual abilities This domain relates to the knowledge and intellectual abilities needed to be able

More information

5. UPPER INTERMEDIATE

5. UPPER INTERMEDIATE Triolearn General Programmes adapt the standards and the Qualifications of Common European Framework of Reference (CEFR) and Cambridge ESOL. It is designed to be compatible to the local and the regional

More information

Some Principles of Automated Natural Language Information Extraction

Some Principles of Automated Natural Language Information Extraction Some Principles of Automated Natural Language Information Extraction Gregers Koch Department of Computer Science, Copenhagen University DIKU, Universitetsparken 1, DK-2100 Copenhagen, Denmark Abstract

More information

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq

Different Requirements Gathering Techniques and Issues. Javaria Mushtaq 835 Different Requirements Gathering Techniques and Issues Javaria Mushtaq Abstract- Project management is now becoming a very important part of our software industries. To handle projects with success

More information

Statewide Strategic Plan for e-learning in California s Child Welfare Training System

Statewide Strategic Plan for e-learning in California s Child Welfare Training System Statewide Strategic Plan for e-learning in California s Child Welfare Training System Decision Point Outline December 14, 2009 Vision CalSWEC, the schools of social work, the regional training academies,

More information

Guidelines for Writing an Internship Report

Guidelines for Writing an Internship Report Guidelines for Writing an Internship Report Master of Commerce (MCOM) Program Bahauddin Zakariya University, Multan Table of Contents Table of Contents... 2 1. Introduction.... 3 2. The Required Components

More information

What the National Curriculum requires in reading at Y5 and Y6

What the National Curriculum requires in reading at Y5 and Y6 What the National Curriculum requires in reading at Y5 and Y6 Word reading apply their growing knowledge of root words, prefixes and suffixes (morphology and etymology), as listed in Appendix 1 of the

More information

Graduate Program in Education

Graduate Program in Education SPECIAL EDUCATION THESIS/PROJECT AND SEMINAR (EDME 531-01) SPRING / 2015 Professor: Janet DeRosa, D.Ed. Course Dates: January 11 to May 9, 2015 Phone: 717-258-5389 (home) Office hours: Tuesday evenings

More information

An Introduction to the Minimalist Program

An Introduction to the Minimalist Program An Introduction to the Minimalist Program Luke Smith University of Arizona Summer 2016 Some findings of traditional syntax Human languages vary greatly, but digging deeper, they all have distinct commonalities:

More information

A Case Study: News Classification Based on Term Frequency

A Case Study: News Classification Based on Term Frequency A Case Study: News Classification Based on Term Frequency Petr Kroha Faculty of Computer Science University of Technology 09107 Chemnitz Germany kroha@informatik.tu-chemnitz.de Ricardo Baeza-Yates Center

More information

CS 598 Natural Language Processing

CS 598 Natural Language Processing CS 598 Natural Language Processing Natural language is everywhere Natural language is everywhere Natural language is everywhere Natural language is everywhere!"#$%&'&()*+,-./012 34*5665756638/9:;< =>?@ABCDEFGHIJ5KL@

More information

Subject: Opening the American West. What are you teaching? Explorations of Lewis and Clark

Subject: Opening the American West. What are you teaching? Explorations of Lewis and Clark Theme 2: My World & Others (Geography) Grade 5: Lewis and Clark: Opening the American West by Ellen Rodger (U.S. Geography) This 4MAT lesson incorporates activities in the Daily Lesson Guide (DLG) that

More information

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 -

Think A F R I C A when assessing speaking. C.E.F.R. Oral Assessment Criteria. Think A F R I C A - 1 - C.E.F.R. Oral Assessment Criteria Think A F R I C A - 1 - 1. The extracts in the left hand column are taken from the official descriptors of the CEFR levels. How would you grade them on a scale of low,

More information

DICE - Final Report. Project Information Project Acronym DICE Project Title

DICE - Final Report. Project Information Project Acronym DICE Project Title DICE - Final Report Project Information Project Acronym DICE Project Title Digital Communication Enhancement Start Date November 2011 End Date July 2012 Lead Institution London School of Economics and

More information

Cross Language Information Retrieval

Cross Language Information Retrieval Cross Language Information Retrieval RAFFAELLA BERNARDI UNIVERSITÀ DEGLI STUDI DI TRENTO P.ZZA VENEZIA, ROOM: 2.05, E-MAIL: BERNARDI@DISI.UNITN.IT Contents 1 Acknowledgment.............................................

More information

Control and Boundedness

Control and Boundedness Control and Boundedness Having eliminated rules, we would expect constructions to follow from the lexical categories (of heads and specifiers of syntactic constructions) alone. Combinatory syntax simply

More information

Welcome to the Purdue OWL. Where do I begin? General Strategies. Personalizing Proofreading

Welcome to the Purdue OWL. Where do I begin? General Strategies. Personalizing Proofreading Welcome to the Purdue OWL This page is brought to you by the OWL at Purdue (http://owl.english.purdue.edu/). When printing this page, you must include the entire legal notice at bottom. Where do I begin?

More information

Rubric for Scoring English 1 Unit 1, Rhetorical Analysis

Rubric for Scoring English 1 Unit 1, Rhetorical Analysis FYE Program at Marquette University Rubric for Scoring English 1 Unit 1, Rhetorical Analysis Writing Conventions INTEGRATING SOURCE MATERIAL 3 Proficient Outcome Effectively expresses purpose in the introduction

More information

Myths, Legends, Fairytales and Novels (Writing a Letter)

Myths, Legends, Fairytales and Novels (Writing a Letter) Assessment Focus This task focuses on Communication through the mode of Writing at Levels 3, 4 and 5. Two linked tasks (Hot Seating and Character Study) that use the same context are available to assess

More information

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist Meeting 2 Chapter 7 (Morphology) and chapter 9 (Syntax) Today s agenda Repetition of meeting 1 Mini-lecture on morphology Seminar on chapter 7, worksheet Mini-lecture on syntax Seminar on chapter 9, worksheet

More information

Loughton School s curriculum evening. 28 th February 2017

Loughton School s curriculum evening. 28 th February 2017 Loughton School s curriculum evening 28 th February 2017 Aims of this session Share our approach to teaching writing, reading, SPaG and maths. Share resources, ideas and strategies to support children's

More information

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh The Effect of Discourse Markers on the Speaking Production of EFL Students Iman Moradimanesh Abstract The research aimed at investigating the relationship between discourse markers (DMs) and a special

More information

White Paper. The Art of Learning

White Paper. The Art of Learning The Art of Learning Based upon years of observation of adult learners in both our face-to-face classroom courses and using our Mentored Email 1 distance learning methodology, it is fascinating to see how

More information

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80.

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80. CONTENTS FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8 УРОК (Unit) 1 25 1.1. QUESTIONS WITH КТО AND ЧТО 27 1.2. GENDER OF NOUNS 29 1.3. PERSONAL PRONOUNS 31 УРОК (Unit) 2 38 2.1. PRESENT TENSE OF THE

More information

Major Milestones, Team Activities, and Individual Deliverables

Major Milestones, Team Activities, and Individual Deliverables Major Milestones, Team Activities, and Individual Deliverables Milestone #1: Team Semester Proposal Your team should write a proposal that describes project objectives, existing relevant technology, engineering

More information

Senior Stenographer / Senior Typist Series (including equivalent Secretary titles)

Senior Stenographer / Senior Typist Series (including equivalent Secretary titles) New York State Department of Civil Service Committed to Innovation, Quality, and Excellence A Guide to the Written Test for the Senior Stenographer / Senior Typist Series (including equivalent Secretary

More information

What is PDE? Research Report. Paul Nichols

What is PDE? Research Report. Paul Nichols What is PDE? Research Report Paul Nichols December 2013 WHAT IS PDE? 1 About Pearson Everything we do at Pearson grows out of a clear mission: to help people make progress in their lives through personalized

More information

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) Feb 2015

Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL)  Feb 2015 Author: Justyna Kowalczys Stowarzyszenie Angielski w Medycynie (PL) www.angielskiwmedycynie.org.pl Feb 2015 Developing speaking abilities is a prerequisite for HELP in order to promote effective communication

More information

Emmaus Lutheran School English Language Arts Curriculum

Emmaus Lutheran School English Language Arts Curriculum Emmaus Lutheran School English Language Arts Curriculum Rationale based on Scripture God is the Creator of all things, including English Language Arts. Our school is committed to providing students with

More information

Common Core State Standards for English Language Arts

Common Core State Standards for English Language Arts Reading Standards for Literature 6-12 Grade 9-10 Students: 1. Cite strong and thorough textual evidence to support analysis of what the text says explicitly as well as inferences drawn from the text. 2.

More information

CELTA. Syllabus and Assessment Guidelines. Third Edition. University of Cambridge ESOL Examinations 1 Hills Road Cambridge CB1 2EU United Kingdom

CELTA. Syllabus and Assessment Guidelines. Third Edition. University of Cambridge ESOL Examinations 1 Hills Road Cambridge CB1 2EU United Kingdom CELTA Syllabus and Assessment Guidelines Third Edition CELTA (Certificate in Teaching English to Speakers of Other Languages) is accredited by Ofqual (the regulator of qualifications, examinations and

More information

Context Free Grammars. Many slides from Michael Collins

Context Free Grammars. Many slides from Michael Collins Context Free Grammars Many slides from Michael Collins Overview I An introduction to the parsing problem I Context free grammars I A brief(!) sketch of the syntax of English I Examples of ambiguous structures

More information

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG Dr. Kakia Chatsiou, University of Essex achats at essex.ac.uk Explorations in Syntactic Government and Subcategorisation,

More information

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS

Arizona s English Language Arts Standards th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS Arizona s English Language Arts Standards 11-12th Grade ARIZONA DEPARTMENT OF EDUCATION HIGH ACADEMIC STANDARDS FOR STUDENTS 11 th -12 th Grade Overview Arizona s English Language Arts Standards work together

More information

Derivational and Inflectional Morphemes in Pak-Pak Language

Derivational and Inflectional Morphemes in Pak-Pak Language Derivational and Inflectional Morphemes in Pak-Pak Language Agustina Situmorang and Tima Mariany Arifin ABSTRACT The objectives of this study are to find out the derivational and inflectional morphemes

More information

Readability tools: are they useful for medical writers?

Readability tools: are they useful for medical writers? Readability tools: are they useful for medical writers? John Dixon MedComms Networking Event, 4th October, 2017 www.medcommsnetworking.com Libra Communications Training As I sincerely aspire to successfully

More information

Comprehension Recognize plot features of fairy tales, folk tales, fables, and myths.

Comprehension Recognize plot features of fairy tales, folk tales, fables, and myths. 4 th Grade Language Arts Scope and Sequence 1 st Nine Weeks Instructional Units Reading Unit 1 & 2 Language Arts Unit 1& 2 Assessments Placement Test Running Records DIBELS Reading Unit 1 Language Arts

More information

How to Judge the Quality of an Objective Classroom Test

How to Judge the Quality of an Objective Classroom Test How to Judge the Quality of an Objective Classroom Test Technical Bulletin #6 Evaluation and Examination Service The University of Iowa (319) 335-0356 HOW TO JUDGE THE QUALITY OF AN OBJECTIVE CLASSROOM

More information

Carolina Course Evaluation Item Bank Last Revised Fall 2009

Carolina Course Evaluation Item Bank Last Revised Fall 2009 Carolina Course Evaluation Item Bank Last Revised Fall 2009 Items Appearing on the Standard Carolina Course Evaluation Instrument Core Items Instructor and Course Characteristics Results are intended for

More information

Speech Recognition at ICSI: Broadcast News and beyond

Speech Recognition at ICSI: Broadcast News and beyond Speech Recognition at ICSI: Broadcast News and beyond Dan Ellis International Computer Science Institute, Berkeley CA Outline 1 2 3 The DARPA Broadcast News task Aspects of ICSI

More information

Formulaic Language and Fluency: ESL Teaching Applications

Formulaic Language and Fluency: ESL Teaching Applications Formulaic Language and Fluency: ESL Teaching Applications Formulaic Language Terminology Formulaic sequence One such item Formulaic language Non-count noun referring to these items Phraseology The study

More information

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading ELA/ELD Correlation Matrix for ELD Materials Grade 1 Reading The English Language Arts (ELA) required for the one hour of English-Language Development (ELD) Materials are listed in Appendix 9-A, Matrix

More information

A Pipelined Approach for Iterative Software Process Model

A Pipelined Approach for Iterative Software Process Model A Pipelined Approach for Iterative Software Process Model Ms.Prasanthi E R, Ms.Aparna Rathi, Ms.Vardhani J P, Mr.Vivek Krishna Electronics and Radar Development Establishment C V Raman Nagar, Bangalore-560093,

More information

Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment

Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment Impact of Controlled Language on Translation Quality and Post-editing in a Statistical Machine Translation Environment Takako Aikawa, Lee Schwartz, Ronit King Mo Corston-Oliver Carmen Lozano Microsoft

More information

The Short Essay: Week 6

The Short Essay: Week 6 The Minnesota Literacy Council created this curriculum. We invite you to adapt it for your own classrooms. Advanced Level (CASAS reading scores of 221-235) The Short Essay: Week 6 Unit Overview This is

More information

Developing a TT-MCTAG for German with an RCG-based Parser

Developing a TT-MCTAG for German with an RCG-based Parser Developing a TT-MCTAG for German with an RCG-based Parser Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert University of Tübingen, Germany CNRS-LORIA, France LREC 2008,

More information

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas

P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou, C. Skourlas, J. Varnas Exploiting Distance Learning Methods and Multimediaenhanced instructional content to support IT Curricula in Greek Technological Educational Institutes P. Belsis, C. Sgouropoulou, K. Sfikas, G. Pantziou,

More information

California Department of Education English Language Development Standards for Grade 8

California Department of Education English Language Development Standards for Grade 8 Section 1: Goal, Critical Principles, and Overview Goal: English learners read, analyze, interpret, and create a variety of literary and informational text types. They develop an understanding of how language

More information

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data

Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Target Language Preposition Selection an Experiment with Transformation-Based Learning and Aligned Bilingual Data Ebba Gustavii Department of Linguistics and Philology, Uppsala University, Sweden ebbag@stp.ling.uu.se

More information

Writing a composition

Writing a composition A good composition has three elements: Writing a composition an introduction: A topic sentence which contains the main idea of the paragraph. a body : Supporting sentences that develop the main idea. a

More information

Creating Travel Advice

Creating Travel Advice Creating Travel Advice Classroom at a Glance Teacher: Language: Grade: 11 School: Fran Pettigrew Spanish III Lesson Date: March 20 Class Size: 30 Schedule: McLean High School, McLean, Virginia Block schedule,

More information

A Framework for Customizable Generation of Hypertext Presentations

A Framework for Customizable Generation of Hypertext Presentations A Framework for Customizable Generation of Hypertext Presentations Benoit Lavoie and Owen Rambow CoGenTex, Inc. 840 Hanshaw Road, Ithaca, NY 14850, USA benoit, owen~cogentex, com Abstract In this paper,

More information

The Smart/Empire TIPSTER IR System

The Smart/Empire TIPSTER IR System The Smart/Empire TIPSTER IR System Chris Buckley, Janet Walz Sabir Research, Gaithersburg, MD chrisb,walz@sabir.com Claire Cardie, Scott Mardis, Mandar Mitra, David Pierce, Kiri Wagstaff Department of

More information

GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017

GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017 GERM 3040 GERMAN GRAMMAR AND COMPOSITION SPRING 2017 Instructor: Dr. Claudia Schwabe Class hours: TR 9:00-10:15 p.m. claudia.schwabe@usu.edu Class room: Old Main 301 Office: Old Main 002D Office hours:

More information

Unit 3. Design Activity. Overview. Purpose. Profile

Unit 3. Design Activity. Overview. Purpose. Profile Unit 3 Design Activity Overview Purpose The purpose of the Design Activity unit is to provide students with experience designing a communications product. Students will develop capability with the design

More information

Writing Research Articles

Writing Research Articles Marek J. Druzdzel with minor additions from Peter Brusilovsky University of Pittsburgh School of Information Sciences and Intelligent Systems Program marek@sis.pitt.edu http://www.pitt.edu/~druzdzel Overview

More information

TOURISM ECONOMICS AND POLICY (ASPECTS OF TOURISM) BY LARRY DWYER, PETER FORSYTH, WAYNE DWYER

TOURISM ECONOMICS AND POLICY (ASPECTS OF TOURISM) BY LARRY DWYER, PETER FORSYTH, WAYNE DWYER Read Online and Download Ebook TOURISM ECONOMICS AND POLICY (ASPECTS OF TOURISM) BY LARRY DWYER, PETER FORSYTH, WAYNE DWYER DOWNLOAD EBOOK : TOURISM ECONOMICS AND POLICY (ASPECTS OF TOURISM) BY LARRY DWYER,

More information

Three Strategies for Open Source Deployment: Substitution, Innovation, and Knowledge Reuse

Three Strategies for Open Source Deployment: Substitution, Innovation, and Knowledge Reuse Three Strategies for Open Source Deployment: Substitution, Innovation, and Knowledge Reuse Jonathan P. Allen 1 1 University of San Francisco, 2130 Fulton St., CA 94117, USA, jpallen@usfca.edu Abstract.

More information

Geo Risk Scan Getting grips on geotechnical risks

Geo Risk Scan Getting grips on geotechnical risks Geo Risk Scan Getting grips on geotechnical risks T.J. Bles & M.Th. van Staveren Deltares, Delft, the Netherlands P.P.T. Litjens & P.M.C.B.M. Cools Rijkswaterstaat Competence Center for Infrastructure,

More information

First Grade Curriculum Highlights: In alignment with the Common Core Standards

First Grade Curriculum Highlights: In alignment with the Common Core Standards First Grade Curriculum Highlights: In alignment with the Common Core Standards ENGLISH LANGUAGE ARTS Foundational Skills Print Concepts Demonstrate understanding of the organization and basic features

More information

Multiple case assignment and the English pseudo-passive *

Multiple case assignment and the English pseudo-passive * Multiple case assignment and the English pseudo-passive * Norvin Richards Massachusetts Institute of Technology Previous literature on pseudo-passives (see van Riemsdijk 1978, Chomsky 1981, Hornstein &

More information

Evolution of Symbolisation in Chimpanzees and Neural Nets

Evolution of Symbolisation in Chimpanzees and Neural Nets Evolution of Symbolisation in Chimpanzees and Neural Nets Angelo Cangelosi Centre for Neural and Adaptive Systems University of Plymouth (UK) a.cangelosi@plymouth.ac.uk Introduction Animal communication

More information

Language Acquisition Chart

Language Acquisition Chart Language Acquisition Chart This chart was designed to help teachers better understand the process of second language acquisition. Please use this chart as a resource for learning more about the way people

More information

BULATS A2 WORDLIST 2

BULATS A2 WORDLIST 2 BULATS A2 WORDLIST 2 INTRODUCTION TO THE BULATS A2 WORDLIST 2 The BULATS A2 WORDLIST 21 is a list of approximately 750 words to help candidates aiming at an A2 pass in the Cambridge BULATS exam. It is

More information

Mastering Team Skills and Interpersonal Communication. Copyright 2012 Pearson Education, Inc. publishing as Prentice Hall.

Mastering Team Skills and Interpersonal Communication. Copyright 2012 Pearson Education, Inc. publishing as Prentice Hall. Chapter 2 Mastering Team Skills and Interpersonal Communication Chapter 2-1 Communicating Effectively in Teams Chapter 2-2 Communicating Effectively in Teams Collaboration involves working together to

More information

The Effect of Extensive Reading on Developing the Grammatical. Accuracy of the EFL Freshmen at Al Al-Bayt University

The Effect of Extensive Reading on Developing the Grammatical. Accuracy of the EFL Freshmen at Al Al-Bayt University The Effect of Extensive Reading on Developing the Grammatical Accuracy of the EFL Freshmen at Al Al-Bayt University Kifah Rakan Alqadi Al Al-Bayt University Faculty of Arts Department of English Language

More information

Coast Academies Writing Framework Step 4. 1 of 7

Coast Academies Writing Framework Step 4. 1 of 7 1 KPI Spell further homophones. 2 3 Objective Spell words that are often misspelt (English Appendix 1) KPI Place the possessive apostrophe accurately in words with regular plurals: e.g. girls, boys and

More information

TU-E2090 Research Assignment in Operations Management and Services

TU-E2090 Research Assignment in Operations Management and Services Aalto University School of Science Operations and Service Management TU-E2090 Research Assignment in Operations Management and Services Version 2016-08-29 COURSE INSTRUCTOR: OFFICE HOURS: CONTACT: Saara

More information

Developing Grammar in Context

Developing Grammar in Context Developing Grammar in Context intermediate with answers Mark Nettle and Diana Hopkins PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United

More information

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3

Inleiding Taalkunde. Docent: Paola Monachesi. Blok 4, 2001/ Syntax 2. 2 Phrases and constituent structure 2. 3 A minigrammar of Italian 3 Inleiding Taalkunde Docent: Paola Monachesi Blok 4, 2001/2002 Contents 1 Syntax 2 2 Phrases and constituent structure 2 3 A minigrammar of Italian 3 4 Trees 3 5 Developing an Italian lexicon 4 6 S(emantic)-selection

More information

Lower and Upper Secondary

Lower and Upper Secondary Lower and Upper Secondary Type of Course Age Group Content Duration Target General English Lower secondary Grammar work, reading and comprehension skills, speech and drama. Using Multi-Media CD - Rom 7

More information

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur

Module 12. Machine Learning. Version 2 CSE IIT, Kharagpur Module 12 Machine Learning 12.1 Instructional Objective The students should understand the concept of learning systems Students should learn about different aspects of a learning system Students should

More information

Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis Probabilistic Latent Semantic Analysis Thomas Hofmann Presentation by Ioannis Pavlopoulos & Andreas Damianou for the course of Data Mining & Exploration 1 Outline Latent Semantic Analysis o Need o Overview

More information

A Neural Network GUI Tested on Text-To-Phoneme Mapping

A Neural Network GUI Tested on Text-To-Phoneme Mapping A Neural Network GUI Tested on Text-To-Phoneme Mapping MAARTEN TROMPPER Universiteit Utrecht m.f.a.trompper@students.uu.nl Abstract Text-to-phoneme (T2P) mapping is a necessary step in any speech synthesis

More information

Analyzing Linguistically Appropriate IEP Goals in Dual Language Programs

Analyzing Linguistically Appropriate IEP Goals in Dual Language Programs Analyzing Linguistically Appropriate IEP Goals in Dual Language Programs 2016 Dual Language Conference: Making Connections Between Policy and Practice March 19, 2016 Framingham, MA Session Description

More information

Procedia - Social and Behavioral Sciences 154 ( 2014 )

Procedia - Social and Behavioral Sciences 154 ( 2014 ) Available online at www.sciencedirect.com ScienceDirect Procedia - Social and Behavioral Sciences 154 ( 2014 ) 263 267 THE XXV ANNUAL INTERNATIONAL ACADEMIC CONFERENCE, LANGUAGE AND CULTURE, 20-22 October

More information

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature 1 st Grade Curriculum Map Common Core Standards Language Arts 2013 2014 1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature Key Ideas and Details

More information

The Discourse Anaphoric Properties of Connectives

The Discourse Anaphoric Properties of Connectives The Discourse Anaphoric Properties of Connectives Cassandre Creswell, Kate Forbes, Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi Λ, Bonnie Webber y Λ University of Pennsylvania 3401 Walnut Street Philadelphia,

More information

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011

CAAP. Content Analysis Report. Sample College. Institution Code: 9011 Institution Type: 4-Year Subgroup: none Test Date: Spring 2011 CAAP Content Analysis Report Institution Code: 911 Institution Type: 4-Year Normative Group: 4-year Colleges Introduction This report provides information intended to help postsecondary institutions better

More information

Compositional Semantics

Compositional Semantics Compositional Semantics CMSC 723 / LING 723 / INST 725 MARINE CARPUAT marine@cs.umd.edu Words, bag of words Sequences Trees Meaning Representing Meaning An important goal of NLP/AI: convert natural language

More information