Endoclitics in Pashto: Can They Really Do That?

Similar documents
A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

BULATS A2 WORDLIST 2

Word Stress and Intonation: Introduction

Derivational and Inflectional Morphemes in Pak-Pak Language

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

Taught Throughout the Year Foundational Skills Reading Writing Language RF.1.2 Demonstrate understanding of spoken words,

Phonological and Phonetic Representations: The Case of Neutralization

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

Opportunities for Writing Title Key Stage 1 Key Stage 2 Narrative

Intra-talker Variation: Audience Design Factors Affecting Lexical Selections

Adjectives tell you more about a noun (for example: the red dress ).

Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form

ELA/ELD Standards Correlation Matrix for ELD Materials Grade 1 Reading

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

Language Acquisition by Identical vs. Fraternal SLI Twins * Karin Stromswold & Jay I. Rifkin

ELD CELDT 5 EDGE Level C Curriculum Guide LANGUAGE DEVELOPMENT VOCABULARY COMMON WRITING PROJECT. ToolKit

Words come in categories

LING 329 : MORPHOLOGY

Developing Grammar in Context

What the National Curriculum requires in reading at Y5 and Y6

Houghton Mifflin Reading Correlation to the Common Core Standards for English Language Arts (Grade1)

Language contact in East Nusantara

First Grade Curriculum Highlights: In alignment with the Common Core Standards

The Acquisition of Person and Number Morphology Within the Verbal Domain in Early Greek

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities

DOWNSTEP IN SUPYIRE* Robert Carlson Societe Internationale de Linguistique, Mali

Minimalism is the name of the predominant approach in generative linguistics today. It was first

Progressive Aspect in Nigerian English

Writing a composition

Program Matrix - Reading English 6-12 (DOE Code 398) University of Florida. Reading

California Department of Education English Language Development Standards for Grade 8

Using a Native Language Reference Grammar as a Language Learning Tool

Coast Academies Writing Framework Step 4. 1 of 7

Basic concepts: words and morphemes. LING 481 Winter 2011

Developing a TT-MCTAG for German with an RCG-based Parser

An Interactive Intelligent Language Tutor Over The Internet

Character Stream Parsing of Mixed-lingual Text

FOREWORD.. 5 THE PROPER RUSSIAN PRONUNCIATION. 8. УРОК (Unit) УРОК (Unit) УРОК (Unit) УРОК (Unit) 4 80.

Advanced Grammar in Use

Modeling full form lexica for Arabic

Emmaus Lutheran School English Language Arts Curriculum

Constraining X-Bar: Theta Theory

Florida Reading Endorsement Alignment Matrix Competency 1

Dickinson ISD ELAR Year at a Glance 3rd Grade- 1st Nine Weeks

CS 598 Natural Language Processing

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

English for Life. B e g i n n e r. Lessons 1 4 Checklist Getting Started. Student s Book 3 Date. Workbook. MultiROM. Test 1 4

Procedia - Social and Behavioral Sciences 154 ( 2014 )

More Morphology. Problem Set #1 is up: it s due next Thursday (1/19) fieldwork component: Figure out how negation is expressed in your language.

CORPUS ANALYSIS CORPUS ANALYSIS QUANTITATIVE ANALYSIS

Underlying Representations

TABE 9&10. Revised 8/2013- with reference to College and Career Readiness Standards

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

Primary English Curriculum Framework

Lexical phonology. Marc van Oostendorp. December 6, Until now, we have presented phonological theory as if it is a monolithic

CHILDREN S POSSESSIVE STRUCTURES: A CASE STUDY 1. Andrew Radford and Joseph Galasso, University of Essex

Derivational: Inflectional: In a fit of rage the soldiers attacked them both that week, but lost the fight.

Parsing of part-of-speech tagged Assamese Texts

cambridge occasional papers in linguistics Volume 8, Article 3: 41 55, 2015 ISSN

Formulaic Language and Fluency: ESL Teaching Applications

Case government vs Case agreement: modelling Modern Greek case attraction phenomena in LFG

(3) Vocabulary insertion targets subtrees (4) The Superset Principle A vocabulary item A associated with the feature set F can replace a subtree X

Written by: YULI AMRIA (RRA1B210085) ABSTRACT. Key words: ability, possessive pronouns, and possessive adjectives INTRODUCTION

CELTA. Syllabus and Assessment Guidelines. Third Edition. University of Cambridge ESOL Examinations 1 Hills Road Cambridge CB1 2EU United Kingdom

The Internet as a Normative Corpus: Grammar Checking with a Search Engine

Today we examine the distribution of infinitival clauses, which can be

UKLO Round Advanced solutions and marking schemes. 6 The long and short of English verbs [15 marks]

An Interface between Prosodic Phonology and Syntax in Kurdish

INTRODUCTION TO MORPHOLOGY Mark C. Baker and Jonathan David Bobaljik. Rutgers and McGill. Draft 6 INFLECTION

SINGLE DOCUMENT AUTOMATIC TEXT SUMMARIZATION USING TERM FREQUENCY-INVERSE DOCUMENT FREQUENCY (TF-IDF)

Presentation Exercise: Chapter 32

Senior Stenographer / Senior Typist Series (including equivalent Secretary titles)

BANGLA TO ENGLISH TEXT CONVERSION USING OPENNLP TOOLS

The Acquisition of English Grammatical Morphemes: A Case of Iranian EFL Learners

Intensive English Program Southwest College

1. Introduction. 2. The OMBI database editor

2017 national curriculum tests. Key stage 1. English grammar, punctuation and spelling test mark schemes. Paper 1: spelling and Paper 2: questions

AN ANALYSIS OF GRAMMTICAL ERRORS MADE BY THE SECOND YEAR STUDENTS OF SMAN 5 PADANG IN WRITING PAST EXPERIENCES

Comprehension Recognize plot features of fairy tales, folk tales, fables, and myths.

Correspondence between the DRDP (2015) and the California Preschool Learning Foundations. Foundations (PLF) in Language and Literacy

Mandarin Lexical Tone Recognition: The Gating Paradigm

L1 and L2 acquisition. Holger Diessel

Constructing Parallel Corpus from Movie Subtitles

National University of Singapore Faculty of Arts and Social Sciences Centre for Language Studies Academic Year 2014/2015 Semester 2

Sample Goals and Benchmarks

On the Notion Determiner

Project in the framework of the AIM-WEST project Annotation of MWEs for translation

Mercer County Schools

a) analyse sentences, so you know what s going on and how to use that information to help you find the answer.

Heritage Korean Stage 6 Syllabus Preliminary and HSC Courses

Linking Task: Identifying authors and book titles in verbose queries

How to analyze visual narratives: A tutorial in Visual Narrative Grammar

THE VERB ARGUMENT BROWSER

Syntax Parsing 1. Grammars and parsing 2. Top-down and bottom-up parsing 3. Chart parsers 4. Bottom-up chart parsing 5. The Earley Algorithm

Universal Grammar 2. Universal Grammar 1. Forms and functions 1. Universal Grammar 3. Conceptual and surface structure of complex clauses

Phenomena of gender attraction in Polish *

National Literacy and Numeracy Framework for years 3/4

Transcription:

Endoclitics in Pashto: Can They Really Do That? Craig Kopris AppTek, Inc. 6867 Elm Street, Suite 300 McLean, VA 22101, USA kopris@apptek.com Abstract A cross-linguistically very rare type of clitic, the endoclitic, occurs in Pashto. Like infixes, endoclitics can be inserted inside of a word, but by splitting words apart into separate nonadjacent pieces which themselves might not have any meaning. Unlike infixes, however, endoclitics are not inflections; their meaning is unrelated to that of their host word. This paper discusses some of the problems endoclitics cause for processing Pashto, both written and spoken. 1 Introduction: What are Endoclitics? 1 Clitics have been defined in many ways, both phonologically and syntactically, often as semiindependent forms which attach to phrases rather than words. The technical details of different definitions are not relevant for this paper; here clitics can be described simply as a part of speech somewhere between affixes and particles, attached to hosts like affixes, yet at the same time independent words, like particles. An English example would be the possessive 's. Instead of attaching to a noun referring to the possessor, it actually attaches at the end of the whole possessor noun phrase. For example, the Queen of England's hat places 's at the end of England, not at the end of the possessor noun Queen. The two most common types of clitic across languages are enclitics, which attach at the end of their host (parallel to suffixes or postpositions), 1 Abbreviations: 1 first person, 2 second person, 3 third person, sg singular, pl plural, FUT future, NEG negative, PERF perfective, - morpheme boundary, = clitic boundary and proclitics, which attach at the beginning (parallel to prefixes or prepositions). Pashto has several proclitics, including! (PERF), "# (NEG), $% (1), %& (2), and %! (3), but it is the next type that is of interest here. The third type is the endoclitic, which attaches inside a word (similar to infixes). These do not simply get inserted within a word at a grammatical boundary, in which case they would simply be affixes, but rather they can split morphemes into separate chunks (called partials here). Part of a morpheme may end up in one partial while the rest of the morpheme may end up in another, potentially separated by multiple other words. In linguistic theory they are generally considered to be an impossibility, violating lexical integrity (Kopris and Davis, 2005). This theoretical impossibility may explain why the only languages claimed to have endoclitics are Pashto, Udi (Harris, 2002) and Degema (Kari, 2003). Instead of a theoretical discussion of how endoclitics can exist at all, the focus here will be on practical problems of encountering them in Pashto, especially in the written language. 2 Data Sources Data sources include online Pashto news from sources such as the BBC (www.bbc.co.uk/pashto), the VOA (www.voanews.com/pashto), Deutsche Welle (www.dw-world.de/dw/), and Pashtun sites such as Benawa (www.benawa.com/) and Tolafghan (www.tolafghan.com/), various publications, and materials produced in-house for corpus building and linguistic analysis, totaling around 1.8 million words. Online sources are from a mixture of dialects, while in-house materials are predominantly from the Western (Kandahari, Southern)

dialect, with substantial amounts of Eastern (Jalalabad, Northern) and to a lesser extent Southern (Khost, Central) dialects as well. Dialect differences can affect the membership and behavior of endoclitics in ways beyond the scope of this paper. For instance, for some Kandahari speakers at least the negative proclitic has some endoclitic properties. 3 Pashto Endoclitics Pashto endoclitics are of three types: pronominal, modal and adverbial (the latter are not fully endoclitic for some Kandahari speakers). type Pashto meaning!" 1sg pronominal #$ 2sg!% 3 &" 1pl, 2pl modal '( future, 'will' #$ 'must, should, let' adverbial &) 'indeed, but' &* 'then, so' Table 1. Pashto endoclitics When multiple endoclitics occur, they follow a strict internal ordering (Tegey, 1977): &*!% #$!" &" '( &) Although there are two different endoclitics with the shape #$, only one may appear at a time. The type of the endoclitic has no bearing on its ordering. The two adverbials are at opposite ends of the list, and the two modals are interspersed among the pronominals. 4 Second Position Pashto endoclitics prefer to be in second position in a sentence, with the caveat that "second position" may be defined in various ways. There are four different classes of verb that behave differently in the presence of endoclitics, especially in distinguishing imperfective and perfective forms: simple, derivative, A-initial, and doubly irregular. Simple verbs such as +,- 'beat' distinguish perfective from imperfective forms by the addition of the perfective proclitic, -. If a non-endoclitic pronoun like '., '3sg' comes first, the endoclitic (here!") will follow immediately: ',/--!" '., 0-0/- =-!" '., 3sg -beat =PERF 1sg 3sg Table 2. I beat him Note that the verb ',/-- is contiguous, and parsing is straightforward. If the simple pronoun '., is removed, the endoclitic must still be in second position. To accomplish this, it is inserted between the (stress-bearing) perfective proclitic - and the rest of the verb. ',/-!" - 0-0/-!" - 3sg -beat 1sg PERF Table 3. I beat him Note that now the perfective marker - is no longer attached to the verb, although the rest of the verb is still contiguous, and easily parsable. If even the perfective marker - is removed, resulting in imperfective aspect, the endoclitic will still be in second position. This time, the basic syntax rule that verbs are final will be violated, and the endoclitic will be last.!" ',/-!" 0-0/- 1sg 3sg -beat Table 4. I was beating him Although the unusual word order needs to be addressed, the verb is still contiguous and readily parsable. Derivative verbs (Tegey and Robson, 1996) incorporate a noun or adjective into an auxiliary in the imperfective, but split them apart in the perfective, creating a type of splitting verb. 0-1*/2-1" 0 --/ -3/2-1" 3sg -do -worse 1sg Table 5. I was making it worse In table 5, the imperfective of 4&*/2- 'make worse' incorporates the adjective 3/2- 'worse' into a

shortened form of the auxiliary!"# 'do', resulting in $%&'()%. *# +()% &, *# +()% &, do.perf.3sg worse 1sg Table 6. I made it worse In the perfective however, as in table 6, the adjective +()% is separated and there is a full auxiliary *#. Unlike simple verbs, there is also no perfective %. If the 1sg endoclitic -, is used in place of the corresponding simple pronoun &,, the state of incorporation due to the aspect is preserved. -, $%&'()% -, $ -%( -+()% 1sg 3sg -do -worse Table 7. I was making it worse, endoclitic *# -, +()% *# -, +()% do.perf.3sg 1sg worse Table 8. I made it worse, endoclitic In the imperfective (table 7), the endoclitic takes second position after the verb (which incorporates the adjective), violating basic word order, while in the perfective (table 8) the endoclitic appears after the non-incorporated adjective. In terms of parsing, derivative verbs pose no particular problems, as long as incorporation in the imperfective can be handled. A-initial verbs (Tegey, 1977) are also a type of splitting verb, but not in a semantically or morphologically natural manner. In the presence of an endoclitic, the initial ( of these verbs can split off from the rest of the root. As with simple verbs, A- initial verbs also take % in the perfective../012( &,./012( &, buy.3sg 1sg Table 9. I was buying them./012(% &,./012( =% &, buy.3sg PERF 1sg Table 10. I bought them Note in tables 9 and 10 that the imperfective and perfective forms are parallel to those of simple verbs. However, when an endoclitic is added, unexpected changes occur. -,./012( -,./012( 1sg buy.3sg Table 11. I was buying them, endoclitic final./012 -, (./012 -, ( buy?.3sg 1sg? Table 12. I was buying them, endoclitic medial./012 -, (%./012 -, ( =% buy?.3sg 1sg? PERF Table 13. I bought them, endoclitic The underlining in tables 11 through 13 indicates the stressed syllable. In the imperfective, if the final syllable of the verb is stressed, the endoclitic assumes second position after the verb (table 11). However, if the first syllable is stressed, the endoclitic again appears after it, but by forcing that syllable to separate from the rest of the verb (table 12). The initial (, which is not a meaningful prefix, stands on its own. This causes problems for parsing, in that two meaningless strings from different positions in the sentence must be identified as parts of one whole. In the perfective (table 13), the marker % pulls the ( so that both form a new single initial string, (%. This pull even occurs when an endoclitic can appear second without causing a split../012 3' (% -, 345./012 3' ( =% -, 345 buy?.3sg NEG? PERF 1sg 3sg Table 14. Further A-initial split Although in table 14 there is a simple pronoun in first position, allowing the endoclitic -, to be second without affecting the verb, the ( of the verb is still pulled away from the rest of the verb to attach to the perfective proclitic, leaving the negative proclitic to intervene. There is an additional change in pronunciation, in that the vowel of the perfective [6] and the vowel of the verb [a] merge into a new vowel [7]. Parsing written text is not affected by

the pronunciation change, but speech recognition is. Doubly irregular verbs, as called by Tegey and Robson (1996), are like derivative verbs in that they do not take! in the perfective, and like A- initial verbs in that the first part of a root can be split off (even though not "). Unlike the other categories, these verbs use a stress shift to indicate perfective aspect. Compare the verb #$%& 'you take' in table 15 (infinitive '%(&) with the sentence #$ )* +, )& %& 'you won't take me' in table 16. #$%& # -$%& 2sg -take Table 15. you take #$ )* +, )& %& # -$ )* +, )& %& 2sg -take? NEG 1sg FUT take? Table 16. you won't take me Note that the root -$%& 'take' is split into two separate partials, not at a morpheme boundary but at a syllable boundary. Not only are they split apart, but three other words occur between them, the endoclitics )& and +,, and the negative proclitic )*. It is especially important to indicate that the %& partial has no meaning of its own, nor does the remaining $ of the root. This makes Pashto endoclitics distinct from potentially similar phenomena from better known languages, such as English verb particles (look at) or German separable prefixes (anschauen). Of course, it also renders parsing of the verb difficult. Although often the tokens intervening between the partials are only a small set of particles, in extreme cases an entire clause can be wrapped between two partials, as in 2%3, )* 45!"#$ -./ +0 )& )1 'even your father won't pin him', where more than an entire noun phrase intervenes. 2%3, )* 45!"#$ -./ +0 )& )1 2 %3, )* 45!"#$ -./ +0 )& )1 3sg -pin? pin? Table 17. Even your father won't pin him Between the two partials of the verb, )1 and 2%3,, come a pair of endoclitics, future )& and 3rd person +0 (which cause the split into partials), the noun phrase 6 "#$ -./ 'your father', emphatic 45, and the negative proclitic )*. 5 Tokenization and Segmentation From one perspective, tokenization (finding sentence and word boundaries) is not affected by the presence of endoclitics. They normally are set off by white space in writing, and so are easily identified as individual strings, with the caveat that due to the nature of Pashto script in using both connecting and non-connecting letters, endoclitics ending in a non-connecting! may be written without a space character, relying on the reader to see the non-connection as space. From another perspective, however, they are more difficult in that they create problems for segmentation (morphology, finding roots) by the creation of non-word strings (partials). Using lexical look-up to determine if a string is a word will fail because the word partials created by endoclitic insertion will not normally be in the lexicon, and those found in the lexicon will be homographs. The %& of table 16 is a homograph of a female name, and the )1 of table 17 is a homograph of a word meaning 'some'. Simply applying morphology is not effective because a word is split into separate words, rather than affixes being added. Segmenting #$ from table 16 might find a substring corresponding to the second person singular suffix, #, but the remaining $ cannot be used for finding the verb '%7& in the lexicon (despite the morphology operations already required to recognize irregular -$%& as '%7&). Treating the partials as a simple compound, like English blackbird, is also not effective, since the partials have no meaning to be compounded, in addition to the same morphology problems as before. Another problem sometimes appears due to the nature of the Pashto writing system. Since it is a variant of Arabic script, many vowels are unwritten, especially word internally. At the ends of words, where suffixes for person, number, gender, case, tense and aspect are found, attempts are made to indicate otherwise unwritten vowels. When an endoclitic splits a word, it is possible that a vowel which is unwritten in the whole word becomes written at the end of the first partial. Compare these two variants of the doubly irregular (stress-

shifting) verb!"#$% 'knock down', one with an endoclitic and one without: without endoclitic!"#$% &' with endoclitic!"#' (' )% Table 18. I knocked them down Note that in the first example there is no vowel indicated between the consonants * and +. However, in the second, where the endoclitic (' has split the verb into two partials (after the stressed syllable), the first partial now ends in the vowel letter,. Whether treated as simple morphology or as compounding, the extra letter in the partial must be taken into account. Fortunately, that letter is usually (perhaps always),. Of course, since this change only applies to writing, speech recognition would not need to address the "new" vowel. Segmenting partials as unfound strings can be successful, as long as there are methods in the following parsing stage to recover the words that have been split. 6 How to Parse Them? Assuming a satisfactory stage of tokenization and segmentation, one possible approach to parsing the verb partials resulting from Pashto endoclitics is to treat them as discontinuous strings. Reuniting the partials while undoing potential spelling changes is straightforward, as long as the partials can be identified as such. Section 5 suggested that morphology alone will fail, and that is because it cannot deal with multiple word tokens at one time. However, if partials can be identified as such, rather than say as unknown proper names, then there is the opportunity to put them back together. The problem then is how to identify what to put together? How to know that unfound strings are partials rather than other unknowns? The key is the occurrence of one or more endoclitics. If no endoclitics are found, than unfound strings cannot be partials, and must be treated in the normal manner (e.g. as proper names). If endoclitics are found, then unfound strings have the potential to be partials, especially if one of the unfounds is at the end of the sentence and the other unfound is before the endoclitics. The likelihood is increased if the unfound preceding the endoclitic(s) is short, particularly only a single syllable. Short recognized strings preceding the endoclitics might also be in fact partials, with only homographs recognized, if the string in the verb position is unfound. If unfounds fulfill these requirements, then they can be tested as partials. If there are two unfounds, they need to be merged in order and then tested with standard morphological processing, including testing both a string with all characters of the partials and a string with the last, of the first partial dropped. If only the final word is unfound, then it needs to be tested the same way, but with an otherwise recognized string positioned before the endoclitics as the potential first partial. Returning to table 16, the string -. will be unfound, while the string "/ will be recognized as a proper name, 'Bow'. Between them, the parser will recognize two endoclitics, )/ and ('. The existence of the endoclitics and a final unfound string can then trigger the merging of that unfound -. with the short string "/ preceding the endoclitics (even though already recognized as a name). Applying morphology to the resulting -."/ allows the unfound to be segmented as an inflected verb. If transitivity information is included in the lexical entry, the resulting sentence will be syntactically sound as the removal of the proper name will reduce the number of arguments to two, matching the transitivity of the verb. Fortunately for text analysis, and unfortunately for speech recognition, creating partials through the use of endoclitics is more common in spoken than written Pashto. Formal written text has a low frequency of partials, while speech has a higher frequency. On the other hand, speech recognition does not need to address spelling changes of certain partials, except in so far as transcribing them directly. The fact that partials are less frequent in writing means that speakers can find ways around using them. This raises another possibility, that of converting sentences with partials into equivalents without. One way is to avoid using endoclitics, and the other is rephrasing such that the verb is not split into partials. with endoclitic 12345 (' 0 without endoclitic 123450 &' Table 19. I was buying them In table 19, the 1sg endoclitic (' is replaced by the simple 1sg pronoun &'. Because there is no endoclitic, there are no partials, and the verb becomes contiguous. Where the first example has

contiguous. Where the first example has two partials,! and "#$%&, the second just has the complete verb "#$%&!. Habibullah Tegey and Barbara Robson. 1996. A Reference Grammar of Pashto. Center for Applied Linguistics, Washington, DC. with partials "#$%& '(! without partials "#$%&! '( )*+ Table 20. I was buying them Table 20 conversely shows a rearrangement due to the addition of another pronoun, )*+, initially. This allows the endoclitic to appear in second position without needing to split the verb. Although these methods are the ones presumably used by speakers in avoiding generating partials, attempting to use them in parsing existing sentences runs into the same basic problem as before: how to identify partials and merge them back together. Rearrangement or alternate choice of pronoun in an existing sentence does not touch the partials in written text, only the minds of the speakers. 7 Conclusion Endoclitics are cross-linguistically an exceedingly rare phenomenon, but they exist in Pashto and when encountered must still be parsed. Although no single specific solution has been provided in this paper, various workable approaches have been presented involving recognizing unfound strings (especially single syllables) in the presence of endoclitics as potential partials, allowing them to be remerged for lexical lookup. As endoclitics exist on the boundary of morphology and syntax, the parsing of endoclitics must also involve both morphology and syntax. References Alice C. Harris. 2002. Endoclitics and the Origins of Udi Morphosyntax. Oxford University Press, Oxford. Ethelbert Emmanuel Kari. 2003. Clitics in Degema: A Meeting Point of Phonology, Morphology and Syntax. Research Institute for Languages and Cultures of Asia and Africa, Tokyo. Craig A. Kopris and Anthony R. Davis. 2005. Endoclitics in Pashto: Implications for Lexical Integrity. Presented at the Fifth Mediterranean Morphology Meeting, Sept. 15-18, 2005, Fréjus, France. Habibullah Tegey. 1977. The Grammar of Clitics: Evidence from Pashto and Other Languages. International Center for Pashto Studies, Kabul.