Morphotactics as Tier-Based Strictly Local Dependencies

Similar documents
Language properties and Grammar of Parallel and Series Parallel Languages

Informatics 2A: Language Complexity and the. Inf2A: Chomsky Hierarchy

LING 329 : MORPHOLOGY

Derivational and Inflectional Morphemes in Pak-Pak Language

A General Class of Noncontext Free Grammars Generating Context Free Languages

Parallel Evaluation in Stratal OT * Adam Baker University of Arizona

More Morphology. Problem Set #1 is up: it s due next Thursday (1/19) fieldwork component: Figure out how negation is expressed in your language.

Linguistics 220 Phonology: distributions and the concept of the phoneme. John Alderete, Simon Fraser University

RANKING AND UNRANKING LEFT SZILARD LANGUAGES. Erkki Mäkinen DEPARTMENT OF COMPUTER SCIENCE UNIVERSITY OF TAMPERE REPORT A ER E P S I M S

Basic concepts: words and morphemes. LING 481 Winter 2011

Linguistics. Undergraduate. Departmental Honors. Graduate. Faculty. Linguistics 1

Control and Boundedness

Using a Native Language Reference Grammar as a Language Learning Tool

Proof Theory for Syntacticians

DOWNSTEP IN SUPYIRE* Robert Carlson Societe Internationale de Linguistique, Mali

Rhythm-typology revisited.

Introduction to HPSG. Introduction. Historical Overview. The HPSG architecture. Signature. Linguistic Objects. Descriptions.

"f TOPIC =T COMP COMP... OBJ

A R "! I,,, !~ii ii! A ow ' r.-ii ' i ' JA' V5, 9. MiN, ;

Double Double, Morphology and Trouble: Looking into Reduplication in Indonesian

Approved Foreign Language Courses

The Acquisition of Person and Number Morphology Within the Verbal Domain in Early Greek

CS 598 Natural Language Processing

COMPUTATIONAL COMPLEXITY OF LEFT-ASSOCIATIVE GRAMMAR

Out of the heart springs life

ROSETTA STONE PRODUCT OVERVIEW

Improved Effects of Word-Retrieval Treatments Subsequent to Addition of the Orthographic Form

AGS THE GREAT REVIEW GAME FOR PRE-ALGEBRA (CD) CORRELATED TO CALIFORNIA CONTENT STANDARDS

A Version Space Approach to Learning Context-free Grammars

Books Effective Literacy Y5-8 Learning Through Talk Y4-8 Switch onto Spelling Spelling Under Scrutiny

First Grade Curriculum Highlights: In alignment with the Common Core Standards

1/20 idea. We ll spend an extra hour on 1/21. based on assigned readings. so you ll be ready to discuss them in class

Hindi Aspectual Verb Complexes

Parsing of part-of-speech tagged Assamese Texts

The Effect of Extensive Reading on Developing the Grammatical. Accuracy of the EFL Freshmen at Al Al-Bayt University

have to be modeled) or isolated words. Output of the system is a grapheme-tophoneme conversion system which takes as its input the spelling of words,

The analysis starts with the phonetic vowel and consonant charts based on the dataset:

MTH 141 Calculus 1 Syllabus Spring 2017

Probability and Game Theory Course Syllabus

Erkki Mäkinen State change languages as homomorphic images of Szilard languages

Indeterminacy by Underspecification Mary Dalrymple (Oxford), Tracy Holloway King (PARC) and Louisa Sadler (Essex) (9) was: ( case) = nom ( case) = acc

Underlying Representations

1 st Quarter (September, October, November) August/September Strand Topic Standard Notes Reading for Literature

UH STEM Pathways Project

Grade 4. Common Core Adoption Process. (Unpacked Standards)

A Minimalist Approach to Code-Switching. In the field of linguistics, the topic of bilingualism is a broad one. There are many

The D2L eportfolio for Teacher Candidates

Reading Horizons. A Look At Linguistic Readers. Nicholas P. Criscuolo APRIL Volume 10, Issue Article 5

From Empire to Twenty-First Century Britain: Economic and Political Development of Great Britain in the 19th and 20th Centuries 5HD391

The Structure of Relative Clauses in Maay Maay By Elly Zimmer

California Department of Education English Language Development Standards for Grade 8

ENGBG1 ENGBL1 Campus Linguistics. Meeting 2. Chapter 7 (Morphology) and chapter 9 (Syntax) Pia Sundqvist

SAM - Sensors, Actuators and Microcontrollers in Mobile Robots

Cross Language Information Retrieval

AGENDA LEARNING THEORIES LEARNING THEORIES. Advanced Learning Theories 2/22/2016

What the National Curriculum requires in reading at Y5 and Y6

CLASSIFICATION OF PROGRAM Critical Elements Analysis 1. High Priority Items Phonemic Awareness Instruction

Approaches to control phenomena handout Obligatory control and morphological case: Icelandic and Basque

LINGUISTICS. Learning Outcomes (Graduate) Learning Outcomes (Undergraduate) Graduate Programs in Linguistics. Bachelor of Arts in Linguistics

Listening and Speaking Skills of English Language of Adolescents of Government and Private Schools

CONCEPT MAPS AS A DEVICE FOR LEARNING DATABASE CONCEPTS

Classroom Connections Examining the Intersection of the Standards for Mathematical Content and the Standards for Mathematical Practice

Detecting English-French Cognates Using Orthographic Edit Distance

French Dictionary: 1000 French Words Illustrated By Evelyn Goldsmith

Adler Graduate School

Timeline. Recommendations

A Neural Network GUI Tested on Text-To-Phoneme Mapping

Basic German: CD/Book Package (LL(R) Complete Basic Courses) By Living Language

Backwards Numbers: A Study of Place Value. Catherine Perez

The Effect of Discourse Markers on the Speaking Production of EFL Students. Iman Moradimanesh

Publisher Citations. Program Description. Primary Supporting Y N Universal Access: Teacher s Editions Adjust on the Fly all grades:

Beyond constructions:

Undergraduate Programs INTERNATIONAL LANGUAGE STUDIES. BA: Spanish Studies 33. BA: Language for International Trade 50

Whole School Literacy Policy 2017/18

The D2L eportfolio for Teacher Candidates

Visual CP Representation of Knowledge

Oakland Unified School District English/ Language Arts Course Syllabus

STA 225: Introductory Statistics (CT)

Phonological and Phonetic Representations: The Case of Neutralization

Possessive have and (have) got in New Zealand English Heidi Quinn, University of Canterbury, New Zealand

Evolution of Collective Commitment during Teamwork

Large Kindergarten Centers Icons

English Comprehension Question For Grade 7

THE HEAD START CHILD OUTCOMES FRAMEWORK

IT Students Workshop within Strategic Partnership of Leibniz University and Peter the Great St. Petersburg Polytechnic University

Laporan Penelitian Unggulan Prodi

Lecture 1: Machine Learning Basics

AF~-SUttA~ :tc.a~ v~ t~* Salah Alnajem. Abstract. Department of Arabic, College of Arts Kuwait University

CDE: 1st Grade Reading, Writing, and Communicating Page 2 of 27

Coast Academies Writing Framework Step 4. 1 of 7

Factoring - Grouping

Constraining X-Bar: Theta Theory

Noun incorporation in Sora: A case for incorporation as morphological merger TLS: 19 February Introduction.

Lexical phonology. Marc van Oostendorp. December 6, Until now, we have presented phonological theory as if it is a monolithic

TabletClass Math Geometry Course Guidebook

Lexical specification of tone in North Germanic

This Performance Standards include four major components. They are

A. True B. False INVENTORY OF PROCESSES IN COLLEGE COMPOSITION

Oakland Unified School District English/ Language Arts Course Syllabus

LNGT0101 Introduction to Linguistics

Transcription:

Morphotactics as Tier-Based Strictly Local Dependencies Alëna Aksënova, Thomas Graf, and Sedigheh Moradi Stony Brook University SIGMORPHON 14 Berlin, Germany 11. August 2016

Our goal Received view Recent research Phonology regular Kaplan&Kay (1994) subregular Heinz (2015) Morphology regular Beesley&Karttunen (2003)?

Our goal Received view Recent research Phonology regular Kaplan&Kay (1994) subregular Heinz (2015) Morphology regular Beesley&Karttunen (2003)? Show that morphotactics is subregular More precisely: Tier-Based Strictly Local Consequences parallels to phonology learnable in the limit from positive text explain typological gaps

Outline 1 Preliminaries 2 SL Patterns In Morphology 3 Tier-Based Strictly Local TSL is necessary TSL is sufficient 4 Typological Gaps

Morphotactics Definition (Morphotactics) Restrictions on the linear ordering of morphemes. Our focus: morphotactics in underlying representations (English) OK STEM-PL PL-STEM allomorphy (dogs, peaches) is not considered yet

Computational nature of morphotactics Phonology Morphology Received view regular regular Recent research subregular? Advantages of (some) subregular languages: resolves learnability issues describes potential cognitive mechanisms uses less powerful generating device

Subregular Phonology and Morphology not all languages exploit full power of finite-state machinery subregular hierarchy Subregular hierarchy Regular Star-Free LTT TSL LT PT SL SP

Subregular Phonology and Morphology not all languages exploit full power of finite-state machinery subregular hierarchy Strong Subregular Hypothesis All phonological dependencies are strictly local (SL) tier-based strictly local (TSL) strictly piecewise (SP) Subregular hierarchy Regular Star-Free LTT TSL LT PT SL SP

Subregular Phonology and Morphology not all languages exploit full power of finite-state machinery subregular hierarchy Strong Subregular Hypothesis All phonological dependencies are strictly local (SL) tier-based strictly local (TSL) strictly piecewise (SP) Subregular hierarchy Regular Star-Free LTT Subregular Morphotactics All morphotactic dependencies are TSL LT PT strictly local (SL) tier-based strictly local (TSL) SL SP

Strictly Local languages SL and TSL are generated by k-gram models. A k-gram model is a finite set of blocked k-grams.

Strictly Local languages SL and TSL are generated by k-gram models. A k-gram model is a finite set of blocked k-grams. Example (Strictly Local Grammar for (ab) a) Σ = {a, b} Grammar = { b, bb, aa, b, } Accepted strings: a, aba, ababa, etc. Rejected strings: ab, ba, abba, etc.

Strictly Local languages SL and TSL are generated by k-gram models. A k-gram model is a finite set of blocked k-grams. Example (Strictly Local Grammar for (ab) a) Σ = {a, b} Grammar = { b, bb, aa, b, } Accepted strings: a, aba, ababa, etc. Rejected strings: ab, ba, abba, etc.

Strictly Local languages SL and TSL are generated by k-gram models. A k-gram model is a finite set of blocked k-grams. Example (Strictly Local Grammar for (ab) a) Σ = {a, b} Grammar = { b, bb, aa, b, } Accepted strings: a, aba, ababa, etc. Rejected strings: ab, ba, abba, etc. Definition Strictly k-local (k-sl) grammar consists of a set of blocked k-grams over an alphabet Σ.

Tier-Based Strictly Local languages Example (Tier-Based Strictly Local Grammar for c (ac bc ) ac ) Σ = {a, b, c} Grammar: G(a,b tier) = { b, bb, aa, b, } Accepted strings: a, accba, cacbaccccba, etc.

Tier-Based Strictly Local languages Example (Tier-Based Strictly Local Grammar for c (ac bc ) ac ) Σ = {a, b, c} Grammar: G(a,b tier) = { b, bb, aa, b, } Accepted strings: a, accba, cacbaccccba, etc. a,b tier: a aba ababa

Tier-Based Strictly Local languages Example (Tier-Based Strictly Local Grammar for c (ac bc ) ac ) Σ = {a, b, c} Grammar: G(a,b tier) = { b, bb, aa, b, } Accepted strings: a, accba, cacbaccccba, etc. a,b tier: a aba ababa Rejected strings: accccaba, abcccacccbc, etc.

Tier-Based Strictly Local languages Example (Tier-Based Strictly Local Grammar for c (ac bc ) ac ) Σ = {a, b, c} Grammar: G(a,b tier) = { b, bb, aa, b, } Accepted strings: a, accba, cacbaccccba, etc. a,b tier: a aba ababa Rejected strings: accccaba, abcccacccbc, etc. a,b tier: aaba abab

Tier-Based Strictly Local languages Example (Tier-Based Strictly Local Grammar for c (ac bc ) ac ) Σ = {a, b, c} Grammar: G(a,b tier) = { b, bb, aa, b, } Accepted strings: a, accba, cacbaccccba, etc. a,b tier: a aba ababa Rejected strings: accccaba, abcccacccbc, etc. a,b tier: aaba abab Definition A Tier-Based Strictly k-local grammar is a k-sl grammar that operates over a tier, a specific substructure of the string.

Learnability Learning of SL and TSL learning memorizing finite number of k-grams + tier induction learnable in the limit from positive text Jardine & Heinz (2016)

Mappings we use General assumption: we assume stem not to be bound in length: There is no limit on the length of the stem in languages. The stem can be result of the compounding. whiteboard, whiteboard marker, whiteboard marker cleaning fluid, whiteboard marker cleaning fluid purchase receipt Mapping of the stem to a single symbol will result in insensibility to compounds.

Mappings we use General assumption: we assume stem not to be bound in length: There is no limit on the length of the stem in languages. The stem can be result of the compounding. whiteboard, whiteboard marker, whiteboard marker cleaning fluid, whiteboard marker cleaning fluid purchase receipt Mapping of the stem to a single symbol will result in insensibility to compounds. Affixes: affix-to-symbol mapping Stems: symbol-to-symbol mapping

Strictly Local Morphology: affixation Example (prefix za-, Russian) exat go, drive xxxx zaexat call on the way axxxx Bigram xa ensures that za is a prefix.

Strictly Local Morphology: affixation Example (prefix za-, Russian) exat go, drive xxxx zaexat call on the way axxxx Bigram xa ensures that za is a prefix. Example (suffix -s, English) dog xxx dogs xxxb Bigram bx ensures that s is a suffix.

Strictly Local Morphology: affixation [cont.] Example (affixation, English) lock xxxx unlockable axxxxb blacklist xxxxxxxxx unblacklistable axxxxxxxxxb

Strictly Local Morphology: affixation [cont.] Example (affixation, English) lock xxxx unlockable axxxxb blacklist xxxxxxxxx unblacklistable axxxxxxxxxb SLG = { b, ba, bx, xa, a } This grammar necessarily generates the following forms of English, too: axxxxx and xxxxxb.

Strictly Local Morphology: affixation [cont.] Example (affixation, English) lock xxxx unlockable axxxxb blacklist xxxxxxxxx unblacklistable axxxxxxxxxb SLG = { b, ba, bx, xa, a } This grammar necessarily generates the following forms of English, too: axxxxx and xxxxxb. Indeed, this prediction is correct: Example (affixation, English) unleash axxxxx breakable xxxxxb

SL is not enough: Indonesian circumfixation English un-...-able are prefix and suffix that can co-occur. However, two parts of a circumfix cannot occur independently: Consider the following example from Indonesian: Example (circumfix ke-an, Indonesian) tinggi high xxxxxx ketinggian altitude axxxxxxb *axxxxxx mahasiswa big pupil (student) xxxxxxxxx kemahasiswaan student affairs axxxxxxxxxb *xxxxxxxb

SL is not enough: Indonesian circumfixation [cont.] Example (circumfix ke-an, Indonesian) tinggi xxxxxx ketinggian axxxxxxb *axxxxxx mahasiswa xxxxxxxxx kemahasiswaan axxxxxxxxxb *xxxxxxxb SLG = { b, ba, bx, xa, a } String language = xxxx, axxxxxb, axxxx, xxb...

SL is not enough: Indonesian circumfixation [cont.] Example (circumfix ke-an, Indonesian) tinggi xxxxxx ketinggian axxxxxxb *axxxxxx mahasiswa xxxxxxxxx kemahasiswaan axxxxxxxxxb *xxxxxxxb SLG = { b, ba, bx, xa, a } String language = xxxx, axxxxxb, axxxx, xxb... Problem: SL languages can only capture local dependencies Circumfixes introduce non-local ones

Morphotactics is TSL Example (circumfix ke-an, Indonesian) tinggi xxxxxx ketinggian axxxxxxb *axxxxxx mahasiswa xxxxxxxxx kemahasiswaan axxxxxxxxxb *xxxxxxxb TSLG(circumfix tier) = { b, ba, a }

Morphotactics is TSL Example (circumfix ke-an, Indonesian) tinggi xxxxxx ketinggian axxxxxxb *axxxxxx mahasiswa xxxxxxxxx kemahasiswaan axxxxxxxxxb *xxxxxxxb TSLG(circumfix tier) = { b, ba, a } Licit strings: Illicit strings: xxxxxx axxxxx a axxxxxxb ab bxxxxa ba

Morphotactics is TSL Example (circumfix ka-an, Ilocano) In Ilocano, it is impossible to do embedded circumfixation: bigát morning xxxxx kabigátan the next morning axxxxxb *aaxxxxxbb

Morphotactics is TSL [cont.] Example (circumfix ka-an, Ilocano) bigát xxxxx kabigátan axxxxxb *aaxxxxxbb TSLG(circumfix tier) = { b, ba, a, aa, bb}

Morphotactics is TSL [cont.] Example (circumfix ka-an, Ilocano) bigát xxxxx kabigátan axxxxxb *aaxxxxxbb TSLG(circumfix tier) = { b, ba, a, aa, bb} Licit strings: Illicit strings: xxxxxx aaxxxxxbb aabb axxxxxxb ab bxxxxa ba

Interim Summary SL enforces local dependencies TSL enforces local dependencies on the determined tier Most of morphotactics is SL, some of it is TSL Learning of TSL languages is possible from positive data only Can morphotactics be more than TSL?

Can morphotactics be more than TSL? concatenation complement union PT intersection SP Regular Star-Free LTT LT SL closure properties inheritance TSL

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English.

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English.

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English. Closure under union: If a Mandaresian word violates rules of Mandarin Chinese, it must obey the rules of Indonesian.

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English. Closure under union: If a Mandaresian word violates rules of Mandarin Chinese, it must obey the rules of Indonesian.

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English. Closure under union: If a Mandaresian word violates rules of Mandarin Chinese, it must obey the rules of Indonesian. Closure under relative complement: Hsilgne contains all words that are ill-formed in English.

Can morphotactics be more than TSL? Closure under concatenation: Frenglish contains only words whose first part is a word of French and the second a word of English. Closure under union: If a Mandaresian word violates rules of Mandarin Chinese, it must obey the rules of Indonesian. Closure under relative complement: Hsilgne contains all words that are ill-formed in English.

Can morphotactics be more than TSL? Closure under intersection: Russenorsk is created by combination of elements of Russian and Norwegian.

Can morphotactics be more than TSL? Closure under intersection: Russenorsk is created by combination of elements of Russian and Norwegian. (spoken in Northern Norway, 18th-19th centuries)

Can morphotactics be more than TSL? Closure under intersection: Russenorsk is created by combination of elements of Russian and Norwegian. (spoken in Northern Norway, 18th-19th centuries) Example (Closure under intersection) A language allows complex nuclei and blocks codas (Supyire) A language forbids complex nuclei and allows codas (Russian)

Can morphotactics be more than TSL? Closure under intersection: Russenorsk is created by combination of elements of Russian and Norwegian. (spoken in Northern Norway, 18th-19th centuries) Example (Closure under intersection) A language allows complex nuclei and blocks codas (Supyire) A language forbids complex nuclei and allows codas (Russian) Then there will be a language that blocks complex nuclei and codas (Hawaiian, Senufo)

Can morphotactics be more than TSL? Closure under concatenation Closure under union Closure under relative complement Closure under intersection

Can morphotactics be more than TSL? Closure under concatenation Closure under union Closure under relative complement Closure under intersection concatenation complement union PT intersection SP Regular Star-Free LTT LT SL closure properties inheritance TSL

Typological gaps Basic Logic of Argument All attested morphotactic patterns must be TSL. So if pattern A is TSL, and pattern B is TSL, but their combination A+B is not, we get a typological gap. Some predicted gaps: No embedded circumfixation; No cases when amount of affixes A depends on the amount of affixes B; In general, no a n b n pattern and its derivatives.

Typological gap I: Impossible compounding Russian pattern (stem-o)*-stem Example (compounding, Russian) vodovoz water carrier xxxoxxx vodovozovoz carrier of water carriers xxxoxxxoxxx

Typological gap I: Impossible compounding Russian pattern (stem-o)*-stem Example (compounding, Russian) vodovoz water carrier xxxoxxx vodovozovoz carrier of water carriers xxxoxxxoxxx Turkish pattern stem-(stem + -o) Example (compounding, Turkish) bahçe kapi-si garden gate xxxxxxxxxo türk kahve-si Turkish coffee xxxxxxxxxo türk bahçe kapi-si Turkish garden gate xxxxxxxxxxxxxo *türk bahçe kapi-si-si *xxxxxxxxxxxxxoo

Typological gap I: Impossible compounding Russian pattern (stem-o)*-stem Turkish pattern stem-(stem + -o) Turkussian pattern: amount of compound markers is equal to the amount of added stems, stem-(stem n -o n )

Typological gap I: Impossible compounding Russian pattern (stem-o)*-stem Turkish pattern stem-(stem + -o) Turkussian pattern: amount of compound markers is equal to the amount of added stems, stem-(stem n -o n ) This pattern is not regular because it has infinite number of good continuations. (Myhill-Nerode theorem) It appears to be non-existent.

Typological gap II: Recurrent affixation Sometimes languages allow some affixes to be iterated: a*-stem. Consider example of such pattern in German: Example (prefix über, German) morgen tomorrow xxxxxx übermorgen the day after tomorrow axxxxxx überübermorgen the day after the day after tomorrow aaxxxxxx

Typological gap II: Recurrent affixation German pattern: a*-stem. The same meaning can be expressed in another language differently, consider Ilocano (Austronesian) temporal circumfix ka-...-an next. Example (circumfix ka-an, Ilocano) bigát morning xxxxx kabigátan the next morning axxxxxb

Typological gap II: Recurrent affixation German pattern: a*-stem. The same meaning can be expressed in another language differently, consider Ilocano (Austronesian) temporal circumfix ka-...-an next. Example (circumfix ka-an, Ilocano) bigát morning xxxxx kabigátan the next morning axxxxxb However, word kakabigátanan doesn t appear to be possible word in Ilocano: a n -stem-b n pattern is not regular.

Conclusion Regular Morphotactics is at most Tier-Based Strictly Local Star-Free Positive data is enough for morphological learning LTT Set of typological gaps can be explained due to the subregular nature of morphology TSL LT PT Same formal tools can be used for morphology and phonology SL SP

Future work Try to find SP patterns in morphotactics Look at more typologically diverse languages Extend to mappings from underlying to surface forms Work with representations of internal structure The elephant in the room: reduplication

References Thank you!

References References I Beesley, Kenneth R. and Lauri Karttunen (2003) Finite State Morphology. CSLI Publications. Chandlee, Jane (2014) Strictly Local Phonological Processes. PhD Thesis, University of Delaware Chandlee, Jane, Rémi Eyraud and Jeffrey Heinz (2014) Learning Strictly Local Subsequential Functions. Transactions of the Association for Computational Linguistics 2, 491 503. Galvez Rubino, Carl R. (1998) Ilocano: Ilocano-English, English-Ilocano: Dictionary and Phrasebook. Hippocrene Books Inc., U.S. Heinz, Jeffrey (2015) The Computational Nature of Phonological Generalizations. Ms., University of Delaware Heinz, Jeffrey, Chetan Rawal and Herbert G. Tanner (2011) Tier-Based Strictly Local Constraints in Phonology. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, 58 64. Jardine, Adam (2015) Computationally, Tone is Different. Ms., University of Delaware

References References II Jardine, Adam and Jeffrey Heinz (2016) Learning Tier-based Strictly 2-Local Languages. Transactions of the Association for Computational Linguistics 4, 87 98. Jurafsky, Daniel and James H. Martin. (2009) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition. Upper Saddle River, N.J. : Pearson Prentice Hall. Kaplan, Ronald M. and Martin Kay (1994) Regular Models of Phonological Rule Systems. Computational Linguistics 20(3), 331 378. Mahdi, Waruno (2012) Distinguishing Cognate Homonyms in Indonesian. Oceanic Linguistics 51(2), 402 449. Rogers, James and Geoffrey Pullum (2007) Aural Pattern Recognition Experiments and the Subregular Hierarchy. Mathematics of Language 10, 1 16. Sneddon, James Neil (1996) Indonesian Comprehensive Grammar. Routledge, London and New York. Stump, Greg (2016) Rule composition in an adequate theory of morphotactics. Manuscript.