Copyright (c) 2004 Patricia J. Donegan and David Stampe The files here are part of the Munda Lexical Archive. Munda Lexical Files are neither complete nor error-free. But they may be copied, corrected, augmented, translated, transliterated, re-arranged, redistributed, or republished in any form, provided that such derivative works explicitly impose exactly these same conditions, to guarantee their unrestricted use by speakers and students of the languages they describe, in perpetuity. ------------------------------------------------------------------------- Some notes on the coding and format of the files The working alphabetization of entries is language-specific but there are some general traits -- all use Roman order, with aspirates merged with nonaspirates, checked (unreleased glottalized devoiced) stops and their reflexes -- sometimes 'y or y' for checked j) merged with plain, retroflexes mostly merged with nonretroflexes, flaps with homorganic stops (even in languages like Sora where they aren't phonologically related, simply because some of the sources couldn't distinguish them reliably), long vowels with short. Nasal vowels may be merged with nonnasal or V~C with VNC (NC homorganic). In Sora the order is A (schwa) a b D (if not spelled d) R E (epsilon) e g i y + (barred-i) j k l m n J G O p r s t u o. That will some day change but for now it tends to minimize separation of entries with wrongly transcribed or uncertain vowels etc. Entries are one line of the following form:
(SRC) [PH] {PS} ``DEF WITH ^GLOSS ^WORDS''. (OTHER.) Subentries are indented two spaces per level, i.e. subsubentries are indented four spaces, etc. LG ABBREVS (only in etymological fields, and the etymological files): SOUTH MU: Kh[aria], Ju[ang], So[ra], Go[rum], Gu[tob], Re[mo], Gt[a?] NORTH MU: Sa[ntali], Mu[ndari], Ho, Bir[hor], Ku[rku] = Ko[rku] INDO-ARYAN: S[ans]k[rit], H[indi], Sad[ani] (H. dialect in Kh. area), B[engali], O[riya], Des[ia] (O. dialect in the area of Go., Gu., Re. Gt.) DRAVIDIAN: Or[aon] = Kurukh (in Kh. area), Tel[ugu] (in Des., Sora area) SRC: SOURCE: List unfinished. Usually a (string of) capital letters representing the initial of the contributor, but in some cases a code for a book of field notes or a locus, possibly with a two-digit date, etc. The source codes ":" or "@" are used for entries identified only by a re-analysis of the original sources. , OR ,, OR ;; are used to separate variant forms. / AND \ are used for / forms or \ forms. PH: PHONETIC: Occasional in Go entries. PS: Parts of speech, mostly self-evident: N, V (language specific subclasses may be indicated), PP (postposition), PX (prefix), SX (suffix), IX (infix), X undecided, etc. DEF[inition]: The main definition is enclosed in ``... '', with some ^words and ^short_^phrases marked as short glosses or as relevant keys in searches, with % prefixed to scientific names. Some missing definitions are simply ``G''. OTHER FIELDS: All optional. They begin after two spaces and end with periods or question marks plus two spaces (or end of entry). Most start with an identifying code character, e.g. analyses with |, comments !, queries ?, etymologies *, and any of # % @ etc. for serial numbers, etc. FORM PUNCT'N: - or = between morphemes, also around infixes in Go(M); . between reduplicated parts; [...] around morphemes PHONOLOGICAL CODING In <...> (phonemic), occ. in <<...>> (morphophonemic) or [...] (phonetic): LAB DENT RETR PAL VEL GLOT STOPS: p t T c (or affr) k ? OR ' b d D j (or affr) g NASALS: m n N J G FRICS: f s S $ h v z LIQS: l L r R GLIDES: w y VOWELS: u i + (barred-i) U (small cap) I (small cap) o e O (open o) E (epsilon) A (schwa) @ (ae digraph) a In addition there are other chars mainly in non-Munda comparisons, e.g. H for dotted h (visarga) in Sk., etc. DIACRITICS follow the character they modify: X: long X, X~ nasalized X, Xh aspirated X, X' glottalized X MISCELLANEOUS PHONOLOGICAL NOTES: SM* regularly becomes GuReGt*, and then GuRe. SM* > GuReGt In some citations of Sora, is represented as since in that language (alone) there is no (borrowed) dental . The dental vs retroflex distinction is mostly borrowed, excepting the vs distinction, but in several languages it has arisen independently. In several languages <>, <>, <> may be pronounced as , [NDr], and in Juang <> is pronounced in some dialects. The intrusive consonants may or may not be written in the source files, but the difference is never distinctive. In most languages the native syllable final stops are checked and voiceless, as in Mon-Khmer, but they are pronounced as voiced stops before vowels, and may be identified lexically as voiced. (In fact, some Juang dialects no longer check final stops, and pronounce them as voiced.) Citations of stops in syllable-final position may be morphophonemic <> or phonemic , the latter with or without the glottalization diacritic. Historically speaking there was only one series in final position, exactly as in Mon-Khmer, so the transcriptions would not matter, except that in many languages Indo-Aryan words have been borrowed with final unchecked voiced and voiceless (and in Kharia and the Kherwarian group of North Munda, borrowing from Hindi, and in Korku borrowing from Marathi) even final aspirated voiced and voiceless stops, all contrasting with the native final checked stops. In all these cases we have followed the source transcriptions, except for words from Kh(B), Biligiri's Kharia Grammar, which used a coexistent phoneme-systems type of transcription that had arguable merits but was difficult to reconcile with other sources for Kharia, and was therefore normalized to conform to their type of transcription. In Gorum checked stops have in some cases become checked nasals and are written as such. (Some checked nasals may however be from stop plus nasal.) In several languages the point of articulation of a checked stop is lost before a homorganic stop or nasal, and that may or may not be shown explicitly. In all these cases we have followed the source transcriptions. In Kharia (and often in North Munda, and Hindi) there is flapping of to , to , and probably Hindi loans with as ; they may written thus. In Juang, is flapped to instead, while syllable-final becomes . Juang in some dialects has changed some

's to or zero, but this is lexically diffused and therefore represented explicitly; due to entering

, , and zero variants in distinct alphabetical orders, the same entry may have been found three times in search files. This is true of alphabetically distinct variants in any of the languages, unless the search results have been hand-edited to eliminate the repetitious entries. Most Munda languages vowels have a palatal (fronted, raised) glide before tautosyllabic palatals, i.e. . In some languages the vowel is then transcribed or . The palatal may itself be lost, leaving a glottal in the case of the checked stops, nasalization in the case of the nasal stop, and nothing in the case of the glide. In some of the languages the resulting forms have been reinterpreted as phonemic. Different Sora sources may disagree on vs , vs , and vs . This is gradually being corrected in the Sora materials, but in the meantime we have followed the earlier sources. Also before the palatals is usually pronounced [+] or [e], and may be found transcribed as <+> or . ---------------------------------------------------------------- Please address questions and comments to Patricia J. Donegan and David Stampe University of Hawai`i Department of Linguistics Honolulu, Hawai`i 96825 USA