Loading AI tools
Austroasiatic language of Cambodia From Wikipedia, the free encyclopedia
Khmer (/kəˈmɛər/ kə-MAIR;[3] ខ្មែរ, UNGEGN: Khmêr [kʰmae]) is an Austroasiatic language spoken natively by the Khmer people. This language is an official language and national language of Cambodia. The language is also widely spoken by Khmer people in Eastern Thailand and Isan, Thailand, also in Southeast and Mekong Delta of Vietnam.
Khmer | |
---|---|
Cambodian | |
ភាសាខ្មែរ
/ ខេមរភាសា Phéasa Khmêr / Khémôrôphéasa | |
Pronunciation | [pʰiəsaː kʰmae] [kʰeːmarapʰiəsaː] |
Native to | |
Ethnicity | Khmer |
speakers | L1: 17 million (2019)[1] L2: 1 million (no date)[1] |
Austroasiatic
| |
Early forms | |
Official status | |
Official language in | Cambodia |
Recognised minority language in | |
Regulated by | Royal Academy of Cambodia, National Council of Khmer Language[2] |
Language codes | |
ISO 639-1 | km Central Khmer |
ISO 639-2 | khm Central Khmer |
ISO 639-3 | Either:khm – Khmerkxm – Northern Khmer |
Glottolog | khme1253 Khmericcent1989 Central Khmer |
Linguasphere | 46-FBA-a |
Khmer |
Khmer has been influenced considerably by Sanskrit and Pali especially in the royal and religious registers, through Hinduism and Buddhism,[4] due to Old Khmer being the language of the historical empires of Chenla and Angkor.
The vast majority of Khmer speakers speak Central Khmer, the dialect of the central plain where the Khmer are most heavily concentrated. Within Cambodia, regional accents exist in remote areas but these are regarded as varieties of Central Khmer. Two exceptions are the speech of the capital, Phnom Penh, and that of the Khmer Khe in Stung Treng province, both of which differ sufficiently enough from Central Khmer to be considered separate dialects of Khmer.
Outside of Cambodia, three distinct dialects are spoken by ethnic Khmers native to areas that were historically part of the Khmer Empire. The Northern Khmer dialect is spoken by over a million Khmers in the southern regions of Northeast Thailand and is treated by some linguists as a separate language. Khmer Krom, or Southern Khmer, is the first language of the Khmer of Vietnam, while the Khmer living in the remote Cardamom Mountains speak a very conservative dialect that still displays features of the Middle Khmer language.
Khmer is primarily an analytic, isolating language. There are no inflections, conjugations or case endings. Instead, particles and auxiliary words are used to indicate grammatical relationships. General word order is subject–verb–object, and modifiers follow the word they modify. Classifiers appear after numbers when used to count nouns, though not always so consistently as in languages like Chinese. In spoken Khmer, topic-comment structure is common, and the perceived social relation between participants determines which sets of vocabulary, such as pronouns and honorifics, are proper.
Khmer differs from neighboring languages such as Burmese, Thai, Lao, and Vietnamese in that it is not a tonal language. Words are stressed on the final syllable, hence many words conform to the typical Mon–Khmer pattern of a stressed syllable preceded by a minor syllable. The language has been written in the Khmer script, an abugida descended from the Brahmi script via the southern Indian Pallava script, since at least the 7th century. The script's form and use has evolved over the centuries; its modern features include subscripted versions of consonants used to write clusters and a division of consonants into two series with different inherent vowels.
Khmer is a member of the Austroasiatic language family, the autochthonous family in an area that stretches from the Malay Peninsula through Southeast Asia to East India.[5] Austroasiatic, which also includes Mon, Vietnamese and Munda, has been studied since 1856 and was first proposed as a language family in 1907.[6] Despite the amount of research, there is still doubt about the internal relationship of the languages of Austroasiatic.[7]
Diffloth places Khmer in an eastern branch of the Mon-Khmer languages.[8] In these classification schemes Khmer's closest genetic relatives are the Bahnaric and Pearic languages.[9] More recent classifications doubt the validity of the Mon-Khmer sub-grouping and place the Khmer language as its own branch of Austroasiatic equidistant from the other 12 branches of the family.[7]
Khmer is spoken by some 13 million people in Cambodia, where it is the official language. It is also a second language for most of the minority groups and indigenous hill tribes there. Additionally there are a million speakers of Khmer native to southern Vietnam (1999 census)[10] and 1.4 million in northeast Thailand (2006).[11]
Khmer dialects, although mutually intelligible, are sometimes quite marked. Notable variations are found in speakers from Phnom Penh (Cambodia's capital city), the rural Battambang area, the areas of Northeast Thailand adjacent to Cambodia such as Surin province, the Cardamom Mountains, and southern Vietnam.[12][13][14] The dialects form a continuum running roughly north to south. Standard Cambodian Khmer is mutually intelligible with the others but a Khmer Krom speaker from Vietnam, for instance, may have great difficulty communicating with a Khmer native of Sisaket Province in Thailand.
The following is a classification scheme showing the development of the modern Khmer dialects.[15][16]
Standard Khmer, or Central Khmer, the language as taught in Cambodian schools and used by the media, is based on the dialect spoken throughout the Central Plain,[17] a region encompassed by the northwest and central provinces.
Northern Khmer (called Khmer Surin in Khmer) refers to the dialects spoken by many in several border provinces of present-day northeast Thailand. After the fall of the Khmer Empire in the early 15th century, the Dongrek Mountains served as a natural border leaving the Khmer north of the mountains under the sphere of influence of the Kingdom of Lan Xang. The conquests of Cambodia by Naresuan the Great for Ayutthaya furthered their political and economic isolation from Cambodia proper, leading to a dialect that developed relatively independently from the midpoint of the Middle Khmer period.[18]
This has resulted in a distinct accent influenced by the surrounding tonal languages Lao and Thai, lexical differences, and phonemic differences in both vowels and distribution of consonants. Syllable-final /r/, which has become silent in other dialects of Khmer, is still pronounced in Northern Khmer. Some linguists classify Northern Khmer as a separate but closely related language rather than a dialect.[19][20]
Western Khmer, also called Cardamom Khmer or Chanthaburi Khmer, is spoken by a very small, isolated population in the Cardamom mountain range extending from western Cambodia into eastern Central Thailand. Although little studied, this variety is unique in that it maintains a definite system of vocal register that has all but disappeared in other dialects of modern Khmer.[5]
Phnom Penh Khmer is spoken in the capital and surrounding areas. This dialect is characterized by merging or complete elision of syllables, which speakers from other regions consider a "relaxed" pronunciation. For instance, "Phnom Penh" is sometimes shortened to "m'Penh". Another characteristic of Phnom Penh speech is observed in words with an "r" either as an initial consonant or as the second member of a consonant cluster (as in the English word "bread"). The "r", trilled or flapped in other dialects, is either pronounced as a uvular trill or not pronounced at all.[21]
This alters the quality of any preceding consonant, causing a harder, more emphasized pronunciation. Another unique result is that the syllable is spoken with a low-rising or "dipping" tone much like the "hỏi" tone in Vietnamese. For example, some people pronounce ត្រី [trəj] ('fish') as [tʰəj]: the [r] is dropped and the vowel begins by dipping much lower in tone than standard speech and then rises, effectively doubling its length. Another example is the word រៀន [riən] ('study'), which is pronounced [ʀiən], with the uvular "r" and the same intonation described above.[21]
Khmer Krom or Southern Khmer is spoken by the indigenous Khmer population of the Mekong Delta, formerly controlled by the Khmer Empire but part of Vietnam since 1698. Khmers are persecuted by the Vietnamese government for using their native language and, since the 1950s, have been forced to take Vietnamese names.[22] Consequently, very little research has been published regarding this dialect. It has been generally influenced by Vietnamese for three centuries and accordingly displays a pronounced accent, tendency toward monosyllabic words and lexical differences from Standard Khmer.[23]
Khmer Khe is spoken in the Se San, Srepok and Sekong river valleys of Sesan and Siem Pang districts in Stung Treng Province. Following the decline of Angkor, the Khmer abandoned their northern territories, which the Lao then settled. In the 17th century, Chey Chetha XI led a Khmer force into Stung Treng to retake the area. The Khmer Khe living in this area of Stung Treng in modern times are presumed to be the descendants of this group. Their dialect is thought to resemble that of pre-modern Siem Reap.[24]
Linguistic study of the Khmer language divides its history into four periods one of which, the Old Khmer period, is subdivided into pre-Angkorian and Angkorian.[25] Pre-Angkorian Khmer is the Old Khmer language from 600 CE through 800. Angkorian Khmer is the language as it was spoken in the Khmer Empire from the 9th century until the 13th century.[26]
The following centuries saw changes in morphology, phonology and lexicon. The language of this transition period, from about the 14th to 18th centuries, is referred to as Middle Khmer and saw borrowings from Thai in the literary register.[26] Modern Khmer is dated from the 19th century to today.[25]
The following table shows the conventionally accepted historical stages of Khmer.[15]
Historical stage | Date |
---|---|
Pre- or Proto-Khmer | Before 600 CE |
Pre-Angkorian Old Khmer | 600–800 |
Angkorian Old Khmer | 800 to mid-14th century |
Middle Khmer | Mid-14th century to 18th century |
Modern Khmer | 1800–present |
Just as modern Khmer was emerging from the transitional period represented by Middle Khmer, Cambodia fell under the influence of French colonialism.[27] Thailand, which had for centuries claimed suzerainty over Cambodia and controlled succession to the Cambodian throne, began losing its influence on the language.[28] In 1887 Cambodia was fully integrated into French Indochina, which brought in a French-speaking aristocracy. This led to French becoming the language of higher education and the intellectual class. By 1907, the French had wrested over half of modern-day Cambodia, including the north and northwest where Thai had been the prestige language, back from Thai control and reintegrated it into the country.[28]
Many native scholars in the early 20th century, led by a monk named Chuon Nath, resisted the French and Thai influences on their language. Forming the government sponsored Cultural Committee to define and standardize the modern language, they championed Khmerization, purging of foreign elements, reviving affixation, and the use of Old Khmer roots and historical Pali and Sanskrit to coin new words for modern ideas.[27][29] Opponents, led by Keng Vannsak, who embraced "total Khmerization" by denouncing the reversion to classical languages and favoring the use of contemporary colloquial Khmer for neologisms, and Ieu Koeus, who favored borrowing from Thai, were also influential.[29]
Koeus later joined the Cultural Committee and supported Nath. Nath's views and prolific work won out and he is credited with cultivating modern Khmer-language identity and culture, overseeing the translation of the entire Pali Buddhist canon into Khmer. He also created the modern Khmer language dictionary that is still in use today, helping preserve Khmer during the French colonial period.[27]
The phonological system described here is the inventory of sounds of the standard spoken language,[17] represented using the International Phonetic Alphabet (IPA).
The voiceless plosives /p/, /t/, /c/, /k/ may occur with or without aspiration (as [p] vs. [pʰ], etc.); this difference is contrastive before a vowel. However, the aspirated sounds in that position may be analyzed as sequences of two phonemes: /ph/, /th/, /ch/, /kh/. This analysis is supported by the fact that infixes can be inserted between the stop and the aspiration; for example [tʰom] ('big') becomes [tumhum] ('size') with a nominalizing infix. When one of these plosives occurs initially before another consonant, aspiration is no longer contrastive and can be regarded as mere phonetic detail:[30][31] slight aspiration is expected when the following consonant is not one of /ʔ/, /b/, /d/, /r/, /s/, /h/ (or /ŋ/ if the initial plosive is /k/).
The voiced plosives are pronounced as implosives [ɓ, ɗ] by most speakers, but this feature is weak in educated speech, where they become [b, d].[32]
In syllable-final position, /h/ and /ʋ/ approach [ç] and [w] respectively. The stops /p/, /t/, /c/, /k/ are unaspirated and have no audible release when occurring as syllable finals.[17]
In addition, the consonants /ɡ/, /f/, /ʃ/ and /z/ occur occasionally in recent loan words in the speech of Cambodians familiar with French and other languages.
Various authors have proposed slightly different analyses of the Khmer vowel system. This may be in part because of the wide degree of variation in pronunciation between individual speakers, even within a dialectal region.[33] The description below follows Huffman (1970).[17] The number of vowel nuclei and their values vary between dialects; differences exist even between the Standard Khmer system and that of the Battambang dialect on which the standard is based.[34]
Front | Central | Back | ||
---|---|---|---|---|
Short | centering | ĕə អ៊ា |
ŏə, ŭə អ៊ួ | |
Long | centering | iə អៀ, អ៊ា់ |
ɨə អឿ |
ɔə, uə អួ |
mid closing | ei អ៊ៃ |
ɛi អី |
om អុំ | |
open closing | ae អែ |
aə អើ |
ao អុ | |
mid opening | um អ៊ុំ, អ៊ំ |
eh អេះ, អែះ |
eh អិះ, អែះ | |
close opening | aw អៅ |
am អំ, អាំ |
ah អោះ, អះ |
In addition, some diphthongs and triphthongs are analyzed as a vowel nucleus plus a semivowel (/j/ or /w/) coda because they cannot be followed by a final consonant. These include: (with short monophthongs) /ɨw/, /əw/, /aj/, /aw/, /uj/; (with long monophthongs) /əːj/, /aːj/; (with long diphthongs) /iəj/, /iəw/, /ɨəj/, /aoj/, /aəj/ and /uəj/.[35]
Khmer Vowels | IPA |
---|---|
ឥ | ʔe |
ឦ | ʔej |
ឧ | ʔu |
ឩ | ʔuː |
ឪ | ʔɜw, ʔɨw |
ឫ | rɨ |
ឬ | rɨː |
ឭ | lɨ |
ឮ | lɨː |
ឯ | ʔae |
ឰ | ʔaj |
ឱ, ឲ | ʔao |
ឳ | ʔaw |
The independent vowels are the vowels that can exist without a preceding or trailing consonant. The independent vowels may be used as monosyllabic words, or as the initial syllables in longer words. Khmer words never begin with regular vowels; they can, however, begin with independent vowels. Example: ឰដ៏, ឧទាហរណ៍, ឧត្តម, ឱកាស...។
A Khmer syllable begins with a single consonant, or else with a cluster of two, or rarely three, consonants. The only possible clusters of three consonants at the start of a syllable are /str/, /skr/,[36] and (with aspirated consonants analyzed as two-consonant sequences) /sth/, /lkh/. There are 85 possible two-consonant clusters (including [pʰ] etc. analyzed as /ph/ etc.). All the clusters are shown in the following table, phonetically, i.e. superscript ʰ can mark either contrastive or non-contrastive aspiration (see above).
p | ɓ | t | ɗ | c | k | ʔ | m | n | ɲ | ŋ | j | l | r | s | h | ʋ | t+h | k+h | t+r | k+r | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
p | pʰt- | pɗ- | pʰc- | pʰk- | pʔ- | pʰn- | pʰɲ- | pʰŋ- | pʰj- | pʰl- | pr- | ps- | pʰ- | ||||||||
t | tʰp- | tɓ- | tʰk- | tʔ- | tʰm- | tʰn- | tʰŋ- | tʰj- | tʰl- | tr- | tʰ- | tʰʋ- | |||||||||
c | cʰp- | cɓ- | cɗ- | cʰk- | cʔ- | cʰm- | cʰn- | cʰŋ- | cʰl- | cr- | cʰ- | cʰʋ- | |||||||||
k | kʰp- | kɓ- | kʰt- | kɗ- | kʰc- | kʔ- | kʰm- | kʰn- | kʰɲ- | kŋ- | kʰj- | kʰl- | kr- | ks- | kʰ- | kʰʋ- | |||||
s | sp- | sɓ- | st- | sɗ- | sk- | sʔ- | sm- | sn- | sɲ- | sŋ- | sl- | sr- | sʋ- | stʰ- | str- | skr- | |||||
ʔ | ʔʋ- | ||||||||||||||||||||
m | mt- | mɗ- | mc- | mʔ- | mn- | mɲ- | ml- | mr- | ms- | mh- | |||||||||||
l | lp- | lɓ- | lk- | lʔ- | lm- | lŋ- | lh- | lʋ- | lkʰ- |
Slight vowel epenthesis occurs in the clusters consisting of a plosive followed by /ʔ/, /b/, /d/, in those beginning /ʔ/, /m/, /l/, and in the cluster /kŋ-/.[37]: 8–9
After the initial consonant or consonant cluster comes the syllabic nucleus, which is one of the vowels listed above. This vowel may end the syllable or may be followed by a coda, which is a single consonant. If the syllable is stressed and the vowel is short, there must be a final consonant. All consonant sounds except /b/, /d/, /r/, /s/ and the aspirates can appear as the coda (although final /r/ is heard in some dialects, most notably in Northern Khmer).[38]
A minor syllable (unstressed syllable preceding the main syllable of a word) has a structure of CV-, CrV-, CVN- or CrVN- (where C is a consonant, V a vowel, and N a nasal consonant). The vowels in such syllables are usually short; in conversation they may be reduced to [ə], although in careful or formal speech, including on television and radio, they are clearly articulated. An example of such a word is មនុស្ស mɔnuh, mɔnɨh, mĕəʾnuh ('person'), pronounced [mɔˈnuh], or more casually [məˈnuh].[37]: 10
Stress in Khmer falls on the final syllable of a word.[39] Because of this predictable pattern, stress is non-phonemic in Khmer (it does not distinguish different meanings).
Most Khmer words consist of either one or two syllables. In most native disyllabic words, the first syllable is a minor (fully unstressed) syllable. Such words have been described as sesquisyllabic (i.e. as having one-and-a-half syllables). There are also some disyllabic words in which the first syllable does not behave as a minor syllable, but takes secondary stress. Most such words are compounds, but some are single morphemes (generally loanwords). An example is ភាសា ('language'), pronounced [ˌpʰiəˈsaː].[37]: 10
Words with three or more syllables, if they are not compounds, are mostly loanwords, usually derived from Pali, Sanskrit, or more recently, French. They are nonetheless adapted to Khmer stress patterns.[40] Primary stress falls on the final syllable, with secondary stress on every second syllable from the end. Thus in a three-syllable word, the first syllable has secondary stress; in a four-syllable word, the second syllable has secondary stress; in a five-syllable word, the first and third syllables have secondary stress, and so on.[37]: 10–11 Long polysyllables are not often used in conversation.[17]: 12
Compounds, however, preserve the stress patterns of the constituent words. Thus សំបុកចាប, the name of a kind of cookie (literally 'bird's nest'), is pronounced [sɑmˌbok ˈcaːp], with secondary stress on the second rather than the first syllable, because it is composed of the words [sɑmˈbok] ('nest') and [caːp] ('bird').[40]
Khmer once had a phonation distinction in its vowels, but this now survives only in the most archaic dialect (Western Khmer).[5] The distinction arose historically when vowels after Old Khmer voiced consonants became breathy voiced and diphthongized; for example *kaa, *ɡaa became *kaa, *ɡe̤a. When consonant voicing was lost, the distinction was maintained by the vowel (*kaa, *ke̤a); later the phonation disappeared as well ([kaː], [kiə]).[30] These processes explain the origin of what are now called a-series and o-series consonants in the Khmer script.
Although most Cambodian dialects are not tonal, the colloquial Phnom Penh dialect has developed a tonal contrast (level versus peaking tone) as a by-product of the elision of /r/.[30]
Intonation often conveys semantic context in Khmer, as in distinguishing declarative statements, questions and exclamations. The available grammatical means of making such distinctions are not always used, or may be ambiguous; for example, the final interrogative particle ទេ /teː/ can also serve as an emphasizing (or in some cases negating) particle.[41]
The intonation pattern of a typical Khmer declarative phrase is a steady rise throughout followed by an abrupt drop on the last syllable.[36]
Other intonation contours signify a different type of phrase such as the "full doubt" interrogative, similar to yes–no questions in English. Full doubt interrogatives remain fairly even in tone throughout, but rise sharply towards the end.
Exclamatory phrases follow the typical steadily rising pattern, but rise sharply on the last syllable instead of falling.[36]
Khmer is primarily an analytic language with no inflection. Syntactic relations are mainly determined by word order. Old and Middle Khmer used particles to mark grammatical categories and many of these have survived in Modern Khmer but are used sparingly, mostly in literary or formal language.[41] Khmer makes extensive use of auxiliary verbs, "directionals" and serial verb construction. Colloquial Khmer is a zero copula language, instead preferring predicative adjectives (and even predicative nouns) unless using a copula for emphasis or to avoid ambiguity in more complex sentences. Basic word order is subject–verb–object (SVO), although subjects are often dropped; prepositions are used rather than postpositions.[42]
Topic-Comment constructions are common and the language is generally head-initial (modifiers follow the words they modify). Some grammatical processes are still not fully understood by western scholars. For example, it is not clear if certain features of Khmer grammar, such as actor nominalization, should be treated as a morphological process or a purely syntactic device,[43]: 46, 74 and some derivational morphology seems "purely decorative" and performs no known syntactic work.[43]: 53
Lexical categories have been hard to define in Khmer.[43]: 360 Henri Maspero, an early scholar of Khmer, claimed the language had no parts of speech,[43] while a later scholar, Judith Jacob, posited four parts of speech and innumerable particles.[44]: 331 John Haiman, on the other hand, identifies "a couple dozen" parts of speech in Khmer with the caveat that Khmer words have the freedom to perform a variety of syntactic functions depending on such factors as word order, relevant particles, location within a clause, intonation and context.[43] Some of the more important lexical categories and their function are demonstrated in the following example sentence taken from a hospital brochure:[43]: 378
/loːk
you[RESP]
PN
nĕəʔ
you[FAM]
PN
pʰɗɑl
provide
VERB
cʰiəm
blood
NOUN
teaŋ
every
PTCL
ʔɑh
all
ADJ
trəw
must
AUX
tae
have to
INT
tɔtuəl
receive
VERB
nəw
OM
kaː
NMLZ
pinɨt
examine
VERB
riəŋ
shape
NOUN
kaːj
body
NOUN
nɨŋ
and
CNJ
pʰɗɑl
provide
VERB
nəw
OM
prɑʋɔət
history
NOUN
sokʰapʰiəp
health
ADJ
ciə
be
COP
mun
before
ADV
ciə
be
COP
sən/
first
ADV
'All blood donors must pass a physical examination and provide a health history first (before they can give blood).'
Modern Khmer is an isolating language, which means that it uses little productive morphology. There is some derivation by means of prefixes and infixes, but this is a remnant of Old Khmer and not always productive in the modern language.[45] Khmer morphology is evidence of a historical process through which the language was, at some point in the past, changed from being an agglutinative language to adopting an isolating typology.[46] Affixed forms are lexicalized and cannot be used productively to form new words.[37]: 311 Below are some of the most common affixes with examples as given by Huffman.[37]: 312–316
Affix | Function | Word | Meaning | Affixed Word | Meaning |
---|---|---|---|---|---|
prefixed [pʰ] | causation | [ɗac] [ɗaəm] | "broke, torn" "origin" | [pʰɗac] [pʰɗaəm] | "to tear apart" "to originate (trans.)" |
prefixed [rɔː] | derives adjectives nominalization | [lut] [ɓaŋ] | "to extinguish" "to hide" | [rɔːlut] [rɔːɓaŋ] | "extinguished" "a screen, shade" |
prefixed [prɑː] | reciprocity | [kʰam] [ɗouc] | "to bite" "similar" | [prɑːkʰam] [prɑːɗouc] | "to bite each other" "to compare" |
prefixed [ɓɑm], [ɓɑn], [ɓɑŋ] | causation | [ɓaek] [ɗaə] [riən] | "to break (intrans.)" "to walk" "to study, learn" | [ɓɑmɓaek] [ɓɑnɗaə] [ɓɑŋriən] | "to cause to break" "to take for a walk" "to teach" |
infixed [ɑm] | causation | [sʔaːt] [slap] | "to be clean" "to die" | [sɑmʔaːt] [sɑmlap] | "to clean" "to kill" |
infixed [ɑm(n)], [um(n)] | nominalization | [ɗaə] [ɗəŋ] [cɨə] | "to walk" "to know (something)" "to believe" | [ɗɑmnaə] [ɗɑmnəŋ] [cumnɨə] | "a trip" "information" "belief" |
Compounding in Khmer is a common derivational process that takes two forms, coordinate compounds and repetitive compounds. Coordinate compounds join two unbound morphemes (independent words) of similar meaning to form a compound signifying a concept more general than either word alone.[37]: 296 Coordinate compounds join either two nouns or two verbs. Repetitive compounds, one of the most productive derivational features of Khmer, use reduplication of an entire word to derive words whose meaning depends on the class of the reduplicated word.[37]: 185 A repetitive compound of a noun indicates plurality or generality while that of an adjectival verb could mean either an intensification or plurality.
Coordinate compounds:[37]: 296–297
[ʔəwpuk]
father
+
[mɗaːj]
mother
⇒
[ʔəwpuk.mɗaːj]
parents
[ɗək]
to transport
+
[nɔəm]
to bring
⇒
[ɗəknŏəm]
to lead
Repetitive compounds:[37]: 185–185
[cʰap]
fast
⇒
[cʰapcʰap]
very fast, quickly
[srəj]
women
⇒
[srəjsrəj]
women, women in general
Khmer nouns do not inflect for grammatical gender or singular/plural. There are no articles, but indefiniteness is often expressed by the word for "one" (មួយ [muəj]) following the noun as in ឆ្កែមួយ ([cʰkae muəj] "a dog"). Plurality can be marked by postnominal particles, numerals, or reduplication of a following adjective, which, although similar to intensification, is usually not ambiguous due to context.[47]
[cʰkae
dog
craən]
many
"many dogs"
[cʰkae
dog
piː]
two
"two dogs"
[cʰkae
dog
tʰom
large
tʰom]
large
'large dogs'
Classifying particles are used after numerals, but are not always obligatory as they are in Thai or Chinese, for example, and are often dropped in colloquial speech. Khmer nouns are divided into two groups: mass nouns, which take classifiers; and specific, nouns, which do not. The overwhelming majority are mass nouns.[37]: 67–68
[kʰmawɗaj
pencil
piː
two
ɗaəm]
CLF [long cylindrical object]
"two pencils"
Possession is colloquially expressed by word order. The possessor is placed after the thing that is possessed.[43]: 160 Alternatively, in more complex sentences or when emphasis is required, a possessive construction using the word របស់ ([rɔːbɑh] ~ [lə.bɑh], "property, object") may be employed. In formal and literary contexts, the possessive particle នៃ ([nɨj]) is used:[37]: 358
[puəʔmaːʔ
friend
kʰɲom]
I
"my friend"
[puəʔmaːʔ
friend
rɔːɓɑh
property
kʰɲom]
I
"my friend"
[puəʔmaːʔ
friend
nɨj
of
kʰɲom]
I
"my friend"
Pronouns are subject to a complicated system of social register, the choice of pronoun depending on the perceived relationships between speaker, audience and referent (see Social registers below). Khmer exhibits pronoun avoidance, so kinship terms, nicknames and proper names are often used instead of pronouns (including for the first person) among intimates. Subject pronouns are frequently dropped in colloquial conversation.[47]
Adjectives, verbs and verb phrases may be made into nouns by the use of nominalization particles. Three of the more common particles used to create nouns are [kaː], [seckdəj], and [pʰiəp].[43]: 45–48 These particles are prefixed most often to verbs to form abstract nouns. The latter, derived from Sanskrit, also occurs as a suffix in fixed forms borrowed from Sanskrit and Pali such as [sokʰapʰiəp] ("health") from [sok] ("to be healthy").[40]
[kaː
NMLZ
rŭəhnɨw]
to live
'life'
[sec
NMLZ
kdəj deik]
to lie down
'[the act of] lying down'
Adjectives, demonstratives and numerals follow the noun they modify. Adverbs likewise follow the verb. Morphologically, adjectives and adverbs are not distinguished, with many words often serving either function. Adjectives are also employed as verbs as Khmer sentences rarely use a copula.[37]
Degrees of comparison are constructed syntactically. Comparatives are expressed using the word ជាង /ciəŋ/: "A X /ciəŋ/ [B]" (A is more X [than B]). The most common way to express superlatives is with ជាងគេ /ciəŋ keː/: "A X /ciəŋ keː/" (A is the most X).[47] Intensity is also expressed syntactically, similar to other languages of the region, by reduplication or with the use of intensifiers.[47]
/srəj
girl
nuh
DEM
sʔaːt/
pretty
'That girl is pretty.'
/srəj
girl
nuh
DEM
sʔaːt
pretty
sʔaːt/
pretty
'That girl is very pretty.'
/srəj
girl
nuh
DEM
sʔaːt
pretty
nah/
very
'That girl is very pretty.'
As is typical of most East Asian languages,[48] Khmer verbs do not inflect at all; tense, aspect and mood can be expressed using auxiliary verbs, particles (such as កំពុង /kəmpuŋ/, placed before a verb to express continuous aspect) and adverbs (such as "yesterday", "earlier", "tomorrow"), or may be understood from context. Serial verb construction is quite common.[43]: 253
Khmer verbs are a relatively open class and can be divided into two types, main verbs and auxiliary verbs.[43]: 254 Huffman defined a Khmer verb as "any word that can be (negated)",[37]: 56 and further divided main verbs into three classes.
Transitive verbs are verbs that may be followed by a direct object:
/kʰɲom
I
ɲam
eat
ɓaj/
rice
"I eat rice."
/kʰɲom
I
tɨɲ
buy
ɓaːrəj/
cigarettes
"I buy cigarettes."
Intransitive verbs are verbs that can not be followed by an object:
/kʰɲom
I
ɗaə
walk
tɨw
DIR
pʰsaː/
market
"I walk to the market."
/ʔaɲcəɲ
to invite
ʔɑŋkuj/
to sit
"Please sit."
Adjectival verbs are a word class that has no equivalent in English. When modifying a noun or verb, they function as adjectives or adverbs, respectively, but they may also be used as main verbs equivalent to English "be + adjective".
- Adjective:
/proh
boy
sɑŋ.haː/
handsome
"handsome boy"
- Adverb:
/proh
boy
nuh
DEM
tʰʋəː kaː
to work
lʔɑː/
good
"That boy works well."
- Verb:
Syntax is the rules and processes that describe how sentences are formed in a particular language, how words relate to each other within clauses or phrases and how those phrases relate to each other within a sentence to convey meaning.[49] Khmer syntax is very analytic. Relationships between words and phrases are signified primarily by word order supplemented with auxiliary verbs and, particularly in formal and literary registers, grammatical marking particles.[43] Grammatical phenomena such as negation and aspect are marked by particles while interrogative sentences are marked either by particles or interrogative words equivalent to English "wh-words".
A complete Khmer sentence consists of four basic elements—an optional topic, an optional subject, an obligatory predicate, and various adverbials and particles.[50] The topic and subject are noun phrases, predicates are verb phrases and another noun phrase acting as an object or verbal attribute often follows the predicate.[50]
When combining these noun and verb phrases into a sentence the order is typically SVO:
/kʰɲom
I
SBJ
ʔaoj
give
VERB
ceik
banana
OBJ
muəj
one
cɑmnuən/
bunch[CLF]
'I gave a bunch of bananas.'
When both a direct object and indirect object are present without any grammatical markers, the preferred order is SV(DO)(IO). In such a case, if the direct object phrase contains multiple components, the indirect object immediately follows the noun of the direct object phrase and the direct object's modifiers follow the indirect object:
/kʰɲom
I
SBJ
ʔaoj
give
VERB
ceik
banana
DO
cruːk
pig
IO
muəj
one
cɑmnuən/
bunch[CLF]
'I gave the pig a bunch of bananas.'[43]: 207
This ordering of objects can be changed and the meaning clarified with the inclusion of particles. The word /dɑl/, which normally means "to arrive" or "towards", can be used as a preposition meaning "to":
/kʰɲom
I
ʔaoj
give
ceik
banana
muəj
one
cɑmnuən
bunch[CLF]
dɑl
toward
cruːk/
pig
'I gave a bunch of bananas to the pigs.'[43]: 207
Alternatively, the indirect object could precede the direct object if the object-marking preposition /nəw/ were used:
/kʰɲom
I
ʔaoj
give
cruːk
pig
nəw
OM
ceik
banana
muəj
one
cɑmnuən/
bunch[CLF]
'I gave the pig a bunch of bananas.'[43]: 207
However, in spoken discourse OSV is possible when emphasizing the object in a topic–comment-like structure.[43]: 211
The noun phrase in Khmer typically has the following structure:[37]: 50–51 [44]: 83
The elements in parentheses are optional. Honorifics are a class of words that serve to index the social status of the referent. Honorifics can be kinship terms or personal names, both of which are often used as first and second person pronouns, or specialized words such as /preah/ ('god') before royal and religious objects.[43]: 155 The most common demonstratives are /nih/ ('this, these') and /nuh/ ('that, those'). The word /ae nuh/ ('those over there') has a more distal or vague connotation.[40]
If the noun phrase contains a possessive adjective, it follows the noun and precedes the numeral. If a descriptive attribute co-occurs with a possessive, the possessive construction (/rɔbɑh/) is expected.[37]: 73
Some examples of typical Khmer noun phrases are:
ផ្ទះ
/ptĕəh
house
NOUN
ស្កឹមស្កៃ
skəm.skaj
high
ADJ
បី
bəj
three
NUM
បួន
buən
four
NUM
ខ្នង
kʰnɑːŋ
spine[CLF]
CLF
នេះ
nih/
these
DEM
'these three or four high houses'[43]: 142
ចេក
/ceːk
banana
NOUN
ទុំ
tum
ripe
ADJ
ពីរ
piː
two
NUM
ស្និត
snət
bunches[CLF]
CLF
នេះ
nih/
these
DEM
these two bunches of ripe bananas
ពួកម៉ាក
/puəʔmaʔ
friend
NOUN
ខ្ញុំ
kʰɲom
I
POSS
ពីរ
piː
two
NUM
នាក់
nĕə
person[CLF]
CLF
នេះ
nih/
these
DEM
these two friends of mine
ពួកម៉ាក
/puəʔmaʔ
friend
NOUN
តូច
touc
small
ADJ
របស់
rɔbɑh
of
POSS
ខ្ញុំ
kʰɲom
I
POSS
ពីរ
piː
two
NUM
នាក់
nĕə
person[CLF]
CLF
នេះ
nih/
these
DEM
these two small friends of mine[37]: 73
The Khmer particle /dɑː/ marked attributes in Old Khmer noun phrases and is used in formal and literary language to signify that what precedes is the noun and what follows is the attribute. Modern usage may carry the connotation of mild intensity.[43]: 163
/ʋiəl
field
srae
paddy
dɑː
ADJ.MARKER
lʋɨŋ lʋəːj/
vast
'(very) expansive fields and paddies'
Khmer verbs are completely uninflected, and once a subject or topic has been introduced or is clear from context the noun phrase may be dropped. Thus, the simplest possible sentence in Khmer consists of a single verb. For example, /tɨw/ 'to go' on its own can mean "I'm going.", "He went.", "They've gone.", "Let's go.", etc.[37]: 17 This also results in long strings of verbs such as:
Khmer uses three verbs for what translates into English as the copula. The general copula is /ciə/; it is used to convey identity with nominal predicates.[43]: 212 For locative predicates, the copula is /nɨw/.[43]: 212 The verb /miən/ is the "existential" copula meaning "there is" or "there exists".[43]: 208
/piəsaː
language
ciə
copula
kaː
NMLZ
sɑmdaeŋ
to express
cət
heart
kumnɨt
thought
krŏəp
all
jaːŋ/
kind
'Language is the expression of all emotions and ideas'
/ʋiə
he
nɨw
copula
cɪt
close
ʋŏət/
temple
'He is close to the temple.'
/miən
to
pʰaen
exist
kaː/
plan
'There is a plan.'
Negation is achieved by putting មិន /mɨn/ before the verb and the particle ទេ /teː/ at the end of the sentence or clause. In colloquial speech, verbs can also be negated without the need for a final particle, by placing ឥត /ʔɑt/~/ʔət/ before them.[47]
/kʰɲom
I
cɨə/
to believe
'I believe.'
/kʰɲom
I
mɨn
NEG
cɨə
to believe
teː/
NEG
'I don't believe.'
/kʰɲom
I
ʔɑt
NEG
cɨə/
to believe
'I don't believe.'
Past tense can be conveyed by adverbs, such as "yesterday" or by the use of perfective particles such as /haəj/
/kŏət
he
tɨw
to go
msəlmɨɲ/
yesterday
'He went yesterday.'
Different senses of future action can also be expressed by the use of adverbs like "tomorrow" or by the future tense marker /nɨŋ/, which is placed immediately before the verb, or both:
Imperatives are often unmarked.[43]: 240 For example, in addition to the meanings given above, the "sentence" /tɨw/ can also mean "Go!". Various words and particles may be added to the verb to soften the command to varying degrees, including to the point of politeness (jussives):[43]: 240
/cou
IMP
saːk
try
lbɑːŋ
try
kʰluən
you
aeŋ
REFL
coh/
IMP
'Go ahead and try it yourself.'
/soum
please
tʰʋəː
do
taːm
follow
bɑndam
instruction
kŏət
he
tɨw/
IMP
'Please follow his instructions.'
Prohibitives take the form "/kom/ + verb" and also are often softened by the addition of the particle /ʔəj/ to the end of the phrase.[43]: 242
/kom
PROH
nɨw
to be
tiː
place
nih
DEM
ʔəj/
COHORT
'Don't stay in this place.'
There are three basic types of questions in Khmer.[37]: 46 Questions requesting specific information use question words. Polar questions are indicated with interrogative particles, most commonly /teː/, a homonym of the negation particle. Tag questions are indicated with various particles and rising inflection.[37]: 57 The SVO word order is generally not inverted for questions.
/loːk
you
tɨw
to go
naː/
where
'Where are you going?'
/loːk
you
sdap
understand
baːn
modal
teː/
Q
'Can you understand?'
/loːk
you
tɨw
to go
psaː
market
haəj
PRF
rɨː
or
nɨw/
yet
'Have you gone to the store yet?'
In more formal contexts and in polite speech, questions are also marked at their beginning by the particle /taə/.
Khmer does not have a passive voice,[41] but there is a construction utilizing the main verb /trəw/ ("to hit", "to be correct", "to affect") as an auxiliary verb meaning "to be subject to" or "to undergo"—which results in sentences that are translated to English using the passive voice.[43]: 286–288
/piː
from
msəlmɨɲ
yesterday
kʰɲom
I
trəw
to undergo
cʰkae
dog
kʰam/
to bite
'Yesterday I was bitten by a dog.'[37]: 302
Complex sentences are formed in Khmer by the addition of one or more clauses to the main clause. The various types of clauses in Khmer include the coordinate clause, the relative clause and the subordinate clause. Word order in clauses is the same for that of the basic sentences described above.[43] Coordinate clauses do not necessarily have to be marked; they can simply follow one another. When explicitly marked, they are joined by words similar to English conjunctions such as /nɨŋ/ ("and") and /haəj/ ("and then") or by clause-final conjunction-like adverbs /dae/ and /pʰɑːŋ/, both of which can mean "also" or "and also"; disjunction is indicated by /rɨː/ ("or").[43]: 217–218 [51]
Relative clauses can be introduced by /dael/ ("that") but, similar to coordinate clauses, often simply follow the main clause. For example, both phrases below can mean "the hospital bed that has wheels".[43]: 313
/krɛː
bed
pɛːt
hospital
miən
have
kɑŋ
wheel
ruɲ/
to push
"the hospital bed that has wheels"
/krɛː
bed
pɛːt
hospital
dael
REL
miən
have
kɑŋ
wheel
ruɲ/
to push
"the hospital bed that has wheels"
Relative clauses are more likely to be introduced with /dael/ if they do not immediately follow the head noun.[43]: 314 Khmer subordinate conjunctions always precede a subordinate clause.[43]: 366 Subordinate conjunctions include words such as /prŭəh/ ("because"), /hak bəj/ ("seems as if") and /daəmbəj/ ("in order to").[37]: 251 [43]
Counting in Khmer is based on a biquinary system: the numbers from 6 to 9 have the form "five one", "five two", etc. The words for multiples of ten from 30 to 90 are not related to the basic Khmer numbers, but are Chinese in origin, and probably came to Khmer via Thai. Khmer numerals, which were inherited directly from Indian numerals, are used more widely than Western numerals, which like Khmer numerals were inherited from Indian, but first passed through the Arabic numerals before reaching the west.
The principal number words are listed in the following table, which gives Western and Khmer digits, Khmer spelling and IPA transcription.[45]
Value | Khmer | Word form | IPA | UNGEGN | Value | Khmer | Word form | IPA | UNGEGN |
---|---|---|---|---|---|---|---|---|---|
0 | ០ | សូន្យ | /soːn/ | sony | |||||
1 | ១ | មួយ | /muəj/ | muŏy | |||||
2 | ២ | ពីរ | /piː/ | pir | 20 | ២០ | ម្ភៃ | /mpʰɨj/, /məˈpʰɨj/ | mphey |
3 | ៣ | បី | /ɓəj/ | bei | 30 | ៣០ | សាមសិប | /saːmsəp/ | samsĕb |
4 | ៤ | បួន | /ɓuən/ | buŏn | 40 | ៤០ | សែសិប | /saesəp/ | sêsĕb |
5 | ៥ | ប្រាំ | /pram/ | brăm | 50 | ៥០ | ហាសិប | /haːsəp/ | hasĕb |
6 | ៦ | ប្រាំមូយ | /prammuəj/ | brămmuŏy | 60 | ៦០ | ហុកសិប | /hoksəp/ | hŏksĕb |
7 | ៧ | ប្រាំពីរ | /prampiː/, /prampɨl/ | brămpir | 70 | ៧០ | ចិតសិប | /cətsəp/ | chĕtsĕb |
8 | ៨ | ប្រាំបី | /pramɓəj/ | brămbei | 80 | ៨០ | ប៉ែតសិប | /paetsəp/ | pêtsĕb |
9 | ៩ | ប្រាំបួន | /pramɓuən/ | brămbuŏn | 90 | ៩០ | កៅសិប | /kawsəp/ | kausĕb |
10 | ១០ | ដប់ | /ɗɑp/ | dáb | 100 | ១០០ | មួយរយ | /muəjrɔːj/ | muŏyrôy |
Intermediate numbers are formed by compounding the above elements. Powers of ten are denoted by loan words: រយ rôy /rɔːj/ (100), ពាន់ poăn /pŏən/ (1,000), ម៉ឺន mœn /məːn/ (10,000), សែន sên /saen/ (100,000) and លាន léan /liən/ (1,000,000) from Thai and កោដិ kaôdĕ /kaot/ (10,000,000) from Sanskrit.[52]
Ordinal numbers are formed by placing the particle ទី ti /tiː/ before the corresponding cardinal number.[40]
Khmer employs a system of registers in which the speaker must always be conscious of the social status of the person spoken to. The different registers, which include those used for common speech, polite speech, speaking to or about royals and speaking to or about monks, employ alternate verbs, names of body parts and pronouns. As an example, the word for "to eat" used between intimates or in reference to animals is /siː/. Used in polite reference to commoners, it is /ɲam/. When used of those of higher social status, it is /pisa/ or /tɔtuəl tiən/. For monks the word is /cʰan/ and for royals, /saoj/.[4] Another result is that the pronominal system is complex and full of honorific variations, just a few of which are shown in the table below.[40]
Situational usage | I/me | you | he/she/it | ||||||
---|---|---|---|---|---|---|---|---|---|
Intimate or addressing an inferior | អញ | ânh | [ʔaɲ] | ឯង | êng | [ʔaeŋ] | វា | véa | [ʋiə] |
neutral | ខ្ញុំ | khnhŭm | [kʰɲom] | អ្នក | 'nâk | [neaʔ] | គេ | ké | [keː] |
Formal | យើងខ្ញុំ, ខ្ញុំបាទ | yeung khnhŭm, khnhŭm bat | [jəːŋ kʰɲom] [kʰɲom ɓaːt] |
លោក (or kinship term, title or rank) |
loŭk | [loːk] | គាត់ | koăt | [kɔət] |
Layperson to/about Buddhist clergy | ខ្ញុំព្រះករុណា | khnhŭm preăh kârŭna | [kʰɲom preah karunaː] | ព្រះតេជព្រះគុណ | preăh téch preăh kŭn | [preah ɗaec preah kun] | ព្រះអង្គ | preăh ângk | [preah ʔɑŋ] |
Buddhist clergy to layperson | អាត្មា, អាចក្តី | atma, ach kdei | [ʔatʰmaː], [ʔaːc kɗəj] | ញោមស្រី (to female) ញោមប្រុស (to male) |
nhoŭm srei (to female), nhoŭm brŏs (to male) | [ɲoːm srəj] (to female), [ɲoːm proh] (to male) | ឧបាសក (to male), ឧបាសិកា (to female) | ŭbasâk (to male), ŭbasĕka | [ʔuɓaːsɑk] [ʔuɓaːsekaː] |
when addressing royalty | ខ្ញុំព្រះបាទអម្ចាស់ or ទូលបង្គុំ (male), ខ្ញុំម្ចាស់ (female) | khnhŭm preăh bat âmchăs or tul bângkŭm (male), khnhŭm mchăs (female) | [kʰɲom preah ɓaːt ʔɑmcah] or [tuːl ɓɑŋkom] (male), [kʰɲom mcah] (female) | ព្រះករុណា | preăh kârŭna | [preah karunaː] | ទ្រង់ | tróng | [trɔŋ] |
Khmer is written with the Khmer script, an abugida developed from the Pallava script of India before the 7th century when the first known inscription appeared.[53] Written left-to-right with vowel signs that can be placed after, before, above or below the consonant they follow, the Khmer script is similar in appearance and usage to Thai and Lao, both of which were based on the Khmer system. The Khmer script is also distantly related to the Mon–Burmese script.[53] Within Cambodia, literacy in the Khmer alphabet is estimated at 77.6%.[54]
Consonant symbols in Khmer are divided into two groups, or series. The first series carries the inherent vowel [ɑː] while the second series carries the inherent vowel [ɔː]. The Khmer names of the series, [akʰoːsaʔ] ('voiceless') and [kʰoːsaʔ] ('voiced'), respectively, indicate that the second series consonants were used to represent the voiced phonemes of Old Khmer. As the voicing of stops was lost, however, the contrast shifted to the phonation of the attached vowels, which, in turn, evolved into a simple difference of vowel quality, often by diphthongization.[30] This process has resulted in the Khmer alphabet having two symbols for most consonant phonemes and each vowel symbol having two possible readings, depending on the series of the initial consonant:[17]
ត + ា | = តា | [taː] | 'grandfather' |
ទ + ា | = ទា | [tiə] | 'duck' |
The following text is from Article 1 of the Universal Declaration of Human Rights.
Khmer | មនុស្សទាំងអស់កើតមកមានសេរីភាពនិងភាពស្មើៗគ្នាក្នុងសិទ្ធិ និងសេចក្ដីថ្លៃថ្នូរ ។ មនុស្សគ្រប់រូបសុទ្ធតែមានវិចារណញ្ញាណនិងសតិសម្បជញ្ញៈ ហើយត្រូវប្រព្រឹត្ដចំពោះគ្នាទៅវិញទៅមកក្នុងស្មារតីរាប់អានគ្នាជាបងប្អូន ។ |
---|---|
UNGEGN romanization |
Mônŭss teăng ás kaeut môk méan sériphéap nĭng phéap smaeu-smaeu knéa knŏng sĕtthĭ, nĭng séchâkdei thlaithnor. Mônŭss krôb rub sŏtth tê méan vĭcharônânhnhéan nĭng sâtĕsâmbâchônhnheă, haeuy trov brâprœ̆tt châmpŏăh knéa tŏu vĭnh tŏu môk knŏng smarôtei roăp an knéa chéa bâng b'on. |
IPA transcription |
/mɔnuh tĕaŋ ɑh kaət̚ mɔːk̚ miən seːrəjpʰiəp̚ nɨŋ pʰiəp̚ smaəsmaə kniə knoŋ sət̚ nɨŋ seːc̚k̚ɗəj tʰlaj tʰnou. mɔnuh krup̚ ruːp̚ sot̚ tae miən vicaːranaɲiən nɨŋ satəsampacŏəɲɲeaʔ haəj trouʋ prɑprɨt cɑmpŭəh kniə tɨw ʋɨɲ tɨw mɔːk̚ knoŋ smaːrɔːɗəj rŏəp̚ ʔaːn kniə ciə ɓɑːŋ pʔoun/. |
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.