Uralic Phonetic Alphabet
Phonetic alphabet for Uralic languages From Wikipedia, the free encyclopedia
Phonetic alphabet for Uralic languages From Wikipedia, the free encyclopedia
The Uralic Phonetic Alphabet (UPA) or Finno-Ugric transcription system is a phonetic transcription or notational system used predominantly for the transcription and reconstruction of Uralic languages. It was first published in 1901 by Eemil Nestor Setälä, a Finnish linguist; it was somewhat modified in the 1970s.[1]
UPA differs from the International Phonetic Alphabet (IPA) notation in several ways, notably in exploiting italics or boldface rather than using brackets to delimit text, in the use of small capitals for voicelessness, and in more frequent use of diacritics to differentiate places of articulation.
The basic UPA characters are based on the Finnish alphabet where possible, with extensions taken from Cyrillic and Greek orthographies. Small-capital letters and some novel diacritics are also used.
Unlike the IPA, which is usually transcribed in Roman typeface, the UPA is transcribed in italic and bold typeface. Its extended characters are found in the Phonetic Extensions and Phonetic Extensions Supplement blocks. Computer font support is available through any good phonetics font, though lower-case and small-capital may not be visibly distinct in letters such as o where these look similar.
A vowel to the left of a dot is illabial (unrounded); to the right is labial (rounded).[1]
Some sources use a å as the only pair of open vowels. y and ɯ are sometimes used for rounded ü and u̮.
If a distinction between close-mid vowels and open-mid vowels is needed, the IPA letters ⟨ɛ⟩ and ⟨ɔ⟩ can be used. That row is then:
æ lies between ä and ɛ; œ between α̈ and ɔ̈; ø between ɔ̈ and ö.[2]
The UPA has dedicated characters for wildcards or to denote a vowel of uncertain quality:
The following table describes the consonants of the UPA. A 'spirant' in this usage is a non-sibilant fricative. Under 'approximants', v w j ɦ and their voiceless counterparts are 'semivowels', while ɹ ɹ̤ are 'vibrationless rhotics'. Palatalized consonants are indicated with an acute accent. Only a few are shown in the table; the velar letters with an acute are commonly used for palatal consonants.
Plosive | Spirant | Sibilant | Approx. | Lateral | Trill | Flap | Nasal | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Bilabial | p | ʙ | b | φ | β | ᴡ | w | ᴪ | ψ | ᴍ | m | |||||||
Labiodental | p̦ | ʙ̦ | b̦ | f | v̌ | ᴠ | v | m̦ | ||||||||||
Dental | ț | ᴅ̦ | d̦ | |||||||||||||||
Alveolar | t | ᴅ | d | ϑ | δ | s | ᴢ | z | ᴚ | ɹ | ʟ | l | ʀ | r | ᴆ | ð | ɴ | n |
š | ᴢ̌ | ž | ᴌ | ł | ||||||||||||||
Palatalised alveolar | t́ | ᴅ́ | d́ | ϑ́ | δ́ | ś | ᴢ́ | ź | ʟ́ | ĺ | ʀ́ | ŕ | ɴ́ | ń | ||||
š́ | ᴢ̌́ | ž́ | ||||||||||||||||
Retroflex | ṭ | ᴅ̣ | ḍ | ϑ̣ | δ̣ | ṣ | ᴢ̣ | ẓ | ʟ̣ | ḷ | ʀ̣ | ṛ | ɴ̣ | ṇ | ||||
ṣ̌ | ᴢ̣̌ | ẓ̌ | ||||||||||||||||
Palatal (prevelar) | k͕ | ɢ͕ | g͕ | |||||||||||||||
Palatalised velar | ḱ | ɢ́ | ǵ | χ́ | γ́ | j | ᴎ́ | ή | ||||||||||
Velar | k | ɢ | g | χ | γ | ᴫ | л * | ᴎ | η | |||||||||
Postvelar | k͔ | ɢ͔ | g͔ | |||||||||||||||
Uvular | k̤ | ɢ̤ | g̤ | ᴚ̤ | ɹ̤ | ʀ̤ | r̤ | ᴆ̤ | ð̤ | ɴ̤ | n̤ | |||||||
Glottal | ʔ | ȟ | ᴤ | h | ɦ |
When there are two or more consonants in a column, the rightmost one is voiced; when there are three, the centre one is lenis or partially devoiced and the leftmost one is fortis or fully devoiced.
ʟ̌ ľ (not shown in the table) are lateral fricatives. v̌ and ȟ in the table are also fricatives derived from letters for approximants.
* ᴫ л are defined as dark alveolars, with ᴌ ł being 'half-dark', but other sources define ᴫ л as velar. They are distinct in italic typeface, which is the norm for UPA phonetic notation.
Other sources have ᴃ and ᴆ for fricative ʙ ᴅ, and ᴩ ρ for the uvular trills.
The Uralic languages transcribed with this alphabet do not contain non-pulmonic consonants except paralinguistically, thus only clicks are supported by the UPA. There are two conventions: a leftward arrow, for p˿ b˿ t˿ d˿ ḱ˿ ǵ˿ etc., and Greek letters, for ᴨ π ᴛ τ ᴋ κ etc. Nasal clicks can presumably be written ᴍ˿ m˿ ɴ˿ n˿ ᴎ́˿ ή˿ etc. under the first convention.
From extremely short (superscript) to extra-long (circumflex), length of vowels and consonants is indicated as follows:
Example | Description | Use |
---|---|---|
ä | diaeresis above | 'Palatal' (front) vowel; interdental consonant (e.g. ẗ interdental t) |
ạ | dot below | 'Velar' (back) vowel; 'cacuminal' (retroflex) consonant |
a̤ | diaeresis below | Uvular consonant |
ā | macron | Long form of a vowel or consonant |
aa | doubled character | |
a͔ | left arrowhead below | Retracted form of a vowel or consonant (e.g. t͔ post-alveolar t) |
a͕ | right arrowhead below | Advanced form of a vowel or consonant (e.g. t͕ pre-alveolar t) |
a̭ | circumflex below | Raised variant of a vowel |
a̬ | caron below | Lowered variant of a vowel |
ǎ | caron above | Fricative variant of an approximant; 'wide' variant of a sibilant |
ă | breve | Shorter or reduced vowel |
a̮ | breve below | Central vowel |
a̯ | inverted breve below | Non-syllabic variant of a vowel |
á | acute accent | Palatalized variant of a consonant; may be moved to the right of letters with an ascender, as with δˊ. |
ᴀ | small capital | Unvoiced or lenis variant of a sound |
ᵃ | superscripted character | Very short sound |
ₐ | subscripted character | Coarticulation due to surrounding sounds, or intermediate sound |
ɐ | rotated character | Reduced form of sound. Letters ambiguous when rotated 180° are rotated 90°, as with ᴞ. |
For diphthongs, triphthongs and prosody, the Uralic Phonetic Alphabet uses several forms of the tie or double breve:[3][4]
A major difference is that IPA notation distinguishes between phonetic and phonemic transcription by enclosing the transcription between either brackets [aɪ pʰiː eɪ] or slashes /ai pi e/. UPA instead used italic typeface for the former and bold typeface for the latter.[5]
For phonetic transcription, numerous small differences from IPA come into relevance:
Examples:
Sound | UPA | IPA |
---|---|---|
Close-mid back rounded vowel | o̭ | [o] |
Mid back rounded vowel | o | [o̞] or [ɔ̝] |
Open-mid back rounded vowel | o̬ or α̭ | [ɔ] |
Alveolar tap | ð | [ɾ] |
Voiced dental fricative | δ | [ð] |
Voiceless alveolar lateral approximant | ʟ | [l̥] |
Velar lateral approximant | л | [ʟ] |
Voiceless alveolar nasal | ɴ | [n̥] |
Uvular nasal | n̤ | [ɴ] |
Voiceless alveolar trill | ʀ | [r̥] |
Uvular trill | r̤ | [ʀ] |
Reduced vowel | ə | [e̽] |
The IETF language tags register fonupa
as a subtag for text in this notation.[6]
Few system fonts support the small capitals. Support is available through any good phonetics font, such as (among free fonts) Gentium, Andika, Noto, DejaVu and EB Garamond, though lower-case and small-capital ᴄ, л, o, v, w and z may not be distinct in italic typeface and are rarely distinct in bold. DejaVu and EB Garamond do not support stacked the diacritics in š́, ᴢ̌́, ž́. EB Garamond includes the Unicode small capitals in its roman typeface but not in italic or bold, so automated formatting is applied. Following are pairs of small capital and lower case in these fonts; the fonts must be installed on your computer or phone to display here.
Browser default font | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
---|---|---|---|---|---|---|---|---|---|
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
Gentium Plus | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
Andika | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
Noto Serif | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
Noto Sans | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
DejaVu Serif | italic | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | ᴫ л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ | |
EB Garamond | italic | ᴄ c | - л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
bold | ᴄ c | - л | ᴏ o | ᴜ u | ᴠ v | ᴡ w | ᴢ z | š́ ž́ |
This section contains some sample words from both Uralic languages and English (using Australian English) along with comparisons to the IPA transcription.
Language | UPA | IPA | Meaning |
---|---|---|---|
English | šᴉp | [ʃɪp] | 'ship' |
English | rän | [ɹæn] | 'ran' |
English | ʙo̭o̭d | [b̥oːd] | 'bored' |
Moksha | və̂ďän | [vɤ̈dʲæn] | 'I sow' |
Udmurt | miśkᴉ̑nᴉ̑ | [miɕkɪ̈nɪ̈] | 'to wash' |
Forest Nenets | ŋàrŋū̬"ᴲ | [ŋɑˑrŋu̞ːʔə̥] | 'nostril' |
Hill Mari | pᴞ·ń₍ᴅ́ᴢ̌́ö̭ | [ˈpʏnʲd̥͡ʑ̥ø] | 'pine' |
Skolt Sami | pŭə̆ī̮ᵈt̄ėi | [pŭə̆ɨːd̆tːəi] | 'ermine' |
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.