Loading AI tools
Indo-European language From Wikipedia, the free encyclopedia
Albanian (endonym: shqip [ʃcip] , gjuha shqipe [ˈɟuha ˈʃcipɛ], or arbërisht [aɾbəˈɾiʃt]) is an Indo-European language and the only surviving representative of the Albanoid branch, which belongs to the Paleo-Balkan group.[9] It is the native language of the Albanian people. Standard Albanian is the official language of Albania and Kosovo, and a co-official language in North Macedonia and Montenegro, as well as a recognized minority language of Italy, Croatia, Romania and Serbia. It is also spoken in Greece and by the Albanian diaspora, which is generally concentrated in the Americas, Europe and Oceania.[2][10] Albanian is estimated to have as many as 7.5 million native speakers.[1][2]
Albanian | |
---|---|
Shqip | |
Arbërisht | |
Pronunciation | [ʃcip] [ˈɟuha ˈʃcipɛ] [aɾbəˈɾiʃt] |
Native to | |
Ethnicity | Albanians |
Native speakers | 7.5 million (2017)[1][2] |
Indo-European
| |
Early form | |
Dialects |
|
Official status | |
Official language in |
|
Recognised minority language in | |
Regulated by | Academy of Sciences of Albania Academy of Sciences and Arts of Kosovo |
Language codes | |
ISO 639-1 | sq |
ISO 639-2 | alb (B) sqi (T) |
ISO 639-3 | sqi – inclusive codeIndividual codes: aae – Arbëreshaat – Arvanitikaaln – Ghegals – Tosk |
Glottolog | alba1267 |
Linguasphere | to 55-AAA-ahe (25 varieties) 55-AAA-aaa to 55-AAA-ahe (25 varieties) |
Albanian and other Paleo-Balkan languages had their formative core in the Balkans after the Indo-European migrations in the region.[11][12] Albanian in antiquity is often thought to have been an Illyrian language for obvious geographic and historical reasons,[13][14][15][16][17][18] or otherwise an unmentioned Balkan Indo-European language that was closely related to Illyrian and Messapic.[19][20][21][22] The Indo-European subfamily that gave rise to Albanian is called Albanoid in reference to a specific ethnolinguistically pertinent and historically compact language group.[23] Whether descendants or sisters of what was called 'Illyrian' by classical sources, Albanian and Messapic, on the basis of shared features and innovations, are grouped together in a common branch in the current phylogenetic classification of the Indo-European language family.[24][19][23][21][22]
The first written mention of Albanian was in 1284 in a witness testimony from the Republic of Ragusa, while a letter written by Dominican Friar Gulielmus Adea in 1332 mentions the Albanians using the Latin alphabet in their writings. The oldest surviving attestation of modern Albanian is from 1462.[25] The two main Albanian dialect groups (or varieties), Gheg and Tosk, are primarily distinguished by phonological differences and are mutually intelligible in their standard varieties,[26][27] with Gheg spoken to the north and Tosk spoken to the south of the Shkumbin river.[28] Their characteristics[29][30] in the treatment of both native words and loanwords provide evidence that the split into the northern and the southern dialects occurred after Christianisation of the region (4th century AD),[31][32] and most likely not later than the 6th century AD,[33][34][35] hence possibly occupying roughly their present area divided by the Shkumbin river since the Post-Roman and Pre-Slavic period, straddling the Jireček Line.[36][37]
Centuries-old communities speaking Albanian dialects can be found scattered in Greece (the Arvanites and some communities in Epirus, Western Macedonia and Western Thrace),[38] Croatia (the Arbanasi), Italy (the Arbëreshë)[39] as well as in Romania, Turkey and Ukraine.[40] The Malsia e Madhe Gheg Albanian[41][42] and two varieties of the Tosk dialect, Arvanitika in Greece and Arbëresh in southern Italy, have preserved archaic elements of the language.[43] Ethnic Albanians constitute a large diaspora, with many having long assimilated in different cultures and communities. Consequently, Albanian-speakers do not correspond to the total ethnic Albanian population, as many ethnic Albanians may identify as Albanian but are unable to speak the language.[44][45][46]
Standard Albanian is a standardised form of spoken Albanian based on Tosk.
The language is spoken by approximately 6 million people in the Balkans, primarily in Albania, Kosovo, North Macedonia, Serbia, Montenegro and Greece.[1] However, due to old communities in Italy and the large Albanian diaspora, the worldwide total of speakers is much higher than in Southern Europe and numbers approximately 7.5 million.[1][2]
The Albanian language is the official language of Albania and Kosovo and a co-official language in North Macedonia and Montenegro.[47][48] Albanian is a recognised minority language in Croatia, Italy, Romania and in Serbia. Albanian is also spoken by a minority in Greece, specifically in the Thesprotia and Preveza regional units and in a few villages in Ioannina and Florina regional units in Greece.[38] It is also spoken by 450,000 Albanian immigrants in Greece, making it one of the commonly spoken languages in the country after Greek.
Albanian is the third most common mother tongue among foreign residents in Italy.[49] This is due to a substantial Albanian immigration to Italy. Italy has a historical Albanian minority of about 500,000, scattered across southern Italy, known as Arbëreshë. Approximately 1 million Albanians from Kosovo are dispersed throughout Germany, Switzerland and Austria. These are mainly immigrants from Kosovo who migrated during the 1990s. In Switzerland, the Albanian language is the sixth most spoken language with 176,293 native speakers.
Albanian became an official language in North Macedonia on 15 January 2019.[50]
There are large numbers of Albanian speakers in the United States, Argentina, Chile, Uruguay, and Canada. Some of the first ethnic Albanians to arrive in the United States were the Arbëreshë. The Arbëreshë have a strong sense of identity and are unique in that they speak an archaic dialect of Tosk Albanian called Arbëresh.
In the United States and Canada, there are approximately 250,000 Albanian speakers. It is primarily spoken on the East Coast of the United States, in cities like New York City, Boston, Chicago, Philadelphia, and Detroit, as well as in parts of the states of New Jersey, Ohio, and Connecticut.[citation needed]
In Argentina, there are nearly 40,000 Albanian speakers, mostly in Buenos Aires.[51][need quotation to verify]
Approximately 1.3 million people of Albanian ancestry live in Turkey, with more than 500,000 recognizing their ancestry, language and culture. There are other estimates, however, that place the number of people in Turkey with Albanian ancestry and or background upward to 5 million. However, the vast majority of this population is assimilated and no longer possesses fluency in the Albanian language, though a vibrant Albanian community maintains its distinct identity in Istanbul to this day.
Egypt also lays claim to about 18,000 Albanians, mostly Tosk speakers.[52] Many are descendants of the Janissary of Muhammad Ali Pasha, an Albanian who became Wāli, and self-declared Khedive of Egypt and Sudan. In addition to the dynasty that he established, a large part of the former Egyptian and Sudanese aristocracy was of Albanian origin. In addition to the recent emigrants, there are older diasporic communities around the world.
Albanian is also spoken by Albanian diaspora communities residing in Australia and New Zealand.
The Albanian language has two distinct dialects, Tosk which is spoken in the south, and Gheg spoken in the north.[53] Standard Albanian is based on the Tosk dialect. The Shkumbin River is the rough dividing line between the two dialects.[54]
Gheg is divided into four sub-dialects: Northwest Gheg, Northeast Gheg, Central Gheg and Southern Gheg. It is primarily spoken in northern Albania, Kosovo, and throughout Montenegro and northwestern North Macedonia. One fairly divergent dialect is the Upper Reka dialect, which is however classified as Central Gheg. There is also a diaspora dialect in Croatia, the Arbanasi dialect.
Tosk is divided into five sub-dialects, including Northern Tosk (the most numerous in speakers), Labërisht, Cham, Arvanitika, and Arbëresh. Tosk is spoken in southern Albania, southwestern North Macedonia and northern and southern Greece. Cham Albanian is spoken in North-western Greece,[55] while Arvanitika is spoken by the Arvanites in southern Greece. In addition, Arbëresh is spoken by the Arbëreshë people, descendants of 15th and 16th century migrants who settled in southeastern Italy, in small communities in the regions of Sicily and Calabria.[56][57] These settlements originated from the (Arvanites) communities probably of Peloponnese known as Morea in the Middle Ages. Among them the Arvanites call themselves Arbëror and sometime Arbëresh. The Arbëresh dialect is closely related to the Arvanites dialect with more Italian vocabulary absorbed during different periods of time.
The Albanian language has been written using many alphabets since the earliest records from the 15th century. The history of Albanian language orthography is closely related to the cultural orientation and knowledge of certain foreign languages among Albanian writers.[58] The earliest written Albanian records come from the Gheg area in makeshift spellings based on Italian or Greek. Originally, the Tosk dialect was written in the Greek alphabet and the Gheg dialect was written in the Latin script. Both dialects had also been written in the Ottoman Turkish version of the Arabic script, Cyrillic, and some local alphabets (Elbasan, Vithkuqi, Todhri, Veso Bey, Jan Vellara and others, see original Albanian alphabets). More specifically, the writers from northern Albania and under the influence of the Catholic Church used Latin letters, those in southern Albania and under the influence of the Greek Orthodox church used Greek letters, while others throughout Albania and under the influence of Islam used Arabic letters. There were initial attempts to create an original Albanian alphabet during the 1750–1850 period. These attempts intensified after the League of Prizren and culminated with the Congress of Manastir held by Albanian intellectuals from 14 to 22 November 1908, in Manastir (present day Bitola), which decided on which alphabet to use, and what the standardised spelling would be for standard Albanian. This is how the literary language remains. The alphabet is the Latin alphabet with the addition of the letters ⟨ë⟩, ⟨ç⟩, and ten digraphs: dh, th, xh, gj, nj, ng, ll, rr, zh and sh.
According to Robert Elsie:[59]
The hundred years between 1750 and 1850 were an age of astounding orthographic diversity in Albania. In this period, the Albanian language was put to writing in at least ten different alphabets – most certainly a record for European languages. ... the diverse forms in which this old Balkan language was recorded, from the earliest documents to the beginning of the twentieth century ... consist of adaptations of the Latin, Greek, Arabic, and Cyrillic alphabets and (what is even more interesting) a number of locally invented writing systems. Most of the latter alphabets have now been forgotten and are unknown, even to the Albanians themselves.
Albanian constitutes one of the eleven major branches of the Indo-European language family,[61] within which it occupies an independent position.[62] In 1854, Albanian was demonstrated to be an Indo-European language by the philologist Franz Bopp. Albanian was formerly compared by a few Indo-European linguists with Germanic and Balto-Slavic, all of which share a number of isoglosses with Albanian.[63] Other linguists linked the Albanian language with Latin, Greek and Armenian, while placing Germanic and Balto-Slavic in another branch of Indo-European.[64][65][66] In current scholarship there is evidence that Albanian is closely related to Greek and Armenian, while the fact that it is a satem language is less significant.[61]
Albanian in the Palaeo-Balkanic Indo-European branch based on the chapters "Albanian" (Hyllested & Joseph 2022) and "Armenian" (Olsen & Thorsø 2022) in Olander (ed.) The Indo-European Language Family |
Messapic is considered the closest language to Albanian,[24][23][19] grouped in a common branch titled Illyric in Hyllested & Joseph (2022).[24] Hyllested & Joseph (2022) in agreement with recent bibliography identify Greco-Phrygian as the IE branch closest to the Albanian-Messapic one. These two branches form an areal grouping – which is often called "Balkan IE" – with Armenian. The hypothesis of the "Balkan Indo-European" continuum posits a common period of prehistoric coexistence of several Indo-European dialects in the Balkans prior to 2000 BC. To this group would belong Albanian, Ancient Greek, Armenian, Phrygian, fragmentary attested languages such as Macedonian, Thracian, or Illyrian, and the relatively well-attested Messapic in Southern Italy. The common features of this group appear at the phonological, morphological, and lexical levels, presumably resulting from the contact between the various languages. The concept of this linguistic group is explained as a kind of language league of the Bronze Age (a specific areal-linguistics phenomenon), although it also consisted of languages that were related to each other.[67] A common prestage posterior to PIE comprising Albanian, Greek, and Armenian, is considered as a possible scenario. In this light, due to the larger number of possible shared innovations between Greek and Armenian, it appears reasonable to assume, at least tentatively, that Albanian was the first Balkan IE language to branch off. This split and the following ones were perhaps very close in time, allowing only a narrow time frame for shared innovations.[68]
Albanian represents one of the core languages of the Balkan Sprachbund.[61]
Glottolog and Ethnologue recognize four Albanian languages. They are classified as follows:[69][70]
The first attested written mention of the Albanian language was on 14 July 1284 in Ragusa in modern Croatia (Dubrovnik) when a crime witness named Matthew testified: "I heard a voice crying on the mountain in the Albanian language" (Latin: Audivi unam vocem, clamantem in monte in lingua albanesca).[71][72]
The Albanian language is also mentioned in the Descriptio Europae Orientalis[73] dated in 1308:
Habent enim Albani prefati linguam distinctam a Latinis, Grecis et Sclauis ita quod in nullo se intelligunt cum aliis nationibus. (Namely, the above-mentioned Albanians have a language that is different from the languages of Latins, Greeks and Slavs, so that they do not understand each other at all.)
The oldest attested document written in Albanian dates to 1462,[74] while the first audio recording in the language was made by Norbert Jokl on 4 April 1914 in Vienna.[75]
However, as Fortson notes, Albanian written works existed before this point; they have simply been lost. The existence of written Albanian is explicitly mentioned in a letter attested from 1332, and the first preserved books, including both those in Gheg and in Tosk, share orthographic features that indicate that some form of common literary language had developed.[76]
By the Late Middle Ages, during the period of Humanism and the European Renaissance, the term lingua epirotica 'Epirotan language' was preferred in the intellectual, literary, and clerical circles of the time, and used as a synonym for the Albanian language.[77] Published in Rome in 1635, by the Albanian bishop and writer Frang Bardhi, the first dictionary of the Albanian language was titled Latin: Dictionarium latino-epiroticum 'Latin-Epirotan dictionary'.[78][79]
During the five-century period of the Ottoman presence in Albania, the language was not officially recognised until 1909, when the Congress of Dibra decided that Albanian schools would finally be allowed.[citation needed]
Albanian is an isolate within the Indo-European language family; no other language has been conclusively linked to its branch. The only other languages that are the sole surviving members of a branch of Indo-European are Armenian and Greek.[80][lower-alpha 1]
The Albanian language is part of the Indo-European language family and the only surviving representative of its own branch, which belongs to the Paleo-Balkan group.[81][24][82][83][84] Although it is still uncertain which ancient mentioned language of the Balkans it continues, or where in the region its speakers lived.[lower-alpha 2] In general, there is insufficient evidence to connect Albanian with one of those languages, whether Illyrian, Thracian, or Dacian.[lower-alpha 3]Among these possibilities, Illyrian is the most probable.[lower-alpha 4]
Although Albanian shares lexical isoglosses with Greek, Germanic, and to a lesser extent Balto-Slavic, the vocabulary of Albanian is quite distinct.[87] In 1995, Taylor, Ringe, and Warnow used quantitative linguistic techniques that appeared to obtain an Albanian subgrouping with Germanic, a result which the authors had already reasonably downplayed.[clarification needed][88][89] Indeed, the Albanian and Germanic branches share a relatively moderate number of lexical cognates. Many shared grammatical elements or features of these two branches do not corroborate the lexical isoglosses.[89] Albanian also shares lexical linguistic affinity with Latin and Romance languages.[90][91][92] Sharing linguistic features unique to the languages of the Balkans, Albanian also forms a part of the Balkan linguistic area or sprachbund.[93][94]
The place and the time that the Albanian language was formed are uncertain.[95] The American linguist Eric Hamp has said that during an unknown chronological period a pre-Albanian population (termed as "Albanoid" by Hamp) inhabited areas stretching from Poland to the southwestern Balkans.[96] Further analysis has suggested that it was in a mountainous region rather than on a plain or seacoast. The words for plants and animals characteristic of mountainous regions are entirely original, but the names for fish and for agricultural activities (such as ploughing) are borrowed from other languages.[82][97]
A deeper analysis of the vocabulary, however, shows that could be a consequence of a prolonged Latin domination of the coastal and plain areas of the country, rather than evidence of the original environment in which the Albanian language was formed. For example, the word for 'fish' is borrowed from Latin, but not the word for 'gills' which is native. Indigenous are also the words for 'ship', 'raft', 'navigation', 'sea shelves' and a few names of fish kinds, but not the words for 'sail', 'row' and 'harbor'; objects pertaining to navigation itself and a large part of sea fauna. This rather shows that Proto-Albanians were pushed away from coastal areas in early times (probably after the Latin conquest of the region) and thus lost a large amount (or the majority) of their sea environment lexicon. A similar phenomenon could be observed with agricultural terms. While the words for 'arable land', 'wheat', 'cereals', 'vineyard', 'yoke', 'harvesting', 'cattle breeding', etc. are native, the words for 'ploughing', 'farm' and 'farmer', agricultural practices, and some harvesting tools are foreign. This, again, points to intense contact with other languages and people, rather than providing evidence of a possible linguistic homeland (also known as a Urheimat).[citation needed]
The centre of Albanian settlement remained the Mat River. In 1079, the Albanians were recorded farther south in the valley of the Shkumbin River.[98] The Shkumbin, a 181 km long river that lies near the old Via Egnatia, is approximately the boundary of the primary dialect division for Albanian, Tosk and Gheg. The characteristics of Tosk and Gheg in the treatment of the native words and loanwords from other languages are evidence that the dialectal split preceded the Slavic migrations to the Balkans,[54][32][99] which means that in that period (the 5th to 6th centuries AD), Albanians were occupying nearly the same area around the Shkumbin river, which straddled the Jireček Line.[100][97]
References to the existence of Albanian as a distinct language survive from the 14th century, but they failed to cite specific words. The oldest surviving documents written in Albanian are the "formula e pagëzimit" (Baptismal formula), Un'te paghesont' pr'emenit t'Atit e t'Birit e t'Spertit Senit. ("I baptize thee in the name of the Father, and the Son, and the Holy Spirit") recorded by Pal Engjelli, Bishop of Durrës in 1462 in the Gheg dialect, and some New Testament verses from that period.
The linguists Stefan Schumacher and Joachim Matzinger (University of Vienna) assert that the first literary records of Albanian date from the 16th century.[101][102] The oldest known Albanian printed book, Meshari, or "missal", was written in 1555 by Gjon Buzuku, a Roman Catholic cleric. In 1635, Frang Bardhi wrote the first Latin–Albanian dictionary. The first Albanian school is believed to have been opened by Franciscans in 1638 in Pdhanë.
One of the earliest Albanian dictionaries was written in 1693; it was the Italian manuscript Pratichae Schrivaneschae authored by the Montenegrin sea captain Julije Balović and includes a multilingual dictionary of hundreds of the most frequently used words in everyday life in Italian, Slavic, Greek, Albanian, and Turkish.[103]
Pre-Indo-European (PreIE) sites are found throughout the territory of Albania. Such PreIE sites existed in Maliq, Vashtëmi, Burimas, Barç, Dërsnik in the Korçë District, Kamnik in Kolonja, Kolsh in the Kukës District, Rashtan in Librazhd, and Nezir in the Mat District.[104] As in other parts of Europe, these PreIE people joined the migratory Indo-European tribes that entered the Balkans and contributed to the formation of the historical Paleo-Balkan tribes. In terms of linguistics, the pre-Indo-European substrate language spoken in the southern Balkans probably influenced pre-Proto-Albanian, the ancestor idiom of Albanian.[104] The extent of this linguistic impact cannot be determined with precision due to the uncertain position of Albanian among Paleo-Balkan languages and their scarce attestation.[105] Some loanwords, however, have been proposed, such as shegë 'pomegranate' or lëpjetë 'orach'; compare Pre-Greek λάπαθον, lápathon 'monk's rhubarb'.[106][104]
The earliest known texts in Albanian:
Albanian scripts were produced earlier than the first attested document, formula e pagëzimit, but none yet have been discovered. We know of their existence by earlier references. For example, a French monk signed as "Broccardus" notes, in 1332, that "Although the Albanians have another language totally different from Latin, they still use Latin letters in all their books".[112]
In 1967 two scholars claimed to have found a Letter text in Albanian inserted into the Bellifortis text, a book written in Latin dating to 1402–1405.[113]
"A star has fallen in a place in the woods, distinguish the star, distinguish it.
Distinguish the star from the others, they are ours, they are.
Do you see where the great voice has resounded? Stand beside it
That thunder. It did not fall. It did not fall for you, the one which would do it.
...
Like the ears, you should not believe ... that the moon fell when ...
Try to encompass that which spurts far ...Call the light when the moon falls and no longer exists ..."
Robert Elsie, a specialist in Albanian studies, considers that "The Todericiu/Polena Romanian translation of the non-Latin lines, although it may offer some clues if the text is indeed Albanian, is fanciful and based, among other things, on a false reading of the manuscript, including the exclusion of a whole line."[114]
In 1635, Frang Bardhi (1606–1643) published in Rome his Dictionarum latinum-epiroticum, the first known Latin-Albanian dictionary. Other scholars who studied the language during the 17th century include Andrea Bogdani (1600–1685), author of the first Latin-Albanian grammar book, Nilo Katalanos (1637–1694) and others.[115]