The English Cowpath: Indo-European

Showing posts with label Indo-European. Show all posts

Sunday, March 3, 2013

The Simplification of English – Stage 1, Proto-Germanic

While our children were in high school we hosted several European exchange students in our home for whom English was a second, third, or even fourth language. I was surprised when they told me that they considered English an easy language to learn, or at least less complicated than their own (French, Polish, German and Hungarian). What they meant by this was the lack of gender for most nouns and the simplified verb forms, not our beloved illogical English spelling.

John McWhorter, in “Myths, Lies, and Half-Truths of Language Usage” (The Great Courses), explains two quite different stages in the development of English where simplification occurred. The first occurred at the Proto-Germanic stage and therefore affected all the Germanic languages. The second occurred between Old and Middle English. In this post I’ll explore the first.

Compared to the other branches of Indo-European (e.g. Latin, Greek, Celtic, Persian, Sanskrit and Slavic), Proto-Germanic is different in several striking ways. One was the shift in consonants which is known as Grimm’s Law (discussed in an earlier post) where “p” becomes “f”, “k” becomes “h” etc. Another was a significant loss of complexity of verb forms. Where other IE language families have different verb forms for I, we (you & I), we (you, I & others), you (singular), you (plural) and they, PG had only one “we” and had the same verb form for “we”, “you all” and “they”. Verb tenses were also simplified to four (present, past, subjunctive and passive) from six in Indo-European. There were also some internal vowel changes in verb tenses introduced to Proto-Germanic (which I don’t understand so won’t attempt to explain here). Then there is vocabulary – many Germanic words were borrowed early on from the neighboring Indo-European languages of Roman and Celtic, but up to a third of Germanic roots are believed to be of non-Indo-European source. What was going on here?

This type of language change (significant simplification) is not the normal pattern. In fact it is only seen when a language is learned by a group as adults. Who could these adults be?

First keep in mind that nearly all of the wide distribution of Indo-European languages is the result of imposition of the language on existing inhabitants by an introduced ruling class (either military or mercantile), rather than by mass migration of Indo-European speakers. See my post on the PIE Homeland. It had been proposed by Colin Renfrew (and others) that the Indo-European languages were introduced to Europe along with agriculture by farmers migrating west and north from Anatolia (modern Turkey). It made a nice neat theory but turned out not to be true – agriculture preceded IE languages by several thousand years in Europe. There were therefore, by the time the Indo-European language was advancing up the Danube into northwest Europe, farmers settled there with their own languages, whatever they may have been. It was these people who learned Indo-European as adults and adopted it as their own.

There is some linguistic evidence suggesting that these northwestern European people who learned Indo-European, and made it into Proto-Germanic, spoke a Semitic language (of which Arabic and Hebrew are members). Semitic languages are rich in the internal vowel changes of the kind seen in Proto-Germanic verbs; the fricative consonants “f”, “h”, and “th” introduced to Proto-Germanic as explained in Grimm’s Law are common in Semitic languages, and a few of the non-PIE Germanic roots seem to have a Semitic connection. This Semitic influence theory, promoted by Prof. Theo Vennemann, is quite controversial.

One theory to explain a Semitic influence on Germanic languages has the Phoenicians, a Semitic speaking sea-faring people who are known to have reached Portugal, to have sailed all the way around to north-western Europe – the Netherlands perhaps or even Denmark. This is highly speculative at best, as there is no archaeological evidence and so far to my knowledge no genetic evidence, to support their presence that far north. My theory (and you saw it here first) is that some influence of Semitic languages had accompanied agriculture as it made its way from the middle east to northwestern Europe. One of Bryan Sykes' maternal ancestors of modern Europeans described in his 2001 book " The Seven Daughters of Eve" is Jasmine who originated in the middle east and is associated with the spread of agriculture into Europe. Her genes spread into Europe and it seems quite likely that her language did as well, hundreds of years before the introduction ofGermanic Indo-European languages.

Whoever the first speakers of Proto-Germanic were, we owe them our gratitude for helping to simplify not only English but every other Germanic language spoken today.

The next stage of simplification for English, which occurred during the transition from Old English to Middle English, is credited to, of all people, the Norse Vikings who settled in northern England. But that’s a story for another week.

Tuesday, March 6, 2012

The Proto-Indo-European Homeland Puzzle

The Indo-European Language Family

Indo-European was the first language family to be identified. This discovery, and the beginning of modern linguistics, can be dated to February 2, 1786 at a gathering of scientists and other interested men. Sir William Jones, speaking at the Asiatic Society in Calcutta, made this astounding statement:

The Sanskrit language, whatever be its antiquity, is of a wonderful structure: more perfect than the Greek, more copious than the Latin, and more exquisitely refined than either; yet bearing to both of them a stronger affinity, both in the roots of verbs and in the forms of grammar, than could possibly have been produced by accident; so strong indeed, that no philologer could examine them all three, without believing them to have sprung from some common source, which, perhaps, no longer exists.

Jones later added Persian and Celtic as likely members of this family of languages.

Jones was uniquely qualified to make this discovery. His parental language was Welsh; he was taught English at school; he learned classical Greek and Latin in university where he studied law; he wrote the first English grammar of the Persian language (which earned him a reputation as one of the most respected linguists in Europe); and when appointed a judge in India at age 37 set out to learn the Sanskrit language to better understand local laws. Thus by age 40 Jones was familiar with a language in 6 (out of a total of 12) different Indo-European language branches.

Indo-European languages are spoken today by over 3 billion people - about half of the world's population - as either a first or second language. These languages are divided into 10 or 12 language branches or subfamilies. See the attached graph (Figure 1.1 of The Horse, The Wheel and Language p.12) which is arranged more or less geographically. English is a member of the Germanic subfamily along with German, Dutch, Frisian, the Scandinavian languages (which includes Icelandic), Yiddish, and Afrikaans. Other languages to note include:

Tocharian – two extinct languages found in western

China

, the farthest East branch

Hittite – a member of the extinct Anatolian branch – the earliest branch to separate

Romany – the language of the Gypsies of Europe, is a member of the Indic branch showing that they originated in northwest India (not to be confused with Rumanian which is a member of the Latin or Romance language branch)

Source: Figure 1.1 of The Horse, The Wheel and Language p.12

About 6,000 to 5,000 years ago the parent language, called Proto-Indo-European, was spoken by a semi-nomadic tribe of people in the southern Ukraine and Russia. How their language spread and evolved into all of all these languages could be the subject for a future lecture. Today I want to show how historical linguistics and archaeology were combined to solve the puzzle of who the speakers of Proto-Indo-European were, and where and when they lived.

Source: Figure 1.2 of The Horse, The Wheel and Language p.14

The Proto-Indo-European Homeland Puzzle

Since the discovery of the IE language family, the location of the homeland of the original speakers has been claimed by different people to be many different places: India, Pakistan, Syria/Lebanon, the Caucasus Mountains, Turkey, Armenia, Kazakhstan, Russia, Ukraine, the Balkans and Germany. By the late 20^th century linguists only seriously considered two of these – Anatolia (modern Turkey) and the steppes of southern Ukraine and Russia. And as recently as 2000, Calvert Watkins in his essay “Indo-European and the Indo-Europeans” which introduces his book The American Heritage Dictionary of Indo-European Roots stated “Archaeologists have not in fact succeeded in locating the Indo-Europeans.”

Colin Renfrew was a strong supporter of the other serious contender, Anatolia. Renfew's elegant proposal, published in the 1990's, had Proto-Indo-European migrant farmers carry their language along with agriculture from the Middle East to the westernmost part of Europe. But like many elegant theories, this one turned out to be not true. (I was greatly disappointed when linguistics and DNA analysis disproved Thor Heyerdahl's theories of Polynesian origins). There are, as we will see, serious problems with Renfrew's theory.

Before going further, I need to emphasize one point. Proto-Indo-European is a language. It is not a culture, nor is it a genetically-definable population. Language does not necessarily follow cultural boundaries, which can be determined by archaeology. Every first year archaeology student is taught “pots are not people”. But we know that someone must have spoken this language, and they must have lived in a particular place during a particular time. So while looking for the speakers of Proto-Indo-European we need to be careful of this constraint.

Clues from the Language

Since Proto-Indo-European is a language, let's look first at clues to the homeland from the language itself. The American Heritage Dictionary of Indo-European Roots published in 2000 contains 1350 reconstructed root words and several thousand more words based on these roots. These words have been painstakingly reconstructed by comparing similar words (called cognates) from the daughter languages over the more than 200 years since Jones' discovery. What can we learn about the people who spoke this language from their vocabulary?

- they knew four seasons with snow in winter

- they were not familiar with tropical plants or animals

- animals include: wolf, lynx, elk beaver, otter, mouse, fish

- birds include: crane, goose, duck, eagle, woodpecker

- insects: wasp, hornet, fly, louse, bee, honey (mead)

- domestic animals include: dog, cattle, sheep and horse

- horses play an important role in the culture

- they practiced spinning and weaving of wool

- they knew metallurgy - copper

- they knew of the wheel and used wagons or carts (weak link in Anatolian)

- they knew of boats and oars - words like nav (navigate, navy) and rowing.

- gift exchange is an important part of their culture

- the guest-host relation was important – *ghosti is the root of both host and guest (ghost originally meant visitor or guest)

- they borrowed words from Proto-Uralic, another Eurasian language family, suggesting that the Proto-Indo-European speakers must have lived close to, and likely traded with, people who spoke Proto-Uralic who then, as now, live in northern Europe and Siberia (Hungarian is a member of this family found in Europe because of recent migration (~900CE).

The seasons and animals indicate a northern location either in or adjacent to a forest. The words for bee and honey place the homeland west of the Ural Mountains as honeybees do not occur east of there.

Clues to Dating Proto-Indo-European

Language can also help place the Proto-Indo-European speakers in time as well as location.

Agriculture was introduced to Europe between 6700 and 6500 BC while the wheel was not known until 3400 BC and woolen textiles sometime after 4000 BC. For the daughter language families to have similar words for the wheel and wool, they must have separated from Proto-Indo-European after their arrival. This effectively eliminates the Anatolian farmer immigrant theory. Besides, the two or three Anatolian languages were very similar to each other and spoken by only a small number of people in this area, which strongly suggests they are spoken by Indo-European speaking migrants to Anatolia, not by the ancestors of the language.

The domestication of the horse provides additional clues. Horses were hunted for meat by the people of the steppe for millennia before they were domesticated. They were first domesticated sometime after 4800 BC, a thousand years after cattle were introduced to the area. But they were raised for their meat only. During a cool dry period (4200-3800 BC) horses would have an advantage over cattle because they can forage for themselves during the winter. [Pioneer farmers in Saskatchewan like my grandfather often turned their horses loose for the winter to manage for themselves, rounding them up in the spring]. Riding of horses began on the steppes sometime before 3700 BC and had spread to Northern Kazakhstan, the Caucasus Mountains, and into Europe, by 3000 BC.

An important tool used in the dating of horse riding is bit wear on horse molars. The identification of tooth wear caused by bits of metal, bone, rope and rawhide, was pioneered by the author of The Horse, The Wheel and Language – David W. Anthony, and his wife, fellow archaeologist Dorcas Brown. There is an interesting Saskatchewan connection here. One of the experts they contacted was Hilary Clayton who began studying the mechanics of bits in horses’ mouths while working in Philadelphia, and then took a job at the Western Veterinarian College in Saskatoon. Anthony and Brown followed her to Saskatchewan in 1985 and viewed the X-ray videos she had made of horses chewing their bits.

Riding horses provided a significant benefit to herders in the steppes. A man on horseback could manage a herd of cattle or sheep much larger than a man on foot. With the much later advent of wheeled carts, about 3300 BC, the herders could carry with them tents, food and water allowing them to take advantage of the vast areas between the river valleys. This opened up the steppe much as the horse did to the plains of North America 5,000 years later.

Dating the Daughters

Language provides clues to timing in another way. Linguists can date, with more or less certainty, when each of the daughter language branches separated from the mother language. Here is a list of the branches, in the order of separation, with the approximate date (all BC) of separation (from Figure 3.2 The Horse, The Wheel and Language p. 57).

Anatolian 4200

Tocharian 3700 - 3300

Germanic 3300

Celtic / Italic 3000

Greek / Armenian 2500

Balto-Slavic 2500

Indo-Iranian 2500-2200

Clues from Archaeology – The Kurgan Cultures

With the time line narrowed to the period 4000 to 2000 BC, it's time to look at the archaeological record and see who was living in the likely homelands and how well they fit with the linguistic clues. The archaeology of the Pontic-Caspian steppes was mostly carried out by Soviet scientists and published in Russian. These were not translated into English until the 1990s. Anthony was one of the first western archaeologists to study this work and relate it to the Proto-Indo-European homeland question.

Anthony found a close fit with the western steppe peoples who built huge burial mounds called kurgans. Their culture varied somewhat over the Proto-Indo-European time line and also geographically from place to place within this large area, but their overall cultures were similar, especially compared to the foragers to the north and east and to the sophisticated farming cultures to the west and south. They were semi-nomadic, raising cattle and sheep. Horses were important both for meat and for riding to manage their growing herds. They used wheeled carts. They mined their own ore and made their own tools and weapons of copper, tin and bronze.

Even more compelling is the evidence, from archaeology, of known migrations out of the steppes in the right directions and at the right times to account for the birth of the daughter language families.

1) to the west 4200-3900 (Anatolian)

2) to the east 3700-3300 (Tocharian)

3) to the west - several waves (Germanic, Celtic, Italic)

4) to north (Baltic, Slavic)

5) to the east and south (Iranian, Indic)

I should explain that by migration I do not mean large scale movement of people displacing existing populations along with their culture and language. This may have been the case with the Pre-Tocharians who made a remarkably long migration in one jump to the Altai Mountains 2000 km to the east (equivalent to the journey made by my grandparents from southern Ontario to Saskatchewan, but without the advantage of trains). Most if not all the other migrations were by small groups who, through some combination of trade or intimidation, became rulers of existing populations. They brought with them enough of their culture to be recognized archaeologically; and they brought their language which, for a variety of reasons, was adopted by the others and continued to spread long after they were gone.

Puzzle Solved

While there may be a few objections to his theory not yet satisfactorily answered, Anthony is convinced that the Proto-Indo-European Homeland puzzle has been solved.

Source: Figure 5.1 of The Horse, The Wheel and Language p.84

I want to finish with a quote from The Horse, The Wheel and Language p. 464

Understanding the people who lived before us is difficult, particularly the people who lived in the prehistoric tribal past. Archaeology throws a bright light on some aspects of their lives but leaves much in the dark. Historical linguistics can illuminate a few of those dark corners.

Monday, January 23, 2012

Grimm’s Law

In linguistic circles Jacob Grimm (1785-1863) is better known for his discovery of phonetic changes in the development of Proto-Germanic than for his German folk tales.

Jacob was trained as a lawyer but was more interested in history and language – early German literature in particular. He authored many scholarly books including Geschichte der Deutschen Sprache (History of the German Language), Deutsche Grammatik (German Grammar), and the monumental Deutsches Wörterbuch (German Dictionary). In 1812 and 1814 he and his younger brother Wilhelm published their collection of old German folk tales as Grimms Märchen (Grimms’ Fairy Tales) Volumes 1 and 2.

Grimm’s Law was first published in the second edition of his German Grammar in 1822. It is considered significant in linguistic science for introducing “…a rigorous methodology to historic linguistic research.” [http://en.wikipedia.org/wiki/Jacob_Grimm].

Sometime during the progression from Proto-Indo-European (about 2000 BC) to Proto-Germanic (about 500 BC), certain sound shifts in consonants occurred which were nearly universal (occurring to all words). These changes did not occur in any other PIE daughter languages such as the Proto-Romance language, so are observable with comparisons between English words taken directly from Latin, French or other Romance languages and those from its Germanic roots.

Greatly simplified, the changes are (with examples):

Labials (sounds made with the lips)

p changes to f: ped / foot; pater / father

f changes to b: fund / bottom

b changes to p: labial / lip

Velars (sounds made mid-palate)

k changes to h: canine / hound

h changes to g: host / guest

g changes to k: genuflect / knee

Dentals (sounds made with the tongue and teeth)

t changes to th: triple / three

th changes to d: thyroid / door

d changes to t: duo / two

Note that in all three groups, the changes go full circle, sort of like musical chairs.

Verner’s Law, developed by Danish linguist Karl Verner, explains further changes that occurred later to certain consonants in certain conditions (depending on the accent of the preceding syllable).

These sound changes are evident in all of the modern Germanic languages: German, Frisian, Dutch (and Africaans), Danish, Swedish, Norwegian, Icelandic, Faroese, Luxembourgish, and of course English.

Sound changes like this are common in language history and take several generations, often a few centuries, to complete. A more recent example in English is known as the Great Vowel Shift which occurred between 1400 and 1600 (between Chaucer and Shakespeare). That’s a subject for a future post.

Sunday, December 4, 2011

PIE in the Steppes

On a whirlwind visit from Ukraine to his family in Canada, my big brother of Blog Fodder fame returned yesterday a book I had loaned him a few years ago: The Horse, the Wheel, and Language by David W. Anthony (2007).

Anthony is an archaeologist who, with his wife – fellow archaeologist Dorcas Brown – did extensive field work in the southern steppes of Russia, Ukraine and

Kazakhstan

. Although not a linguist, he learned enough about linguistics to understand and appreciate what can be learned about ancient people from their language. His unique understanding of both steppe archaeology and linguistics [1] enabled him to make a persuasive case for the homeland of the Proto-Indo-European (PIE) language-speaking people as the western Eurasian steppes (grasslands) of more than 5,000 years ago.

In the 200 years since PIE was first discovered, historical linguists have reconstructed more than 1500 roots (and several thousand more words based on these roots) of this proto language. Anthony devotes a chapter of his book explaining, in simple terms, how the process of word reconstruction – both sounds and meaning – works so that non-linguistic readers will have some confidence in the results. From this lexicon (vocabulary) much information can be gleaned about its speakers that can’t be learned from archaeology alone. Anthony writes: “If we can combine the Proto-Indo-European vocabulary with a specific set of archaeological remains, it might be possible to move beyond the usual limitations of archaeological knowledge and achieve a much richer knowledge of these particular ancestors.” (p.5)

Here are some of the things learned about the environment, social life and beliefs of the PIE speakers from their reconstructed lexicon:

They had words for otter, beaver, wolf, lynx, elk, hare, mouse, goose, crane, eagle, bee and honey
They raised cattle, sheep, pigs and horses
They wove woolen cloth
They drove wagons or carts
Their society was patrilineal (rights and duties were inherited from the father)
They likely had formal warrior bands (armies)
They recognized a male sky deity

Some of these could be discovered through archaeology (bones of animals hunted for food, bit wear on horses indicating domestication for riding, and possibly cart artifacts); the other “practices and beliefs are simply unrecoverable through archaeology.” (p.15)

It seems incredible to me that the English language can be traced back with a fair degree of confidence to a language spoken 5-6 thousand years ago. Note that even though, as explained in my last post, only about 26% of English words are of Germanic origin, most of the borrowed words are from other Indo-European languages, particularly French and Latin but also Old Norse, Spanish, Italian and even Hindi.

I’ll share more from this book in future posts – I’m only on chapter 3 of 17.

____________________________________________________

[1] Typically historical linguists and archaeologists share a high degree of distrust of each other’s work. Anthony explains: “Both linguists and archaeologists have made communication across the disciplines almost impossible by speaking in dense jargons that are virtually impenetrable to anyone but themselves. Neither discipline is at all simple, and both [appear confusing] to an outsider… Historical linguistics is not taught regularly in graduate archaeology programs ... nor is archaeology taught to graduate students in linguistics.” (p.5). Anthony gives credit to a colleague James P. Mallory as “…perhaps the only double qualified linguist-archaeologist in Indo-European studies”. Mallory’s 1989 book In Search of the Indo-Europeans was unable to come to any firm conclusion as to the PIE homeland. Anthony explains that it was recent archaeological discoveries that enabled him to confidently locate the homeland in the steppes.

Tuesday, August 9, 2011

Hittite - the missing clue

Donna and I have been watching more from The Story of Human Language course. Last night we watched several lectures on the Indo-European language family. I'll write more about Indo-European later but want to share an interesting story about it. I had previously read that the discovery of Hittite writings had proven some theory about Proto-Indo-European (the mother language of the Indo-European language family) but had not really understood it. In one of his lectures McWhorter explained it simply enough for me to comprehend it.

Indo-European was the first language family identified, credited to Sir William Jones, a British born scholar living in India. His address to the Bengal Asiatic Society in 1786 in which he related his observations of similarities between Latin, Greek and Sanskrit and postulated their having a common ancestor, is credited with being the birth of modern linguistics. A whole linguistic industry grew from this discovery with linguists attempting to reconstruct the ancestral language called Proto-Indo-European (PIE). One of the books in my language library is the American Heritage Dictionary of Indo-European Roots.

By comparing cognate words in surviving modern languages (and written historical languages), linguists work backwards using known laws of language change to recreate what the original word must have been. This process is called comparative reconstruction, and is also used to recreate the grammar.

Many languages have strong preferences for word construction. McWhorter provides the example of Japanese where words either end in a vowel or the consonant N. PIE words seemed to be mostly one syllable with a consonant-vowel-consonant construction. One group of exceptions are words that are consonant-vowel where the vowel is long. Here's where the theory comes in.

Ferdinand de Saussure (1), described as a pioneering linguist working in the late 1800s, wondered if these exceptions had at one time had a consonant at the end which had since been dropped. From other known language changes, it was known that throaty consonants called laryngeals (like H) cause the preceding vowel to become long, so de Saussure developed the theory that the missing consonants were laryngeals. The other linguists of the day strongly rejected this theory on the grounds that there was no evidence of it in any known language. His theory was vindicated some 50 years later with the discovery of Hittite texts (on clay tablets) in Anatolia (modern Turkey). Hittite (2) turned out to be what is now believed to be the earliest known IE language and had the missing consonants just where de Saussure predicted. This find not only vindicated the laryngeal theory but more generally supported the entire process of comparative reconstruction.

Footnotes:
(1) Ferdinand de Saussure's father Henri was a scholar of, among several other disciplines, entomology. See my post from July 20 http://englishcowpath.blogspot.com/2011/07/etymology-vs-entomology.html
(2) Hittites - the Anatolian Hittites were named for the people that the ancient Hebrews ran into in the Old Testament but it is debatable at best if the two groups are related (source: Wikipedia).