β

Swadesh list

The Swadesh list/ˈswɒdɛʃ/ is a classic compilation of basic concepts for the purposes of historical-comparative linguistics. Translations of the Swadesh list into a set of languages allow researchers to quantify the interrelatedness of those languages. The Swadesh list is named after linguist Morris Swadesh. It is used in lexicostatistics (the quantitative assessment of the genealogical relatedness of languages) and glottochronology (the dating of language divergence). Because there are several different lists, some authors also refer to "Swadesh lists".

Contents

Swadesh himself created several versions of his list. He started[1] with a list of 225 meanings, which he reduced to 165 words for the Salish-Spokane-Kalispel language. In 1952 he published a list of 215 meanings,[2] of which he suggested the removal of 16 for being unclear or not universal, with one added to arrive at 200 words. In 1955[3] he wrote, "The only solution appears to be a drastic weeding out of the list, in the realization that quality is at least as important as quantity....Even the new list has defects, but they are relatively mild and few in number." After minor corrections, he published his final 100-word list in 1971[4] and 1972.

Frequently used, not for any proven quality, but for its electronic availability via the internet, is the version by I. Dyen (1992, 200 meanings of 95 language variants). Since 2010, a team around M. Dunn has tried to update and enhance that list.[6]

In origin, the words in the Swadesh lists were chosen for their universal, culturally independent availability in as many languages as possible, regardless of their "stability". Nevertheless, the stability of the resulting list of "universal" vocabulary under language change and the potential use of this fact for purposes of glottochronology have been analyzed by numerous authors, including Marisa Lohr 1999, 2000.[7]

The Swadesh list was put together by Morris Swadesh on the basis of his intuitions. More recent similar lists, such as the Dolgopolsky list (1964) or the Leipzig–Jakarta list (2009), are based on systematic data from many different languages, but they are not yet as widely known and as widely used as the Swadesh list.

Lexicostatistical test lists are used in lexicostatistics to define subgroupings of languages, and in glottochronology to "provide dates for branching points in the tree".[8] The task of defining (and counting the number) of cognate words in the list is far from trivial, and often is subject to dispute, because cognates do not necessarily look similar, and recognition of cognates presupposes knowledge of the sound laws of the respective languages. For example, English "wheel" and Sanskrit chakra are cognates, although they are not recognizable as such without knowledge of the history of both languages.

Holman et al. (2008) found that in identifying the relationships between Chinese dialects the Swadesh–Yakhontov list was less accurate than the original Swadesh-100 list. Further they found that a different (40-word) list was just as accurate as the Swadesh-100 list. However, they calculated the relative stability of the words by comparing retentions between languages in established language families. They found no statistically significant difference in the correlations in the families of the Old versus the New World.

In studying the sign languages of Vietnam and Thailand, linguist James Woodward noted that the traditional Swadesh list applied to spoken languages was unsuited for sign languages. The Swadesh list results in overestimation of the relationships between sign languages, due to indexical signs such as pronouns and parts of the body. The modified list is as follows, in largely alphabetical order:[10]

^List, J.-M., M. Cysouw, and R. Forkel (2016): Concepticon. A resource for the linking of concept lists. In: Proceedings of the Tenth International Conference on Language Resources and Evaluation. 2393-2400. PDF