In this paper, it is shown that most of the language families claimed to derive from a common Nostratic protolanguage exhibit complex pronominal systems characterized by intricate patterns of recurrence and suppletion in the personal, demonstrative and possibly interrogative pronominal stems. The analysis proposed here, employing the strict principles of the comparative method, shows that these patterns are probably not due to coincidence but rather represent a new argument in favor of the Nostratic hypothesis.

It has been argued repeatedly that the Old Romance clitics are phrasal in character and obey a Romance version of the Wackernagel law, i.e. the Tobler-Mussafia law (cf. Tobler 1875; Mussafia 1888). This law has been interpreted as a constraint that forbids clitics to appear in initial position, due to their status as phonologically enclitic elements, and it was often taken as the explanation of why Old Romance clitics could follow the verb in matrix sentences but never in embedded sentences. Under this view the change in clitic placement from Old to Modern Romance was explained with a change in the categorial status of the clitic itself, i.e. as long as clitics were moving as phrases the ban on clitic-first orders was active and thus clitics were sometimes forced to follow the verb in matrix sentences. Old Catalan clitics contradict this view. They precede and follow the verb in matrix as well as in embedded sentences even though they need to be analyzed as heads from the beginning onwards. In this paper I argue that the change in clitic placement is neither connected to the clitics’ phrasal character nor to their obedience to the Tobler-Mussafia law, but that it is the parameter determining the relative order between verb and clitic that has changed its value in the development from Old to Modern Catalan. The data I present is taken from a corpus of more than 7000 sentences covering the last eight centuries.

Two computational methods for detecting borrowing among a family of genetically related languages are proposed. One method, based on the detection of branches with negative length in lexicostatistical trees, is shown to work poorly. As we demonstrate, this method is similar to another recently proposed method for detecting borrowing based on skewing in lexicostatistical data. A second method, using character-based classification techniques in common use in the classification of biological taxa, is shown to be more effective. This method allows borrowed characters and the languages among which the borrowing may have taken place to be identified — in some cases, the most likely direction of the borrowing can also be specified.