September 25, 2008

ASHG 2008 abstracts

Just a sample of abstracts that I found interesting from the upcoming meeting of the American Society of Human Genetics.

Strong linkage disequilibrium for the frequent GJB2 35delG mutation in the Greek population.

Up to forty percent of autosomal recessive, congenital, severe to profound hearing impairment cases result from mutations in the GJB2 gene. The 35delG mutation accounts for the majority of mutations detected in Caucasian populations and represents one of the most frequent disease mutations identified so far. Some previous studies have assumed that the high frequency of the 35delG mutation reflects the presence of a mutational hot spot, whilst other studies support the theory of a common founder. Greece is amongst the countries presenting the highest frequency of the 35delG mutation (3.5%), and a recent study raised the hypothesis of the origin of this mutation in ancient Greece. We genotyped 60 Greek deafness patients homozygous for the 35delG mutation for six single nucleotide polymorphisms (SNPs) and two microsatellite markers, mapping within or flanking the GJB2 gene, as compared to 60 Greek hearing controls. A strong linkage disequilibrium was found between the 35delG mutation and the DNA markers at distances of 34 kb on the centromeric and 90 kb on the telomeric side of the gene, respectively. A comparison of the present findings with those of a previous study from Belgium, UK and USA, demonstrated a common haplotype reflecting the common founder. Our study supports the hypothesis of a founder effect and we further propose that ethnic groups of Greek ancestry could have propagated the 35delG mutation, as evidenced by historical data beginning from the 15th century BC.

Detection of population substructure among Jews and a north/south gradient within Ashkenazi Jews using 32 STR markers.

Understanding and detecting population substructure are critical issues. Using 32 autosomal STR markers and the program STRUCTURE we demonstrated differentiation between Ashkenazi (AJ) (N=135) and Sephardic (SJ) (N=226) Jewish populations in the form of Northern and Southern European genetic components (AJ north 73%, south 22%, SJ north 32%, south 61%) and a significant relationship between latitude of grandparental country of origin (GCO) and percent north/south genetic component in AJ. Notably, we revealed substructure among Jews (and among European Americans (EA)) using a small STR panel, only when additional samples representing major continental populations (African American, EA, Asian) were included in analyses. Further, negative RIS (-0.035) indicates recent admixture in individuals with both SJ and AJ parents (N=38). RIS is a measure of inbreeding adapted from FIS for STR markers. Negative RIS indicates allelic variation within individuals greater than expected under random mating, i.e., excess heterozygosity due to outbreeding. Although geographic patterns are seen in the average north/south percent assignment values between groups as defined by AJ or SJ, grandparental world region of origin, or GCO, within each group there is high variability among individual assignment values. Thus, even based on data from a small marker set, AJ is not a homogeneous population. The north/south gradient in AJ may be a reflection of the pre-existing north/south gradient in European host populations (recently shown in other studies using large numbers of SNPs) with which Jews admixed slowly. We also demonstrate the utility of including purported parental populations when attempting to detect population substructure within closely related populations.

Mutation meltdown of mitochondrial DNA and Neanderthal extinction.

There is emerging evidence that mitochondrial DNA (mtDNA) plays and integral role in the evolution of the human species. Although contentious, recent phylogenetic studies of modern humans implicate genetic variation of mitochondrial DNA (mtDNA) as a major factor underpinning the climatic adaptation of across the globe. Greater sequence diversity in the MTATP6 gene in arctic populations led to the idea that specific mtDNA polymorphisms cause subtle uncoupling of the respiratory chain, with the subsequent generation of additional heat being adaptive in northern climes. Our knowledge of mtDNA and its affect on adaptability may help us to understand how modern humans have survived their early ancestors. Here, we characterise the mtDNA of one of these extinct hominids. Neanderthals are the closest hominid relatives of modern humans, who up until 30,000 years ago coexisted in Europe and western Asia. Recently, over 1Mb of DNA was successfully extracted and characterised from the Vi-80 Neanderthal fossil. We reanalysed 2,705 base pairs of mtDNA in order to examine the hypothesis that mitochondrial dysfunction contributed to the Neanderthals demise. We identified thirty-two nucleotide differences from the modern human mtDNA reference sequence and by treating the Vi-80 as a diagnostic sample leads us to the conclusion that sequence variants that are highly likely to be artifacts, and a large proportion of the remaining mutations could be due to nuclear pseudogene amplification. We did identify a potentially deleterious variation; however more study may be needed to ascertain the effect of mitochondrial dysfunction on Neanderthal survival.

Early Siberian Maternal Lineages in the Tubalar of Northeastern Altai Inferred from High-Resolution Mitochondrial DNA Analysis

At the hight of the last glaciation (~18 kya) Siberians were confined to the southern strongholds, which were areas of continuous occupation, and where immediate ancestors of the Uralic, Kettic and Altaian language groups differentiated. To better understand the evolutionary relationships between the earlier and contemporary Siberians, we focused on the northern Altaic prehistory preserved in the mtDNA diversity of the Tubalar, until recently representing a typical hunting-gathering population. The present study includes 139 Tubalar. All mtDNAs were subjected to high-resolution SNP analysis, followed by complete sequencing of selected mtDNA samples. We showed that the core of the Tubalar genetic makeup proved to be a mixture of west (H8, U4b, U5a1, and X2e) and east Eurasian (A and B1) haplogroups derived from macrohaplogroup N, and Siberian derivatives of the macrohaplogroup M identifiable by subhaplogroup-specific mutations. For example, among the 36 Tubalar mtDNA samples that belong to haplogroup D, 10 (28%) harbored diagnostic markers of the subhaplogroup D3a2a shared with the Chukchi and Eskimos. This finding verified at the complete sequence level we attributed to ancient link between early Siberians, who underwent pronounced differentiation in the Altai-Sayan region, and some of the Eskimo tribes. A comparison of the mtDNA data generated through the course of this study with published complete sequences has contributed essentially to parsimonious phylogenetic structure of mtDNA evolution in west Siberia. Specifically, northeastern Altai appears to be a good candidate for the ancestral homeland of the haplogroup U4b, which is apparently ancient European. For some haplogroups, such as X2e, the relatively recent arrival to the Altai region is more likely.

Population Structure in Mongolia from a Mitochondrial DNA Perspective.

Mongolia has experienced a complex series of demographic movements over the past 10-20 millennia that have shaped the patterns of its modern human genetic variation. However, modern populations in Mongolia have not been extensively studied for DNA diversity, nor has the genetic contribution of Mongolians to the gene pools of contemporary populations in Southeast Asia and Oceania been fully resolved. Archaeological evidence from as early as the late Neolithic suggests the presence of both West and East Eurasian cultures in this region. Later demographic movements involving the emergence of the Mongolian and later Manchu Empires have further convoluted Mongolias population structure. To clarify the complex population history of Mongolia, we analyzed variation in the mtDNAs of 190 individuals from several Mongolian ethnic groups, including the Uriankhai, Zakhchin, Derbet, Khoton and Khalkha. We screened all samples for phylogenetically informative coding region SNPs and sequenced HVSI to assess control region variation in them. Our data suggest that the mtDNA diversity present in our population is consistent with the general pattern of variation observed in East Asia, with the most frequent haplogroups being C, D and G. Haplogroup variation in Mongolian ethnic groups reveals considerable maternal diversity with a predominance of basal M types. Interestingly, the Mongolians also possessed West Eurasian haplogroups, such as H, J and K, which are not commonly observed in East Asia, even at low frequencies. The main ethnic group in Mongolia, the Khalkha, was highly variable with respect to mtDNA haplotypes in comparison with the other ethnic groups, and clearly distinct from the Khoton and Zakhchin, as evidenced by distance measures. Overall, these data provide insights into the origins and affinities of these populations, their relationships with East Asian groups and neighboring Turkic speaking groups, including indigenous Altaians, and their possible role in the peopling of the Americas.

Y-chromosome short tandem repeat (YSTR) loci are used extensively in studies of population substructure, temporality of population dynamics, and forensic identification. The occurrence of non-consensus YSTR alleles, such as unusually short alleles or partial insertion/deletion events (microvariants), have been used successfully as indicators of common ancestry among YSTR haplotypes, exposing further levels of phylogenetic substructure with restricted geographic distributions. However, the high variability of STR loci can potentially lead to false associations due to homoplasy (ie, recurrent mutation). Thus, YSTR haplotypes are best interpreted within the context of the binary marker defined Y-chromosome phylogeny. To identify YSTR microvariant alleles potentially useful for elucidating further phylogenetic substructure within binary haplogroups, we have assessed the haplogroup affiliation of microvariant alleles found at informative frequencies in public YSTR databases for the following YSTR loci: DYS385, DYS392, DYS441, DYS446, DYS447, DYS449 and DYS464. We report haplogroup affiliations for each variant allele and geographic origins of representative samples.

L1c2a, the (African) Haplogroup With The Longest Mitochondrial Genome!

Haplotypes derived from the maternally-inherited mitochondrial DNA (mtDNA) control region are often employed as a first step in determining phylogenetic-relevant samples that could be selected for additional coding region testing. Using the currently defined world mtDNA haplogroup tree, researchers can assign these haplotypes to specific branches, paying particular attention to novel mutations that could assist in identifying new subclades. During a recent survey of the nearly 58000 mtDNA control region haplotypes currently present in the publicly accessible Sorenson Molecular Genealogy Foundation database, we observed a small number of mtDNAs (n=16) characterized by the presence of unusually long insertions of up to 200 bases. A small subset of these particularly long mtDNA haplotypes shared an identical insertion of 15 bases. Genealogical analysis combined with haplogroup prediction confirmed that these haplotypes shared a common African origin. Additionally, based on the pedigree data gathered, we determine the donors were not closely related. Moreover, through the analysis of complete mtDNA sequences, we conclude that the newly defined haplogroup is most likely of recent origin. As reported in this study, insertions of more than 10 bps are quite rare in the general population and in the published literature, thus providing an interesting case work in population and possibly future disease studies.

Mitochondrial DNA footprints in modern Mongolia.

Although Mongolia is one of the most sparsely populated countries in the world, it is located at a pivotal crossroad between the four corners of Asia (including the well-known Silk Road) and has been characterized throughout history by events that greatly added to its current cultural and ethnic diversity. Among these, perhaps one of the most significant happening was the ambitious expansion strategy employed by Mongolias most prominent personality, Genghis Khan, whose empire eventually stretched across all of modern-day China, a portion of modern Russia, Southern Asia, Eastern Europe and the Middle East. In 2007, through a well-planned collection effort, researchers at the Sorenson Molecular Genealogy Foundation and the National University of Mongolia were able to gather over 3,000 DNA samples, informed consents, and genealogical data throughout the country of Mongolia, including samples from 21 distinct tribal or ethnic populations. All the samples were sequenced for the three hypervariable segments of the mitochondrial DNA (mtDNA) control region to assess the genetic composition of modern Mongolia. The most common mtDNA haplotypes are typical of haplogroup C, which is frequent throughout Eastern Asia. However, nearly 40% of the observed mtDNA lineages are of Western Eurasian origin, including a significant frequency (~7%) of haplogroup H - the most common in Europe. The high prevalence of Western Eurasian lineages could be a remnant from Genghis Khans conquering efforts, trade and cultural exchanges along the Silk Route. To assess the extent of recent gene flow that could account for the elevated levels of Eurasian haplogroups within Mongolian populations, we have examined genealogical data of samples representative of Western Eurasian haplogroups.

Y chromosome microsatellite haplotypes in the Hutterite founders.

The current population of >12,000 Schmiedeleut Hutterites are descendants of 38 male founders who were born between 1700 and 1830 in Europe. Only 12 of these founders, each with a unique surname, have living male descendants related through male-only lineages. DNA samples were available in our laboratory for 75 male descendants of 11 of the 12 founders, accounting for 673 independent paternal meioses. We genotyped 9 microsatellite loci, which included a mean of 6.8 (range 2-23) males per lineage to evaluate potential relationships between the founders. Fourteen different haplotypes were identified, with an average of 3.5 (range 1-8) pairwise differences between haplotypes. All descendants within each of 9 lineages had identical Y haplotypes. Descendents of two of these lineages, 2 and 10, had the same haplotype despite different surnames, suggesting possible relatedness between the founders of these two lineages. Descendants of two lineages, 6 and 11, each carried three distinct haplotypes. Within each of these lineages the haplotypes differed from the ancestral haplotype by one repeat size at two loci. Additional male descendants in lineages 6 and 11 were then genotyped for the discrepant microsatellites, confirming the presence of three Y haplotypes each in lineages 6 and 11. The one mutation arose at each of four loci: DYS388, DYS389II, DYS390, DYS393. Three mutations were gains of one repeat; it was not possible to determine if the fourth mutation was a gain or loss of one repeat. The ancestral haplotypes in these two lineages are identical at four microsatellite loci; the alleles at the other five loci differ by one repeat size. The average mutation rate at these 9 loci was 0.00066 (95% CI 0.00015-0.0013), similar to other estimates. These data suggest that the founders of lineages 2 and 10 may have been related through paternal lines and that surnames do not strictly correspond to unique Y chromosomes. Moreover, certain ancestral haplotypes (i.e., those in lineages 6 and 11) may be more prone to mutation. Supported by NIH grants HD21244 and HL085197.

Genetic History of human populations of East African inferred from mtDNA and Y chromosome analyses.

Evidence from genetic, paleobiological, and archaeological studies suggest that Africa, especially East Africa, is most likely to be the cradle of the modern human species. Despite this fact, very little is currently known about genetic diversity in African populations in general, and East African populations in particular. Genetic data demonstrate that the patterns of genetic variation in East African populations are complex. All four major language families spoken in Africa (Afro-Asiatic, Nilo-Saharan, Niger-Kordofanian, and Khoisan) are found in the region. As part of a large study of population genetic diversity of East and Northeast Africa, we examined Y chromosome genetic diversity (to ascertain paternal lineages) as well as mitochondrial genetic diversity (to ascertain maternal lineages) in 1200 - 1500 individuals from ~ 40 Tanzanian, Sudanese, and Kenyan populations. For the Y chromosome analysis, we genotyped 60 UEPs (analyzed in a hierarchical manner to construct haplotypes) in a total of ~1500 male individuals. In order to infer ages of lineages and migration patterns, we further genotyped the individuals for 16 Y chromosome microsatellites. For the mtDNA analysis, we sequenced the mitochondrial D-loop in a total of 1200 individuals from the same populations, and for 200 individuals, we did complete mitochondrial genome sequencing. We compare our results with published results of studies from other parts of Africa and the Middle East. Our results indicate that East African populations have some of the most ancestral Y chromosome and mtDNA lineages in Africa, suggesting that they may have been an ancient source of dispersion throughout Africa. Additionally, we find evidence for ancient geneflow between East Africa and the Middle East. We also ascertained the effect of the Bantu-expansion and signature of recent migration of Cushitic-speaking groups originating from Ethiopia on peopling of East Africa.

Analysis of mtDNA and Y-chromosome haplogroups in Mexican Mestizos and Amerindian groups.

The Mexican population is mainly conformed by Mestizos, individuals with a genetic background consisting of Amerindian, European and African contributions. Genetic heterogeneity in Mexicans results from a complex demographic history that started with the peopling of North and Central America about 15,000 yrs ago, including the settlement of at least 60 different indigenous groups in Mexico, regional differences in admixture dynamics after colonization by Spaniards in the XVI century, epidemics and migration. Y chromosome-specific and mitcohondrial (mt) DNA polymorphisms are useful to help understand the genetic structure and history of human populations, due to their uniparental inheritance and lack of recombination. In order to refine the portrait of genetic variability derived from the Mexican Genome Diversity Project, we are characterizing maternal and paternal lineages participating in admixture. For this we included genotypic data from 163 mt SNPs and 123 Y chromosome SNPs present in the Illumina Human1M chip of 450 individuals, 300 mestizos from six states located in different regions: Northern, Central and Southern; and 150 individuals from different Amerindian groups (Tepehuanes, Zapotecos and Mayas). With this information, we are measuring genetic diversity using Fst and AMOVA analysis. Admixture analysis includes average and individual ancestral contribution estimates using autosomal SNPs. Initial results show that in our Mestizo sample, 88% of the mt haplogroups are Amerindian (A, B, C or D), and the rest includes European and African lineages. We have identified differences in proportions of each haplogroup in both Mestizos and Amerindians. Knowledege about the distribution of mt and Y-chromosome haplogroups in Mexican Mestizos and Amerindian groups, will generate valuable information to better understand genetic relationships between Mexicans and other Latin American populations. In addition, it may contribute to strengthen analysis in association studies of common complex diseases.

The origin of Native Americans from a mitochondrial DNA viewpoint.

America, the last continent to be colonized by modern humans, is characterized by an extraordinary linguistic and cultural diversity. Until recently, it was generally believed that starting around 13,500 years ago, the first Paleo-Indians arrived from Beringia, passing through an interior ice-free corridor in western North America, and spread rapidly all the way to Tierra del Fuego. Today, we realize that the peopling of the Americas involved a much more complex process. As for the maternally transmitted mitochondrial DNA (mtDNA), it has been clear since the early nineties that Native Americans could be traced back to four major maternal lineages (haplogroups) of Asian affinity. These were initially named A, B, C and D, and are now termed A2, B2, C1 and D1. More than 95% of living Native Americans belong to these four haplogroups, which can be considered pan-American, because they are shared by North, Central and South American populations. Later, five additional maternal lineages were discovered and named X2a, D2, D3, C4c, and D4h3. These less common or rare haplogroups are restricted only to some Native American populations or geographic areas and bring the overall number of Native American mtDNA lineages to nine. Our comprehensive overview of the four pan-American branches of the mtDNA tree suggests a scenario with a human entry and spread into the Americas from Beringia about 20,000 years ago, and preliminary data raise the possibility that the uncommon five Native American haplogroups might have marked additional migratory events from Asia or Beringia. Overall, through a combined analysis of modern and ancient Native American mtDNA, we are making an effort for reconstructing the complex pre-Columbian history at both macro- and micro-geographic levels.

Seven selection-nominated candidate genes (COL11A1, LMNA, FGFR1, FGFR2, TRPS, BRAF, FLNA) known to be involved in Mendelian craniofacial dysmorphologies and to have high allele frequency differences between West African and European populations were tested for admixture linkage to normal facial feature traits. The sample consists of 254 subjects (n=131 African Americans, n=123 Brazilians) of West African and European genetic ancestry. Each individual was genotyped at 176 ancestry informative markers (AIMs), which allowed for proportional estimation of genetic ancestry from four parental populations and adjustments for admixture stratification.3D images of faces were acquired using the 3dMDface imaging system. 3D coordinate data were collected from 22 landmarks placed on each image using the 3dMDPatient software. The 231 possible pairwise landmark distances were scaled to the geometric mean and then analyzed using Euclidean Distance Matrix Analysis.We used both ANOVA and ADMIXMAP to control for admixture stratification and to test for associations between the 231 pairwise landmark distances and 183 AIMs, using sex, height and BMI as covariates. We used a four-population model (West African, European, East Asian, and Native American).There is a strong concordance between the ANOVA and ADMIXMAP results. Many landmark distances, particularly on the mouth and nose, were significantly associated with genetic ancestry. Additionally, three of the candidate genes show no effects on pairwise landmark distances while four show distinct patterns of association. For example, FGFR2 is associated primarily with the length of the face. These results represent the first identification of the first genes affecting normal variation in facial features.

Ethnicity-Confirmed Genetic Structure in New Hampshire.

Genetic population structure is known to result from shared ancestry. Though there have been several studies of genetic structure within and among different geographic regions and ethnic groups, little is known of the genetic structure of highly admixed US populations or whether the structure is concordant with self-reported ancestry. In this study, 1529 single nucleotide polymorphisms (SNPs) from 864 healthy control individuals from New Hampshire were measured as part of a bladder cancer epidemiology study. The SNPs were from approximately 500 cancer susceptibility genes scattered throughout the genome. Of these, 960 Tag SNPs were used to cluster individuals using the Structure algorithm for between 2 and 5 subpopulations. Subtle genetic structure was found, suggesting the appropriate number of subpopulations to be either 4 or 5 (FSTs 4 populations: 0.0377, 0.0399, 0.0363, 0.0340; 5 populations: 0.0452, 0.0536, 0.0585, 0.0534, 0.0521). We coded the individuals self-reported ancestries in a genotype fashion (i.e. 0= not reporting that ancestry, 1= reporting part that ancestry, 2= reporting only that ancestry) and conducted a Spearmans rank correlation between each ancestry and the structure q value, which represents the proportion of an individual that originated from a certain genetic subpopulation. Those of Russian, Polish and Lithuanian ancestry most consistently clustered together. The ancestry results support either 4 or 5 subpopulations. In order to investigate linkage disequilibrium (LD), the complete set of SNPs from the 7 most densely genotyped genes were used to make haploview plots between the different groups. The results vary by gene, though for one gene in particular, GHR, the results are very different for 4 subpopulations. These results suggest that despite New Hampshires admixture and presumed homogeneity, there are 4 or 5 distinct genetic subgroups within the population that can be linked to self-reported ancestry and display differences in patterns of LD.

Accurate inference of human demographic history from genetic data is essential for identification of single nucleotide polymorphism (SNP) association with disease and for inference of natural selection. Haplotype diversity and haplotype sharing carry additional demographic information to that obtainable from SNP frequency spectra, and so we propose a novel method using haplotype summary statistics to fit demographic models to genome-wide SNP data. We divide the genome into 0.25 cM windows and for each we tabulate the number of distinct haplotypes and the frequency of the most common haplotype. We summarize the data by the genome-wide joint distribution of these two statistics. Coalescent simulations are then used to evaluate whether different demographic models are compatible with the observed data. Application of our method to simulated data shows that our method can reliably infer parameters from complex demographic models (such as bottlenecks) and is relatively robust to the levels of SNP ascertainment bias found in many genome-wide datasets. We have applied our method to data collected by the International HapMap Consortium and find that a bottleneck model best fits the CEU population. We have also analyzed a large dataset consisting of Affymetrix 500k data from ~2,900 individuals with ancestry from Taiwan, Japan, India, Mexico and many European countries. Since this dataset includes ~2,300 European individuals, we are able to study haplotype patterns at a fine scale within Europe. Interestingly, we find that within Europe there is a south-to-north gradient with decreasing levels of haplotype diversity moving north, consistent with south to north migrations.We also find that the southwestern European sample has higher haplotype diversity than the southeastern European sample. Additionally, a higher proportion of haplotypes are shared between the southwestern European sample and the Yoruba sample than between southeastern European sample and the Yoruba sample. These two patterns are consistent with recent admixture across the Mediterranean from Northern Africa.

Genome wide analysis and heritability estimation of intelligence in the International Multi-centre ADHD Genetics (IMAGE) study.

Attention-Deficit/Hyperactivity Disorder (ADHD) is a neurodevelopmental disorder characterised by symptoms of inattention, hyperactivity and impulsivity. There is growing evidence of heterogeneity in its etiology, pathophysiology and clinical expression. One approach to resolving heterogeneity involves the identification of endophenotypes, intervening variables that might mediate pathways between specific genes and clinical phenotype. IQ is a candidate endophenotype for ADHD. Genome-wide linkage analyses of full scale IQ and IQ subscales were performed in the International Multi-centre ADHD Genetics (IMAGE) study including 1094 families with 1094 DSM-IV combined type ADHD probands and their 1441 siblings (unselected for ADHD status). IQ was measured using five subscales of the WISC-IIIR scale. The full scale prorated IQ score and the five subscales were used as quantitative traits for linkage analysis. 5,407 autosomal SNPs were used to run multipoint regression-based linkage analyses using MERLIN. The h2 estimates from the IQ subscales and the full IQ score ranged from 31% to 100%. Three suggestive linkage signals were found (LOD scores 2, p values 0.001) on chromosomes 7, 9 and 14 for three different subscales. Previously, two regions on chromosomes 7 and 14 were reported as being associated or linked to IQ. Our results, though only suggestive, suggest the presence of additional genetic variants contributing to the variance of IQ in ADHD.

16 comments:

The Hutterite study and the use of .00067 is interesting. Klyoshov, on rootsweb, has already shown that using his natural log rule and Chandlers rates he gets the "right" answer. I am a user of the ZUL number, until something better is found?? The discussion of this result should be interesting. I have been studying VV's norther italian data set and am mulling over my results. Gioello may have something going for him??

"Additionally, a higher proportion of haplotypes are shared between the southwestern European sample and the Yoruba sample than between southeastern European sample and the Yoruba sample."

A haplotype is a set of allele values in a number of SNPs. For example if there are three SNPs with values C/T, A/G, A/T then there are 8 possible haplotypes:

CAACATCGACGTTAATATTGATGT

Not all haplotypes are usually observed, since some SNPs may be fixed in one population, or because some SNPs may be in linkage disequilibrium with each other, i.e., they are inherited together and are not split up by recombination.

Any two populations have (i) some haplotypes in common, and (ii) some haplotypes specific to each one of them.

Of the (i), there are (a) some haplotypes are conserved features of the common ancestors of the two groups, while (b) others arose in one of the two populations and were later added to the other population.

Nearby populations (along migration routes) share a greater fraction of their haplotypes with each other.

In the absence of admixture, SW Europeans are expected to share fewer haplotypes with Africans than SE Europeans, since SE Europeans are closer to Africa in the route taken by early modern humans as they migrated out of the continent.

The finding that the opposite is true suggests that the genomes of SW Europeans are enriched in African haplotypes via a separate source, directly from Africa, and this has occurred after the "Out of Africa" migration.

Also, note that the larger sharing of haplotypes between SW Europeans Yoruba does not in itself indicate the presence of Sub-Saharan admixture in SW Europe, as the shared haplotypes could be due to a common source contributing to both populations. The North Africans are an obvious candidate for such a common source, as they are known to have influenced genetically both Sub-Saharan Africa and Iberia.

There might be some, but I think we're looking at sub-Saharan influence here, at least in considerable part.

Perhaps, but that conclusion can't be drawn from the available information, since no North African populations were sampled. The one autosomal study we do have about the Spanish does not suggest Sub-Saharan admixture.

I suspect that has something to do with the low samples used there, because mtDNa data shows relatively large amounts of sub-Saharan admixture in parts of Iberia. Up to about 9% in parts of Portugal...and then as low as 0% in many parts of Spain.

I'm sure we'll see that reflected in genome wide SNPs if large enough samples are taken.

Interesting, but tricky. Aside from Portugal there isnt any SSA admixture in any realistic presence to constitute a "highter proportion" as this article makes it seem according to these haplotype allele values as stated in this study.

BTW, I am just curious here, Dienekes, if theres an absence of pre-historic migration explanation, then whats the explanation for Cameroon carrying something like 90% R1b? Is this from a "Out of Africa/back migration or modern slave era phenomenon? Could this have any bearing on the matter or no?

"I don't know of any major events that contributed to the Yorubans having a lot of North African admixture."

The Hausa and Fulani in Nigeria are often grouped together for demographic analysis (I guess the rationale is that they are "those Muslim folk" or something), and they together form the majority of the Nigerian population. The Yoruba are the second-most numerous demographic group in Nigeria after the Hausa-Fulani. Since the Hausa and Fulani are widely believed (and now pretty much proven) to have significant North African (or at least non-SSA) ancestry, it wouldn't be surprising if the Yoruba, their southern neighbors within the state of Nigeria, also shared some of that (or a similar, related sort of) North African ancestry.

It would seem to me that Berbers from North Africa, carrying Islam South, would have contributed their genes too - same with Berbers carrying Islam North into Iberia as well.But in the case of Northern Portugal, it would have to be a Neolithic event, like Ibero-Maurusian or Oranian Culture.

More here:http://mathildasanthropologyblog.wordpress.com/2008/08/24/ancient-north-african-population-movements/

I got this from Wiki, but I also wrote it myself and updated it with a source about a year ago:"Some Y-chromosomes that appear to be closely related to the northern Cameroonian R1b1*(P25-derived) are found at a substantial frequency among the modern population of Egypt. Many modern populations of northern Cameroon speak Chadic languages, which are classified as an ancient branch of the Afro-Asiatic superfamily of languages; the now extinct language of the Ancient Egyptians also belonged to the same superfamily."This could be two things.A migration from Egypt, through Sudan, through Chad or Central African Republic into Cameroon.Another theory, which I think is more likely, is a more recent migration of people in the Sudan in both directions...one going into Upper Egypt in historical times (as we can document Sudanese inflow into Egypt for a very long time) and an outward migration from Sudan across the Sahel...and maybe slightly South into Cameroon.Does anyone know what tribe(s) have such a high rate of R1b1*? I loked up the tribes in those ergions and could only find one who spoke a Chadic language in the entire country and they are not numerous; the Northeast Central Plain (the closest geographically to Egypt-Sudan )is heavily populated, but not with Chadic speakers. The Kirdi speak a Chadic language and live in the Northern desert near the Fulani. Anyway, if it came from some source in Egypt or Sudan, where did that source come from?Haplogroup T also can be found in Cameroon. It is likely R1b1* and T came together from Western Eurasia. Maybe from , BUT it is at its highest frequency in the Fulani and Somalis, not in Western Eurasian peoples, although present. T could have originated in the East African horne, but R did not…hmmm

It’s interesting that 30% of pygmies have Bantoid Y haplogroup…which would be exogenous genetic inflow…why are they so short? Is this gene dominent or are there some other factors involved (like Pgymy women have problems delivering normal size children)?

Comments on Mongols

As far as the mtdna admixture in Mongols…could this be pre-Mongol expansion? The Eurasian steppe was full of horseback nomads long before that and there was obviously a Slavic, Indo-Persian speaking, and Turkic speaking population with the Mongols on the far Eastside of this cline. Could this just be a result of normal gene flow between nomadic people? I could imagine them trading women like horse, sheep, etc for good they could not make themselves. The Chinese also recorded people who appeared to be Western Eurasian in the current area of Xinjiang, as early as the 2nd century BCE. Xinjing borders Mongolia and although I do not know the topography of the land various turko-mongol groups have been in an out of the area since that time period (and maybe before).

Again, with all this talk of supposedly European mtDNA and Y-DNA haplogroups among the Mongols, one must wonder why on earth the Mongols look as un-European as they do, and why studies of various autosomal loci and epigenetic traits (gammaglobulin types, ABO blood system, skeletal morphology, etc.) position Mongols at the extreme of the Mongoloid pole. I believe that the totality of the data regarding Mongols would not support a claim that they are a derived, hybrid population.

I think it is much more likely that the Mongols are rather an ancient population that has continuously inhabited a territory close to the homeland of the ancestral proto-Eurasian population.

This thread is reinforcing some thinking I have been doing lately as to where did the earliest settlers of western europe came from?? It seems pretty clear that the Neolithic "Invasion" was pretty much east to west. I am starting to wonder if the earliest emigrants didn't come from Africa across the Gibraltar straits??? There are traces of info to suggest that: 1. The climate of northern africa c. 10K to 15K BP.- wasmuch more supportive of life and travel. 2. The "mini stone henge" at Nabta Playa in Egypt and its antiquity. 3. The presence of R1b in Egypt and and countries bounding the Sahara as described above. 4. The lack of a strong presence of R1b in the regions east of the Black Sea, indicating a more recent population/repopulation?

All of this suggests that out of Africa at Gibraltar is plausible. I would like to read any opinion/other resources on this subject. TIA. bob

Old Blog Archive

Dienekes' Anthropology blog is dedicated to human population genetics, physical anthropology, archaeology, and history.

You are free to reuse any of the materials of this blog for non-commercial purposes, as long as you attribute them to Dienekes Pontikos and provide a link to either the individual blog entry or to Dienekes Anthropology Blog.

Feel free to send e-mail to Dienekes Pontikos, or follow @dienekesp on Twitter.