New genetic method pinpoints geographic origin

(Medical Xpress) -- Understanding the genetic diversity within and between populations has important implications for studies of human disease and evolution. This includes identifying associations between genetic variants and disease, detecting genomic regions that have undergone positive selection and highlighting interesting aspects of human population history.

Now, a team of researchers from the UCLA Henry Samueli School of Engineering and Applied Science, UCLA's Department of Ecology and Evolutionary Biology and Israel's Tel Aviv University has developed an innovative approach to the study of genetic diversity called spatial ancestry analysis (SPA), which allows for the modeling of genetic variation in two- or three-dimensional space.

Their study is published online this week in the journal Nature Genetics.

With SPA, researchers can model the spatial distribution of each genetic variant by assigning a genetic variant's frequency as a continuous function in geographic space. By doing this, they show that the explicit modeling of the genetic variant frequency  the proportion of individuals who carry a specific variant  allows individuals to be localized on a world map on the basis of their genetic information alone.

"If we know from where each individual in our study originated, what we observe is that some variation is more common in one part of the world and less common in another part of the world," said Eleazar Eskin, an associate professor of computer science at UCLA Engineering. "How common these variants are in a specific location changes gradually as the location changes.

"In this study, we think of the frequency of variation as being defined by a specific location. This gives us a different way to think about populations, which are usually thought of as being discrete. Instead, we think about the variant frequencies changing in different locations. If you think about a person's ancestry, it is no longer about being from a specific population  but instead, each person's ancestry is defined by the location they're from. Now ancestry is a continuum."

The team reports the development of a simple probabilistic model for the spatial structure of genetic variation, with which they model how the frequency of each genetic variant changes as a function of the location of the individual in geographic space (where the gene frequency is actually a function of the x and y coordinates of an individual on a map).

"If the location of an individual is unknown, our model can actually infer geographic origins for each individual using only their genetic data with surprising accuracy," said Wen-Yun Yang, a UCLA computer science graduate student.

"The model makes it possible to infer the geographic ancestry of an individual's parents, even if those parents differ in ancestry. Existing approaches falter when it comes to this task," said UCLA's John Novembre, an assistant professor in the department of ecology and evolution.

"We are able to also show how to predict the spatial structure of worldwide populations," said Eskin, who also holds a joint appointment in the department of human genetics at the David Geffen School of Medicine at UCLA. "In just taking genetic information from populations from all over the world, we're able to reconstruct the topology of the global populations only from their genetic information."

Using the framework, SPA can also identify loci showing extreme patterns of spatial differentiation.

"These dramatic changes in the frequency of the variants potentially could be due to natural selection," Eskin said. "It could be that something in the environment is different in different locations. Let's say a mutation arose that has some advantageous property in a certain environment. So you can imagine then that a kind of force for genetic selection would make this mutation more common in that environment."

The research team began to examine all of the genes, and for each gene they computed how sharp of a change there was in the frequencies. They soon discovered that the genes which had the largest and most extreme changes are the ones that are known to have experienced selection in the recent past.

"So this is a new method for finding genes that are also undergoing selection in humans," Yang said.

Funding for the study was provided by the National Science Foundation and the National Institutes of Health.

In addition to Eskin, Yang and Novembre, Eran Halperin, of the school of computer science at Tel Aviv University, was a co-author of the research. The research was completed while all four members of the research team were at UCLA as part of the Institute of Pure and Applied Mathematics (IPAM) program on Mathematical and Computational Approaches in High-Throughput Genomics in fall 2011.

Related Stories

For decades, laboratory mice have been widely used in research aimed at understanding which genes are involved in various illnesses. But actual variations in past gene sequences of mice were unknown. While researchers were ...

Recent breakthroughs in the analysis of genetic variation in large populations have led to the discovery of hundreds of genes involved in dozens of common diseases. Many of these discoveries were enabled by performing "meta-analysis," ...

In the ongoing quest to identify the genetic factors involved in disease, scientists have increasingly turned to genome-wide association studies, or GWAS, which enable the scanning of up to a million genetic markers in thousands ...

(Medical Xpress) -- A large survey of human genetic variation, published today in the online version of the journal Science, shows that rare genetic variants are not so rare after all and offers insights into h ...

UCLA life scientists and colleagues have produced one of the first high-resolution genetic maps for African American populations. A genetic map reveals the precise locations across the genome where DNA from a person's father ...

Recommended for you

Lung diseases like emphysema and pulmonary fibrosis are common among people with malfunctioning telomeres, the "caps" or ends of chromosomes. Now, researchers from Johns Hopkins say they have discovered what ...

Moffitt Cancer Center researchers are using integrative approaches to study cancer by combining mathematical and computational modeling with experimental and clinical data. The use of integrative approaches ...

Newly published research from the Forsyth Institute details a discovery explaining why the 100 million Americans estimated to be taking prescription and over-the-counter antacid and heartburn medications may be at an increased ...

Nobody likes getting the flu, but for some people, fluids and rest aren't enough. A small number of children who catch the influenza virus fall so ill they end up in the hospital—perhaps needing ventilators ...

User comments

Please sign in to add a comment.
Registration is free, and takes less than a minute.
Read more

Click here to reset your password.
Sign in to get notified via email when new comments are made.