Study suggests rare genetic variants most likely to influence disease

March 31, 2011

DURHAM, N.C. - New genomic analyses suggest that the most common genetic variants in the human genome aren't the ones most likely causing disease. Rare genetic variants, the type found most often in functional areas of human DNA, are more often linked to disease, genetic experts at Duke University Medical Center report.

The study was published in the American Journal of Human Genetics on March 31.

"The more common a variant is, the less likely it is to be found in a functional region of the genome," said senior author David Goldstein, Ph.D., director of the Duke Center for Human Genome Variation. "Scientists have reported this observation before, but this study is the most comprehensive effort to date using annotations of the functional regions of the human genome and fully sequenced genomes."

Goldstein said that "the magnitude of the effect is dramatic and is consistent across all frequencies of variants we looked at." He also said he was surprised by the notable consistency of the finding. "It's not just that the most rare variants are different from the most common, it's that at every increase in frequency, a variant is less and less likely to be found in a functional region of the DNA," Goldstein said. "This analysis is consistent with what appears to be a growing consensus that common variants are less important in common diseases than many had originally thought."

The researchers established simple criteria to learn which genetic variants were functional, based on their locations. They looked at the regions in genes that make proteins and also functional regions that influence the expression of proteins.

The scientists sequenced the complete genomes of 29 people (of European origin) to assess the relationship between the functional properties of the variants and their population allele frequencies. (An allele is one member of a pair of genes, each contributed by a mother and a father, found at a specific location on a specific chromosome).

"We also asked whether we could identify any patterns by examining variants in which the derived form (i.e., the one that appeared in humans and is not seen in related primates) was the most common form," said Goldstein, the Richard and Pat Johnson Distinguished University Professor in molecular genetics and microbiology. "That is the unusual case, because when there is a mutation that changes an allele from the ancestral allele, this is usually the more rare form in the population. If you do have a mutation that is beneficial, then the variant can increase in frequency and become the common one."

When the researchers focused on the derived form that was also the common form of the allele, they didn't know quite what to expect, but one possibility was that the common alleles would be more enriched in functional locations of DNA.

"You might expect the findings to turn around in this case and go in the other direction," Goldstein said. "Instead, we observed that there was no connection at all between the frequency of these particular common variants in a population and whether they were in functional regions of the genome or not. The pattern just evaporated."

He said the obvious interpretation is that whenever a variant does become common, it does so precisely because it has no impact. "The bottom line is that we see common variants, as a rule, being neutral and not having effects," Goldstein said. "There are some big exceptions to this rule, in particular in the HLA gene region where selection for resistance to infectious disease has resulted in many common variants with major effects. But in general in the genome, we see this as very much the exception."

Goldstein noted that before scientists developed sequencing tools to look at genomewide effects, they only knew there were some common variants that caused disease, with one of the classic examples being the strong effect of Apoe4 on the risk of Alzheimer's disease. "Many people expected to see more examples of common variants having an important collective impact on common diseases, but that didn't happen," Goldstein said.

"One really interesting aspect of the work is that for every functional category we considered, there is preferentially exclusion of common variants," said Qianqian Zhu, Ph.D., lead author and postdoctoral associate in the Goldstein laboratory. "This observation helps to illustrate how well the scientific community is succeeding in identifying the functional regions of the human genome."

In the future, scientists will need to develop sophisticated ways to use population allele frequency to directly facilitate the search for disease-causing mutations, Zhu said.

"If it is true that a lot of human disease is due to relatively rare variants that are relatively harmful, then we are going to be able to find them. It may take sequencing thousands of patient genomes to track down the responsible mutations, but they will be found," Goldstein said. "I am entirely convinced that sequencing, which is becoming less expensive every month, will unlock a lot of the causes of genetic disease. What we can do clinically with that information will become the primary challenge."
Other authors include co-lead author Dongliang Ge, Jessica M. Maia, Mingfu Zhu, Slave Petrovski, Samuel P. Dickson, Erin L. Heinzen, and Kevin V. Shianna of the Center for Human Genome Variation. Petrovski is also with the Department of Medicine at the Royal Melbourne Hospital at the University of Melbourne in Australia.

The study was funded by a Bill and Melinda Gates Foundation grant, and with additional funding from National Institute of Allergy and Infectious Diseases Center for HIV/AIDS Vaccine Immunology grant.

Duke University Medical Center

Related Genome Articles from Brightsurf:

Genome evolution goes digital
Dr. Alan Herbert from InsideOutBio describes ground-breaking research in a paper published online by Royal Society Open Science.

Breakthrough in genome visualization
Kadir Dede and Dr. Enno Ohlebusch at Ulm University in Germany have devised a method for constructing pan-genome subgraphs at different granularities without having to wait hours and days on end for the software to process the entire genome.

Sturgeon genome sequenced
Sturgeons lived on earth already 300 million years ago and yet their external appearance seems to have undergone very little change.

A sea monster's genome
The giant squid is an elusive giant, but its secrets are about to be revealed.

Deciphering the walnut genome
New research could provide a major boost to the state's growing $1.6 billion walnut industry by making it easier to breed walnut trees better equipped to combat the soil-borne pathogens that now plague many of California's 4,800 growers.

Illuminating the genome
Development of a new molecular visualisation method, RNA-guided endonuclease -- in situ labelling (RGEN-ISL) for the CRISPR/Cas9-mediated labelling of genomic sequences in nuclei and chromosomes.

A genome under influence
References form the basis of our comprehension of the world: they enable us to measure the height of our children or the efficiency of a drug.

How a virus destabilizes the genome
New insights into how Kaposi's sarcoma-associated herpesvirus (KSHV) induces genome instability and promotes cell proliferation could lead to the development of novel antiviral therapies for KSHV-associated cancers, according to a study published Sept.

Better genome editing
Reich Group researchers develop a more efficient and precise method of in-cell genome editing.

Unlocking the genome
A team led by Prof. Stein Aerts (VIB-KU Leuven) uncovers how access to relevant DNA regions is orchestrated in epithelial cells.

Read More: Genome News and Genome Current Events is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to