New tool facilitates inclusion of people of diverse ancestry in large genetics studies

February 02, 2021

BOSTON -- Genome-wide association studies (GWAS) have typically excluded diverse and minority individuals in the search for gene variants that confer risk of disease. Researchers at Massachusetts General Hospital (MGH), the Broad Institute of MIT and Harvard, and other institutions around the world have now developed a free-access software package called Tractor that increases the discovery power of genomics in understudied populations. A study of Tractor's performance and accuracy was published in

Researchers perform GWAS to identify where genetic variants responsible for causing disease are located in the genome. Recently, geneticists have begun creating models from published GWAS data to predict risks of disease in individuals. But the clinical utility of these models is currently limited, since most are based on genomic studies of people with European ancestry.

"If you build disease-risk models on available data and attempt to extrapolate them to diverse populations, the accuracy of predicting who will get sick is reduced," says Elizabeth Atkinson, PhD, lead author of the paper and an investigator in the Analytic and Translational Genetics Unit (ATGU) at MGH. "These errors exacerbate existing health disparities, in part because we aren't finding specific gene variants that may contribute to higher risk of a particular disease in diverse populations."

Another significant shortcoming of current GWAS is that "they leave many opportunities for genetic discovery on the table for all populations," says Atkinson. Because of human migration patterns over the ages, a person of African descent, for example, has on average a million more genetic variations than someone without African ancestry. Conducting a GWAS with diverse populations allows geneticists to pinpoint genetic associations with disease at many more spots across the genome, says Atkinson.

"Within these genomic regions identified in a GWAS, the genetic mutation that actually causes disease is shared across ancestries most of the time," she adds. By studying admixed populations -- people with recent ancestry from two or more previously isolated population groups, such as Africa and Europe -- "we can get more powerful and precise genetic association signals and do a better job at pinpointing where the causal mutation is, which improves our understanding of disease for everyone."

Until now, there was no fine-scale way to control for ancestry composition in mixed groups being studied in a GWAS. "Different ancestry groups have gene variants that occur at different frequencies due to the populations' demographic history," explains Atkinson. "Not taking ancestry into account in a GWAS can lead to false-positive hits or to gene variants cancelling themselves out and dismissed as not important. So, until now, it's been easier to exclude people with multiple ancestries from GWAS to avoid being confounded by different patterns of gene variants."

Tractor, however, allows researchers to account for ancestry in a precise manner so admixed individuals can be included in large-scale gene discovery efforts. The software colors segments of each person's chromosomes according to their ancestral origin, which researchers can infer from reference genome sequences, and uses this information in a new GWAS model. "Tractor takes into account the ancestry backbone of each genetic variant so we can correctly calibrate the GWAS results to find causal variants in specific population groups," says Atkinson.
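To make the idea of "coloring" chromosomes concrete, the following is a minimal sketch, not Tractor's actual code. It assumes that, at a single variant, each person's two chromosome copies already carry an inferred ancestry label (0 or 1) and an allele (0 or 1). The function name `split_by_ancestry` and the toy data are hypothetical and chosen only for illustration.

```python
import numpy as np

def split_by_ancestry(alleles, ancestry, n_ancestries=2):
    """Split allele counts at one variant by the ancestry of each chromosome copy.

    alleles  : (n_people, 2) array of 0/1 alleles, one column per chromosome copy
    ancestry : (n_people, 2) array of ancestry labels for the same copies
    Returns (hap_counts, dosages), each of shape (n_people, n_ancestries).
    """
    alleles = np.asarray(alleles)
    ancestry = np.asarray(ancestry)
    # Number of chromosome copies each person carries from each ancestry.
    hap_counts = np.stack(
        [(ancestry == k).sum(axis=1) for k in range(n_ancestries)], axis=1
    )
    # Number of variant alleles that sit on each ancestry background.
    dosages = np.stack(
        [((ancestry == k) * alleles).sum(axis=1) for k in range(n_ancestries)], axis=1
    )
    return hap_counts, dosages

# Hypothetical toy data for three people at one variant.
alleles = [[1, 0], [1, 1], [0, 0]]
ancestry = [[0, 1], [1, 1], [0, 0]]
hap_counts, dosages = split_by_ancestry(alleles, ancestry)
print(hap_counts)
print(dosages)
```

Each person's total allele count is simply the sum across a row of `dosages`; splitting it by ancestry is what lets a downstream association model treat the same variant differently depending on its ancestral background.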

Tractor also provides estimates of ancestry-specific effect sizes, which isn't possible in a standard GWAS. "Instead of getting a weighted average of the disease-risk effect size for a particular gene variant, Tractor can determine how large or small the effect of a variant is in various ancestry groups," says Atkinson. "This will be informative for building genetic risk scores in diverse populations." Another advantage of Tractor is its ability to improve the power of GWAS by detecting risk gene variants across multiple ancestries. "With Tractor, we can get stronger disease-association signals by leveraging ancestral genomic differences," says Atkinson.
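The difference between a single weighted-average effect and ancestry-specific effects can be illustrated with a small simulation. This is a sketch under stated assumptions, not the model specified in the Tractor paper: it uses ordinary linear regression on a simulated quantitative trait, two ancestries labeled 0 and 1, and made-up effect sizes (0.5 and 0.1). All variable names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20_000

# Simulated (hypothetical) data at one variant: ancestry label and allele
# for each of the two chromosome copies a person carries.
ancestry = rng.integers(0, 2, size=(n, 2))
alleles = rng.binomial(1, 0.3, size=(n, 2))

hap0 = (ancestry == 0).sum(axis=1)                 # copies from ancestry 0
dose0 = ((ancestry == 0) * alleles).sum(axis=1)    # variant alleles on ancestry 0
dose1 = ((ancestry == 1) * alleles).sum(axis=1)    # variant alleles on ancestry 1

# Assumed true effects: the variant matters more on the ancestry-0 background.
y = 0.5 * dose0 + 0.1 * dose1 + rng.normal(size=n)

# Standard-style model: one effect for the total allele count.
X_pooled = np.column_stack([np.ones(n), dose0 + dose1])
beta_pooled = np.linalg.lstsq(X_pooled, y, rcond=None)[0]

# Ancestry-aware model: ancestry copy count plus one effect per background.
X_aware = np.column_stack([np.ones(n), hap0, dose0, dose1])
beta_aware = np.linalg.lstsq(X_aware, y, rcond=None)[0]

print("pooled effect:", beta_pooled[1])
print("ancestry-0 effect:", beta_aware[2])
print("ancestry-1 effect:", beta_aware[3])
```

In this simulation the pooled estimate falls between the two assumed effects, while the ancestry-aware fit recovers a separate estimate for each background. A real analysis would also need covariates and, for case-control traits, a logistic rather than linear model.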

"Tractor advances the existing methodologies for studying the genetics of complex disorders in diverse and minority populations," she adds. "We hope that this method increases the inclusion of admixed participants in large-scale association studies going forward."
-end-
Major funding for this study was provided by the National Institute of Mental Health.

Co-authors include Mark Daly, PhD, founding chief of the ATGU and associate professor of Medicine at Harvard Medical School (HMS); and Benjamin Neale, PhD, also of the ATGU, associate professor of Medicine at HMS and director of Population Genetics, Stanley Center for Psychiatric Research, at the Broad Institute.



About the Massachusetts General Hospital


Massachusetts General Hospital, founded in 1811, is the original and largest teaching hospital of Harvard Medical School. The

