
New tool enables easy, effective disease tracking

October 15, 2020

15th October 2020, Hong Kong: Published today in the journal GigaScience is a new open source, cloud-based tool called IDseq that makes it possible to rapidly detect, identify, and track emerging pathogens such as SARS-CoV-2. This tool can identify pathogens before there is an available complete genome sequence; thus, it can be used for current infectious disease outbreaks and also for emerging ones. This will substantially aid in preventing future pandemics.

The coronavirus pandemic demonstrates the importance of global infectious disease monitoring. Finding the cause of an infectious disease outbreak is challenging, especially if it stems from a previously unknown pathogen. IDseq, an open source, cloud-based metagenomic analysis platform, identifies both novel and existing disease-causing pathogens in a given sample -- be it from a human, animal, or parasite -- to provide an actionable report of what is happening on the ground in labs and clinics anywhere in the world.

"IDseq can be thought of as an early warning radar for emerging or novel infectious agents," said Joe DeRisi, PhD, Co-President of the Chan Zuckerberg Biohub, who contributed to the identification of the SARS coronavirus in 2003 and whose research lab at the University of California, San Francisco initiated the IDseq tool. It is designed to enable the global health community to leverage the ever-decreasing cost of sequencing for tracking and identifying infectious disease in essentially any sample. "At the beginning of the coronavirus pandemic, researchers in Cambodia used IDseq to help confirm and sequence the whole genome of the country's first case of COVID-19 in a matter of days, and in California, we're providing critical SARS-CoV-2 genomic data to public health officials to inform contact tracing and intervention strategies."

In the study published in GigaScience, scientists use various approaches to demonstrate that IDseq can reliably identify emerging pathogens, including, as proof of principle, the analysis of a nasal swab from a COVID-19 patient in Cambodia. A partnership between the Chan Zuckerberg Biohub, the Chan Zuckerberg Initiative (CZI), and the Bill and Melinda Gates Foundation enabled these researchers to sequence and confirm the country's first case of COVID-19 in a matter of days -- not the weeks it could typically take. The results demonstrate that IDseq can detect the presence of an emerging pathogen prior to the existence of a full reference genome. IDseq also now contains a new workflow for building SARS-CoV-2 consensus genomes.

"Metagenomic sequencing (mNGS) is an incredibly useful tool for pathogen detection because of its highly sensitive and hypothesis-free nature," said Katrina Kalantar, Computational Biologist at CZI. "We've seen labs that are using IDseq for existing mNGS studies rapidly pivot their focus to more targeted sequencing of SARS-CoV-2, which has helped researchers better understand coronavirus transmission patterns."

In Cambodia, researchers uploaded the genome sequence to open source pathogen data repository GISAID (Global Initiative on Sharing All Influenza Data) and to Nextstrain, so scientists anywhere can see the full genome sequence of the SARS-CoV-2 coronavirus and study it within the broader context of SARS-CoV-2 coronavirus sequences uploaded globally. Researchers at the Cambodian National Center for Parasitology, Entomology and Malaria Control (CNM) and the National Institute of Allergy and Infectious Diseases (NIAID) partnered with the Institut Pasteur Cambodia to complete this research. These researchers are one of several teams around the world receiving molecular biology and bioinformatics training from the infectious disease team at the Biohub; free access, training, and compute on the IDseq platform from CZI; and the necessary equipment and supplies to begin work in their own countries through the Grand Challenges Explorations Grants.

Unlike tests that are specific for a known agent, such as the SARS-CoV-2 PCR test, mNGS is a universal method that can detect novel disease-causing pathogens, which can be especially useful in cases where researchers may not know what is causing an infection, or what pathogens are circulating in a particular area. An mNGS experiment starts with mass-amplifying DNA traces of pathogens from a patient's sample, resulting in millions of small bits of DNA sequences, or reads. This enormous dataset must then be analysed and interpreted using bioinformatic techniques. The aim is to assign individual DNA fragments from the clinical sample to specific pathogens by leveraging knowledge from sequence databases.
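The idea of assigning reads to pathogens using a sequence database can be illustrated with a toy example. The sketch below is not IDseq's method: it assumes a tiny made-up reference set (`REFERENCES`), made-up reads (`READS`), and a simple shared k-mer vote, and every function name is hypothetical.

```python
from collections import Counter


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def build_index(references, k):
    """Map each k-mer to the set of reference names that contain it."""
    index = {}
    for name, seq in references.items():
        for kmer in kmers(seq, k):
            index.setdefault(kmer, set()).add(name)
    return index


def assign_read(read, index, k, min_hits=2):
    """Assign a read to the reference sharing the most k-mers, or None."""
    hits = Counter()
    for kmer in kmers(read, k):
        for name in index.get(kmer, ()):
            hits[name] += 1
    if not hits:
        return None
    name, count = hits.most_common(1)[0]
    return name if count >= min_hits else None


# Hypothetical toy data, standing in for a real sequence database and sample.
REFERENCES = {
    "pathogen_A": "ATGCGTACGTTAGCCGATAGCTAGGCTA",
    "pathogen_B": "TTGACCGGTAACCGTTGGAACCTTAGGA",
}
READS = ["CGTACGTTAGCC", "GGTAACCGTTGG", "AAAAAAAAAAAA"]

index = build_index(REFERENCES, k=8)
assignments = [assign_read(read, index, k=8) for read in READS]
counts = Counter(a for a in assignments if a is not None)
print(assignments)
print(dict(counts))
```

Real metagenomic pipelines rely on dedicated aligners and far larger databases; the sketch only shows the shape of the problem, in which each read is compared with known sequences and either attributed to one of them or left unassigned.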

Analysing the massive amount of data from a typical mNGS experiment often requires a battery of specialized bioinformatic tools, along with highly specialized expertise and expensive commercially licensed software -- making mNGS a hard-to-access method. The new user-friendly IDseq software is open source and freely available to the global health community, lowering the barrier to entry for metagenomics. Researchers can reuse and build upon the code, which works via a cloud-based service and a web application designed for collaboration and data sharing. The pipeline starts with raw sequencing data as the input, and then goes through steps of filtering, quality control, alignment, and reporting and visualization.
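The sequence of stages named above (filtering, quality control, alignment, reporting) can be sketched as a chain of small functions. This is a minimal illustration under stated assumptions, not the IDseq code: the thresholds, the exact-substring "alignment", the toy data, and all function names are hypothetical.

```python
from collections import Counter


def length_filter(reads, min_length=10):
    """Filtering: drop reads too short to be informative."""
    return [r for r in reads if len(r) >= min_length]


def quality_control(reads, max_n_fraction=0.1):
    """Quality control: drop reads with too many ambiguous 'N' bases."""
    return [r for r in reads if r and r.count("N") / len(r) <= max_n_fraction]


def align(reads, references):
    """Toy alignment: a read 'aligns' if it is an exact substring of a reference."""
    hits = []
    for read in reads:
        for name, seq in references.items():
            if read in seq:
                hits.append((read, name))
                break
    return hits


def report(hits, total_reads):
    """Reporting: summarise how many reads matched each reference."""
    per_reference = Counter(name for _, name in hits)
    return {
        "total_reads": total_reads,
        "aligned_reads": len(hits),
        "reads_per_reference": dict(per_reference),
    }


def run_pipeline(raw_reads, references):
    """Run the stages in order, starting from raw reads."""
    filtered = length_filter(raw_reads)
    clean = quality_control(filtered)
    hits = align(clean, references)
    return report(hits, total_reads=len(raw_reads))


# Hypothetical toy inputs.
references = {
    "virus_X": "ACGTTGCAAGCTTGACCGTA",
    "bacterium_Y": "GGCATTCGAACGTTAGGCCA",
}
raw_reads = [
    "TTGCAAGCTTGA",  # matches virus_X
    "CGAACGTTAGGC",  # matches bacterium_Y
    "ACGT",          # removed by the length filter
    "NNNNNNNNACGT",  # removed by quality control
    "CCCCCCCCCCCC",  # passes the filters but matches nothing
]

summary = run_pipeline(raw_reads, references)
print(summary)
```

Keeping each stage as a separate function mirrors the step-by-step description: any one stage, such as the toy aligner, could be swapped for a real tool without changing the others.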

For more information, visit

Further Reading: Kalantar KL et al., IDseq - An Open Source Cloud-based Pipeline and Analysis Service for Metagenomic Pathogen Detection and Monitoring. GigaScience. 2020;9(10):giaa085. doi:10.1093/gigascience/giaa111

Preprint available at

Contacts: Scott Edmunds, Editor in Chief GigaScience, BGI Hong Kong Email:

Leah Duran, Communications Manager, CZI Email:

Sharing on social media? Find GigaScience online on Twitter @GigaScience and on Facebook, and keep up to date with our blog.

About GigaScience

GigaScience is co-published by GigaScience Press and Oxford University Press. Winner of the 2018 PROSE award for Innovation in Journal Publishing (Multidisciplinary), the journal covers research that uses or produces 'big data' from the full spectrum of the biological and biomedical sciences. It also serves as a forum for discussing the difficulties of and unique needs for handling large-scale data from all areas of the life and medical sciences. The journal has a completely novel publication format -- one that integrates manuscript publication with complete data hosting and analysis tool incorporation. To encourage transparent reporting of scientific research as well as enable future access and analyses, it is a requirement of manuscript submission to GigaScience that all supporting data and source code be made available in the GigaScience database, GigaDB, as well as in publicly available repositories. GigaScience will provide users access to associated online tools and workflows, and has integrated a data analysis platform, maximizing the potential utility and re-use of data.

About the Chan Zuckerberg Initiative

Founded by Dr. Priscilla Chan and Mark Zuckerberg in 2015, the Chan Zuckerberg Initiative (CZI) is a new kind of philanthropy that's leveraging technology to help solve some of the world's toughest challenges -- from eradicating disease, to improving education, to reforming the criminal justice system. Across three core Initiative focus areas of Science, Education, and Justice & Opportunity, we're pairing engineering with grant-making, impact investing, and policy and advocacy work to help build an inclusive, just, and healthy future for everyone. For more information, please visit

About the Chan Zuckerberg Biohub

The Chan Zuckerberg Biohub is a nonprofit research organization setting the standard for collaborative science, where leaders in science and technology come together to drive discovery and support the bold vision to cure, prevent, or manage disease in our children's lifetime. The CZ Biohub seeks to understand the fundamental mechanisms underlying disease and to develop new technologies that will lead to actionable diagnostics and effective therapies. The CZ Biohub is a regional research endeavor with international reach, where the Bay Area's leading institutions -- the University of California, Berkeley, Stanford University and the University of California, San Francisco -- join forces with the CZ Biohub's innovative internal team to catalyze impact, benefitting people and partnerships around the world. To learn more, visit

