Scientists create program that finds synteny blocks in different animals

June 23, 2020

Modern genetics implies working with immense amounts of data which cannot be processed without the help of complex mathematical algorithms. For this reason, the task of developing special processing programs is no less important for bioinformatics specialists than that of genomic sequencing of specific animals. An international team of scientists that included researchers from ITMO University developed a software tool that makes it possible to quickly and efficiently find similar parts in the genomes of different animals, which is essential for understanding how closely related two species are, and how far they have evolved from their common ancestor. The research was published in GigaScience.

There are millions of biological species on planet Earth, and this diversity is laid down on the genetic level. Animals' anatomy, size, color patterns and habits are defined by their genes. Then again, the diversity of genes themselves is not that great: by today, scientists have only identified about over 20,000. Therefore, species are different in not only the sets of genes they have but also in how their genes are arranged. In the language of comparative genomics, this is called synteny, i.e. the arrangement of genes and regulatory elements.

"Let's take a gorilla and a chimpanzee as an example," says Ksenia Krasheninnikova, a researcher and engineer at ITMO University. "These two species have the same set of genes, but their regulatory elements and genome mutations create slightly different orders which results in differences between these primates."

Therefore, for the purposes of understanding how close two species are from the evolutionary standpoint, scientists need to know not just their genes but also how they are arranged in a chromosome, and how many common genome fragments, or synteny blocks, as geneticists call them, there are. Then again, looking for them manually is impossible: the amount of data is just too big. Genomes of mammals consist of millions and billions of base pairs, which makes processing without big data technologies next to impossible. For this reason, scientists create programs of their own that make it possible to solve this new category of tasks which has emerged in the course of the development of this science. And this is what the research team that included scientists from ITMO's Laboratory of Genomic Diversity did.

The new software tool was named halSynteny. According to its authors, it can search for synteny blocks better and faster than other programs developed for this purpose. What's more, halSynteny works with data in two standard and well-documented formats.

"Our goal was to create an algorithm that could be easily applied to accessible data," says Ksenia, who is the first author of this research. "Some of the approaches to the identification of synteny sequences are based on annotating genes in advance; our method is different. We don't use any additional annotation. We use the alignment method, when different parts of one genome are aligned by their degree of similarity with parts of another genome. This way, we can identify homogeneous parts, parts that are of the same origin."

The program makes it possible to speed up the computations by over two times in comparison with SatsumaSynteny2, another popular tool. Such high efficiency was attained by implementing a mathematically effective algorithm using C++.

The proposed method and software tool were tested by comparing cat and dog genomes.

"We showed that large fragments of cat chromosomes and some fragments of dog chromosomes unite in synteny blocks, which means that they've evolved from similar chromosomes of a common ancestor. And this can be used as a basis for making conclusions about their evolutionary process. Previous research in the field of "wet" biology demonstrated that cats' genome changed less from the genome of their common ancestor in comparison with that of dogs. This can be seen in comparison with other species that are not part of the carnivora order. The results that we got confirm these conclusions and make them more accurate. This means that in some specific part, the genome of a cat and the species taken for comparison is similar, and in dogs, it is rearranged."

In future, this algorithm will be used in other research in the field of comparative genomics that takes place at ITMO University.

ITMO University

Related Genome Articles from Brightsurf:

Genome evolution goes digital
Dr. Alan Herbert from InsideOutBio describes ground-breaking research in a paper published online by Royal Society Open Science.

Breakthrough in genome visualization
Kadir Dede and Dr. Enno Ohlebusch at Ulm University in Germany have devised a method for constructing pan-genome subgraphs at different granularities without having to wait hours and days on end for the software to process the entire genome.

Sturgeon genome sequenced
Sturgeons lived on earth already 300 million years ago and yet their external appearance seems to have undergone very little change.

A sea monster's genome
The giant squid is an elusive giant, but its secrets are about to be revealed.

Deciphering the walnut genome
New research could provide a major boost to the state's growing $1.6 billion walnut industry by making it easier to breed walnut trees better equipped to combat the soil-borne pathogens that now plague many of California's 4,800 growers.

Illuminating the genome
Development of a new molecular visualisation method, RNA-guided endonuclease -- in situ labelling (RGEN-ISL) for the CRISPR/Cas9-mediated labelling of genomic sequences in nuclei and chromosomes.

A genome under influence
References form the basis of our comprehension of the world: they enable us to measure the height of our children or the efficiency of a drug.

How a virus destabilizes the genome
New insights into how Kaposi's sarcoma-associated herpesvirus (KSHV) induces genome instability and promotes cell proliferation could lead to the development of novel antiviral therapies for KSHV-associated cancers, according to a study published Sept.

Better genome editing
Reich Group researchers develop a more efficient and precise method of in-cell genome editing.

Unlocking the genome
A team led by Prof. Stein Aerts (VIB-KU Leuven) uncovers how access to relevant DNA regions is orchestrated in epithelial cells.

Read More: Genome News and Genome Current Events is a participant in the Amazon Services LLC Associates Program, an affiliate advertising program designed to provide a means for sites to earn advertising fees by advertising and linking to