Bluesky Facebook Reddit Email

DNAformer: where nature meets AI

03.20.25 | Technion-Israel Institute of Technology

Apple AirPods Pro (2nd Generation, USB-C)

Apple AirPods Pro (2nd Generation, USB-C) provide clear calls and strong noise reduction for interviews, conferences, and noisy field environments.


Researchers from the Henry and Marilyn Taub Faculty of Computer Science have developed an AI-based method that accelerates DNA-based data retrieval by three orders of magnitude while significantly improving accuracy. The research team included Ph.D. student Omer Sabary, Dr. Daniella Bar-Lev, Dr. Itai Orr, Prof. Eitan Yaakobi, and Prof. Tuvi Etzion.

DNA data storage is an emerging field that leverages DNA as a platform for storing information. DNA offers significant advantages as a storage medium, including:

DNA is a molecule composed of a sequence of organic compounds called nucleotides. These nucleotides are classified into four types, represented by the letters A, C, G, and T. Unlike traditional computing, where data is encoded using only two digits (0 and 1), DNA storage is based on sequences of four letters, dramatically increasing the number of possible combinations.

To write (store) data in this technology, DNA synthesis is required – creating DNA molecules based on the sequences encoding the information. To read the stored data, DNA sequencing is necessary.

Challenges in DNA Data Storage

Developing DNA-based storage technology presents several technological challenges:

DNAformer: AI-Powered Data Retrieval

The current research presents a comprehensive computational solution for retrieving and correcting errors in complex DNA-based storage systems. Using advanced algorithms and encoding techniques, the researchers have demonstrated that their solution reduces data retrieval and reading time from several days to just 10 minutes.

The Technion-developed method, DNAformer, is based on a transformer model trained on simulated data (generated using a simulator, which was also developed at the Technion) to reconstruct accurate DNA sequences from erroneous copies. The method also includes a custom error-correction code tailored for DNA, ensuring robust data integrity.

Additionally, an extra safety margin mechanism detects particularly noisy DNA sequences (unwanted signals or errors that occur during the sequencing process, which can interfere with the accurate interpretation of the data) and applies powerful algorithmic tools to handle them efficiently. At the end of the process, the data is converted back into digital information.

Breakthrough Performance

The new method enables the reading of 100 megabytes of data at a speed 3,200 times faster than the most accurate existing method – without any loss of accuracy. Compared to previously known fast methods, DNAformer also improves accuracy by up to 40% while significantly reducing processing time. This was demonstrated on a 3.1-megabyte dataset, which included:

The researchers plan to develop customized versions of DNAformer tailored to different needs. They emphasize that their technology is scalable and adaptable, meaning it can be optimized for large-scale data storage applications, meeting market demands and future DNA synthesis and sequencing advancements.

The study was supported by The European Research Council (ERC Grant, DNAStorage), The European Innovation Council (EIC Grant, Project DiDAX) and The Israel Science Foundation (ISF).

10.1038/s42256-025-01003-z

Experimental study

Scalable and robust DNA-based storage via coding theory and deep learning

21-Feb-2025

Keywords

Article Information

Contact Information

Doron Shaham
Technion-Israel Institute of Technology
sdoron@technion.ac.il

Source

How to Cite This Article

APA:
Technion-Israel Institute of Technology. (2025, March 20). DNAformer: where nature meets AI. Brightsurf News. https://www.brightsurf.com/news/1GR4ZZE8/dnaformer-where-nature-meets-ai.html
MLA:
"DNAformer: where nature meets AI." Brightsurf News, Mar. 20 2025, https://www.brightsurf.com/news/1GR4ZZE8/dnaformer-where-nature-meets-ai.html.