Storing Digital Data in DNA: Advances Toward Practical Implementation
Keywords:
DNA Data Storage, Archival Storage Systems, Error Correction Codes, Enzymatic DNA Synthesis, Data Encoding ArchitecturesAbstract
The exponential growth of global digital data, projected to exceed 175 zettabytes by 2025, has exposed fundamental limitations in conventional storage infrastructure, including finite media lifespan and unsustainable energy consumption. This review examines synthetic deoxyribonucleic acid (DNA) as a next-generation archival storage medium, systematically analyzing the encoding architectures, synthesis methodologies, error correction strategies, and retrieval mechanisms that define the DNA data storage pipeline. DNA offers volumetric storage densities of approximately 500 GB/mm³ and molecular stability, enabling data preservation across centuries. The four-nucleotide alphabet theoretically permits 2 bits per nucleotide, though biochemical constraints reduce practical efficiency to 1.5–1.98 bits per nucleotide. Error correction has advanced substantially, with fountain code implementations achieving near-theoretical efficiency at sequencing coverage depths below 11x. However, significant barriers impede deployment: synthesis costs of $0.05–$0.10 per base, write throughput limited to kilobytes per second, and absent universal standards. Recent advances, including CRISPR-based overwriting, microfluidic platforms achieving 97.67% synthesis yields, and enzymatic methods producing oligonucleotides exceeding 1,000 nucleotides, offer promising pathways toward economic viability. The review concludes that DNA storage is approaching feasibility for hybrid archival hierarchies targeting rarely accessed data, with initial commercial deployment anticipated within 10–20 years.
Downloads
References
Alex Sensintaffar et al., "Advancing Archival Data Storage: The Promises and Challenges of DNA Storage System," ACM, 2025. https://dl.acm.org/doi/pdf/10.1145/3723166
Melpomeni Dimopoulou and Marc Antonini, "Data and image storage on synthetic DNA: existing solutions and challenges," Springer, 2022, no. 23, 2022. https://link.springer.com/content/pdf/10.1186/s13640-022-00600-x.pdf
Linda C. Meiser et al., "Synthetic DNA applications in information technology," Nature, 2022. https://www.nature.com/articles/s41467-021-27846-9
Zarif Bin AKHTAR and Ahmed TAJBIUL RAWOL, "Unlocking the Future for the New Data Paradigm of DNA Data Storage : An Investigative Analysis of Advancements, Challenges, Future Directions," JIS, 2024. https://revues.imist.ma/index.php/JIS/article/view/47102
Andrea Doricchi et al., "Emerging Approaches to DNA Data Storage: Challenges and Prospects," ACS, 2022. https://pubs.acs.org/doi/10.1021/acsnano.2c06748
Puru Sharma et al., "DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator." https://prongs1996.github.io/assets/pdf/DNA_Storage_Toolkit.pdf
Peter Michael Schwarz and Bernd Freisleben, "Optimizing fountain codes for DNA data storage," ScienceDirect, 2024. https://www.sciencedirect.com/science/article/pii/S2001037024003581
Chenyang Wang et al., "DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator," Springer, 2022. https://link.springer.com/article/10.1007/s42514-022-00094-z
Ben Cao et al., "Efficient data reconstruction: The bottleneck of large-scale application of DNA storage," ScienceDirect, 2024. https://www.sciencedirect.com/science/article/pii/S2211124724000275
Daniella Bar-Lev et al., "Deep DNA Storage: Scalable and Robust DNA-based Storage via Coding Theory and Deep Learning," arXiv:2109.00031, 2024. https://arxiv.org/abs/2109.00031
Afsaneh Sadremomtaz et al., "Digital data storage on DNA tape using CRISPR base editors," Nature, 2023. https://www.nature.com/articles/s41467-023-42223-4
Kunjie Li et al., "DNA-DISK: Automated end-to-end data storage via enzymatic single-nucleotide DNA synthesis and sequencing on digital microfluidics." PNAS, 2024. https://www.pnas.org/doi/10.1073/pnas.2410164121
Pavani Yashodha De Silva and Gamage Upeksha Ganegoda, "New Trends of Digital Data Storage in DNA," Wiley, 2016. https://onlinelibrary.wiley.com/doi/full/10.1155/2016/8072463
Joao Henrique Diniz Brandao Gervasio et al., "How close are we to storing data in DNA?" ScienceDirect, 2024. https://www.sciencedirect.com/science/article/pii/S0167779923002354
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.


