Synthetic Biology Journal

   

Data writing in DNA storage systems

Xuanliang ZHANG, Qingting LI, Fei WANG   

  1. School of Chemistry and Chemical Engineering,Frontiers Science Center for Transformative Molecules,Zhiyuan College,Shanghai Jiao Tong University,200240 Shanghai,China
  • Received:2024-01-02 Revised:2024-05-20 Published:2024-06-17
  • Contact: Fei WANG

DNA存储系统中的数据写入

张宣梁, 李青婷, 王飞   

  1. 上海交通大学,化学化工学院,变革性分子前沿科学中心,致远学院,上海 200240
  • 通讯作者: 王飞
  • 作者简介:张宣梁(2004—),男,学士。研究方向为DNA存储与计算。E-mail:zhangxuanliang@sjtu.edu.cn
    王飞(1990—),女,博士,副教授。研究方向为DNA分子计算与信息存储。E-mail:wangfeu@sjtu.edu.cn
  • 基金资助:
    国家重点研发计划(2022YFF1201800);国家自然科学基金(22322704)

Abstract:

Advances in science and technology are creating huge benefits and value for society. The digitalization of the world has brought great changes to people's daily life. Meanwhile, the increasing degree of digitalization has led to an unprecedented explosion of data, resulting in increasingly severe information storage challenges. According to the current developing trend, the global data volume is expected to reach 175 zettabytes by 2025. With the rapid growth of global data volume and the exponential growth of total data, the existing storage methods will no longer be able to meet the storage needs brought by the digitalization of the world and there is an urgent need to develop information storage methods with better storage performance, higher storage efficiency and more durable storage media. Nature has offered a powerful solution by using DNA molecules as carriers of information, where genetic information has been transferred stably more than a million years. DNA storage has many advantages over traditional storage media, including high storage density, potentially low maintenance costs, and ease of synthesis and chemical modification, which make it an ideal alternative for information storage. The current process of storing data in DNA includes six main steps: encoding, writing, preservation, retrieval, reading, and decoding. Among them, the writing of data is the basic for realizing the storage of data in DNA, concluding writing data in DNA sequence and in DNA structure. In this review, we first introduce strategies for in vivo data writing in DNA storage systems, which primarily involve writing data into DNA sequences and DNA structures. This is followed by an overview of the development of in vivo writing techniques in DNA storage systems. Finally, we discuss the challenges faced by DNA storage systems in terms of high writing costs, slow writing speeds, and prospects for large-scale synthesis of high-purity DNA and improved biocatalysts.

Key words: DNA based data storage, DNA synthesis, nucleic acid sequences, DNA nanotechnology, framework nucleic acids

摘要:

世界的数字化给人们的生活带来了极大的变化,但与此同时,史无前例的数据激增使得信息存储面临的挑战日益严峻。随着全球数据总量的指数级增长,传统存储介质将无法满足数字化带来的存储需求。使用DNA分子作为基本载体的信息存储展现出高存储密度,低维护成本和易于化学修饰等独特优势。DNA存储主要包括编码、写入、保存、检索、读取和解码六个主要步骤,其中数据的写入是实现DNA存储功能的基础。在这篇综述中,我们首先介绍DNA存储系统中体外写入数据的策略方法,主要分为将数据写入DNA序列和写入DNA结构两个部分,接着概述体内写入数据技术的发展。最后将讨论DNA存储系统中数据写入面临的写入成本高、写入速度慢等挑战,并对大规模合成高纯度DNA、改进生物酶等具有前景的应用技术进行展望。

关键词: DNA存储, DNA合成, 核酸序列, DNA纳米技术, 框架核酸材料

CLC Number: