PDF (1975K)
摘要
在DNA信息存储中,从寡核苷酸池中准确有效地检索信息是提升其实用性的关键.目前,基于PCR扩增的信息检索方法因具备技术成熟、操作简单和成本低等优势被广泛应用.该方法通过引物与目标文件DNA序列上引物结合位点的特异性结合并扩增实现信息检索,若引物间的正交性不强,极易造成引物与非目标DNA序列的错误结合从而导致信息检索出错.为了提升DNA存储信息检索的准确性,提出一种面向DNA存储低串扰信息检索的引物设计方法,该方法主要包括随机引物生成、引物间加权汉明距离设计和强正交性引物筛选3个部分.首先随机生成满足CGC、均聚物长度、熔化温度和吉布斯自由能等基本引物设计规则的引物;然后设计加权汉明距离以确保对引物间正交性的衡量更加精准;最终构建以引物为节点、以加权汉明距离为路径的全耦合网络,并基于网络路径优化的策略筛选出具备强正交性的引物用于DNA存储信息检索,以降低信息检索出错的概率.实验结果表明,通过该方法设计的引物与传统方法相比具有更强的正交性,并且采用该方法设计的引物进行DNA存储信息检索可将误检可能性降低1/5左右,有效提升了DNA存储信息检索的准确性,有助于推动DNA存储的实用化进程.
Abstract
The accurate and efficient retrieval of data from an oligonucleotide pool is crucial for enhancing the practi- cality of DNA information storage systems. Currently,PCR amplication is widely used as an information retrieval method due to its advantages,such as its advanced technological nature,ease of operation,and low cost. This method retrieves information by specifically binding primers to primer binding sites in the target DNA sequences, followed by amplification. However,a lack of primer orthogonality most likely leads to nonspecific binding, resulting in errors. To improve the accuracy of DNA storage information retrieval,this study proposed a novel method for designing primers that enable low-interference information retrieval. It consisted of three main components: random primer generation,the design of weighted Hamming distance for primers,and screening of primers with strong orthogonality. First,primers that met basic primer design rules such as CGC,homopolymer length,melting temperature,and Gibbs free energy were randomly generated. Then,the weighted Hamming distance was designed to ensure a more accurate measurement of primer orthogonality. Finally,a fully-coupled network with primers as nodes and weighted Hamming distances as paths was constructed. From this complex,primers with strong or- thogonality were selected using a network path optimization strategy,thereby reducing the likelihood of errors. The experimental results confirmed that these primers were more orthogonal than those obtained using traditional methods,reducing the risk of false retrieval by about one-fifth. Thus,they effectively improved the accuracy of DNA storage information retrieval and advanced the practical applications of DNA information storage systems.
关键词
Key words
[Author(id=1273285155570864350, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, orderNo=0, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=shufangzhang@tju.edu.cn, emailSecond=null, emailThird=null, correspondingAuthor=1, authorType=1, ext={EN=AuthorExt(id=1273285155658944737, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155570864350, language=EN, stringName=Shufang Zhang, firstName=Shufang, middleName=null, lastName=Zhang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
2 State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin 300072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1273285155713470690, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155570864350, language=CN, stringName=张淑芳, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, 2, address=1 天津大学电气自动化与信息工程学院, 天津 300072
2 合成生物技术全国重点实验室(天津大学), 天津 300072, bio={"content":"张淑芳(1979—),女,博士,副教授.
"}, bioImg=null, bioContent=张淑芳(1979—),女,博士,副教授.
, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1273285155398897879, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, xref=1, ext=[AuthorCompanyExt(id=1273285155415675096, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China), AuthorCompanyExt(id=1273285155432452313, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 天津大学电气自动化与信息工程学院, 天津 300072)]), AuthorCompany(id=1273285155482783962, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, xref=2, ext=[AuthorCompanyExt(id=1273285155499561179, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155482783962, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 State Key Laboratory of Synthetic Biology, Tianjin University, Tianjin 300072, China), AuthorCompanyExt(id=1273285155516338396, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155482783962, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=2 合成生物技术全国重点实验室(天津大学), 天津 300072)])]), Author(id=1273285155772190948, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, orderNo=1, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1273285155843494118, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155772190948, language=EN, stringName=Huaqing Yang, firstName=Huaqing, middleName=null, lastName=Yang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1273285155898020071, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155772190948, language=CN, stringName=杨华卿, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 天津大学电气自动化与信息工程学院, 天津 300072, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1273285155398897879, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, xref=1, ext=[AuthorCompanyExt(id=1273285155415675096, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China), AuthorCompanyExt(id=1273285155432452313, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 天津大学电气自动化与信息工程学院, 天津 300072)])]), Author(id=1273285155952546025, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, orderNo=2, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1273285156019654891, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155952546025, language=EN, stringName=Penghao Wang, firstName=Penghao, middleName=null, lastName=Wang, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1273285156074180844, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285155952546025, language=CN, stringName=王鹏浩, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 天津大学电气自动化与信息工程学院, 天津 300072, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1273285155398897879, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, xref=1, ext=[AuthorCompanyExt(id=1273285155415675096, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China), AuthorCompanyExt(id=1273285155432452313, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 天津大学电气自动化与信息工程学院, 天津 300072)])]), Author(id=1273285156128706798, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, orderNo=3, firstName=null, middleName=null, lastName=null, nameCn=null, orcid=null, stid=null, country=null, authorPic=null, dead=0, email=null, emailSecond=null, emailThird=null, correspondingAuthor=0, authorType=1, ext={EN=AuthorExt(id=1273285156200009968, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285156128706798, language=EN, stringName=Ming Luo, firstName=Ming, middleName=null, lastName=Luo, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null), CN=AuthorExt(id=1273285156254535921, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, authorId=1273285156128706798, language=CN, stringName=罗茗, firstName=null, middleName=null, lastName=null, prefix=null, suffix=null, authorComment=null, nameInitials=null, affiliation=null, department=null, xref=1, address=1 天津大学电气自动化与信息工程学院, 天津 300072, bio=null, bioImg=null, bioContent=null, aboutCorrespAuthor=null)}, companyList=[AuthorCompany(id=1273285155398897879, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, xref=1, ext=[AuthorCompanyExt(id=1273285155415675096, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=EN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China), AuthorCompanyExt(id=1273285155432452313, tenantId=1045748351789510663, journalId=1155139928303341634, articleId=1273282126829564502, companyId=1273285155398897879, language=CN, country=null, province=null, city=null, postcode=null, companyName=null, departmentName=null, remark=1 天津大学电气自动化与信息工程学院, 天津 300072)])])]
张淑芳,杨华卿,王鹏浩,罗茗.
面向DNA存储低串扰信息检索的引物设计方法[J].
天津大学学报(自然科学与工程技术版), 2026, 59(6): 565-572 DOI:10.11784/tdxbz202506038
| [1] |
Xu Q, Lu Z H, Bi K. DNA-LSIED:DNA lossy storage for images by encryption and corrective denoising method[J]. Signal, Image and Video Processing, 2025, 19:11.
|
| [2] |
Zheng Y F, Cao B, Zhang X K, et al. DNA-QLC:An efficient and reliable image encoding scheme for DNA storage[J]. BMC Genomics, 2024, 25:266.
|
| [3] |
刘彦军, 杨越飞, 胡迎新. 基于非天然核酸的高密度DNA存储编码方法[J]. 生物信息学, 2026, 24(1):70-84.
|
| [4] |
Liu Yanjun, Yang Yuefei, Hu Yingxin. High-density DNA storage encoding method based on unnatural nucleic acids[J]. Chinese Journal of Bioinformatics, 2026, 24(1):70-84(in Chinese).
|
| [5] |
张宣梁, 李青婷, 王飞. DNA存储系统中的数据写入[J]. 合成生物学, 2024, 5(5):1125-1141.
|
| [6] |
Zhang Xuanliang, Li Qingting, Wang Fei. Data writing in DNA storage systems[J]. Synthetic Biology Journal, 2024, 5(5):1125-1141(in Chinese).
|
| [7] |
Wang K, Cao B, Ma T, et al. Storing images in DNA via base128 encoding[J]. Journal of Chemical Information and Modeling, 2024, 64(5):1719-1729.
|
| [8] |
Seo S, Tandon A, Lee K W, et al. Information density enhancement using lossy compression in DNA data storage[J]. Advanced Materials, 2025, 37:2403071.
|
| [9] |
Rasool A, Hong J W, Hong Z L, et al. An effective DNA-based file storage system for practical archiving and retrieval of medical MRI data[J]. Small Methods, 2024, 8(10):2301585.
|
| [10] |
Church G M, Gao Y, Kosuri S. Next-generation digital information storage in DNA[J]. Science, 2012, 337(6102):1628.
|
| [11] |
Goldman N, Bertone P, Chen S Y, et al. Towards practical,high-capacity,low-maintenance information storage in synthesized DNA[J]. Nature, 2013, 494(7435):77-80.
|
| [12] |
Grass R N, Heckel R, Puddu M, et al. Robust chemical preservation of digital information on DNA in silica with error-correcting codes[J]. Angewandte Chemie International Edition, 2015, 54(8):2552-2555.
|
| [13] |
Erlich Y, Zielinski D. DNA fountain enables a robust and efficient storage architecture[J]. Science, 2017, 355(6328):950-954.
|
| [14] |
Yazdi S M H T, Yuan Y B, Ma J, et al. A rewritable, random-access DNA-based storage system[J]. Scientific Reports, 2015, 5(1):14138.
|
| [15] |
Organick L, Ang S D, Chen Y J, et al. Random access in large-scale DNA data storage[J]. Nature Biotechnology, 2018, 36(3):242-248.
|
| [16] |
Song X, Shah S, Reif J. Multidimensional data organization and random access in large-scale DNA storage systems[J]. Theoretical Computer Science, 2021, 894: 190-202.
|
| [17] |
张淑芳, 李予辉, 李炳志. DNA存储场景下基于引物索引矩阵的文件高效随机检索方法[J]. 电子与信息学报, 2024, 46(6):2568-2577.
|
| [18] |
Zhang Shufang, Li Yuhui, Li Bingzhi. Efficient file random access method based on primer index matrix in DNA storage scenarios[J]. Journal of Electronics and Information Technology, 2024, 46(6) : 2568-2577(in Chinese).
|
| [19] |
Wang Q, Zhang S F, Li Y H. Efficient DNA coding algorithm for polymerase chain reaction amplification information retrieval[J]. International Journal of Molecular Sciences, 2024, 25(12):6449.
|
| [20] |
Newman S, Stephenson A P, Willsey M, et al. High density DNA data storage library via dehydration with digital microfluidic retrieval[J]. Nature Communications, 2019, 10(1):1706.
|
| [21] |
Piantanida L, Hughes W L. A PCR-free approach to random access in DNA[J]. Nature Materials, 2021, 20(9):1172-1178.
|
| [22] |
Bee C, Chen Y J, Queen M, et al. Molecular-level similarity search brings computing to DNA data storage[J]. Nature Communications, 2021, 12(1):4764.
|
| [23] |
Ping Z, Chen S H, Zhou G Y, et al. Towards practical and robust DNA-based data archiving using the yin-yang codec system[J]. Nature Computational Science, 2022, 2(4):234-242.
|
基金资助
天津市科技计划资助项目(22JCYBJC01390)
合成生物技术全国重点实验室自主创新基金资助项目(HCZC-202610A)