Rxivist logo

Enzymatic DNA synthesis for digital information storage

By Henry H Lee, Reza Kalhor, Naveen Goela, Jean Bolot, George M. Church

Posted 16 Jun 2018
bioRxiv DOI: 10.1101/348987 (published DOI: 10.1038/s41467-019-10258-1)

DNA is an emerging storage medium for digital data but its adoption is hampered by limitations of phosphoramidite chemistry, which was developed for single-base accuracy required for biological functionality. Here, we establish a de novo enzymatic DNA synthesis strategy designed from the bottom-up for information storage. We harness a template-independent DNA polymerase for controlled synthesis of sequences with user-defined information content. We demonstrate retrieval of 144-bits, including addressing, from perfectly synthesized DNA strands using batch-processed Illumina and real-time Oxford Nanopore sequencing. We then develop a codec for data retrieval from populations of diverse but imperfectly synthesized DNA strands, each with a ~30% error tolerance. With this codec, we experimentally validate a kilobyte-scale design which stores 1 bit per nucleotide. Simulations of the codec support reliable and robust storage of information for large-scale systems. This work paves the way for alternative synthesis and sequencing strategies to advance information storage in DNA.

Download data

  • Downloaded 4,717 times
  • Download rankings, all-time:
    • Site-wide: 829 out of 88,646
    • In synthetic biology: 15 out of 828
  • Year to date:
    • Site-wide: 5,994 out of 88,646
  • Since beginning of last month:
    • Site-wide: 17,609 out of 88,646

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)