DNA Storage Error Simulator: A Tool for Simulating Errors in Synthesis, Storage, PCR and Sequencing

28 May 2022  ·  Jamie J. Alnasir, Thomas Heinis, Louis Carteron ·

DNA has many valuable characteristics that make it suitable for a long-term storage medium, in particular its durability and high information density. DNA can be stored safely for hundreds of years with virtually no degradation, in contrast to hard disk drives which typically last for about 5 years. Furthermore, the duration of DNA-Storage can be extended to potentially up to thousands of years if it is desiccated and cooled in storage. Advances in DNA technologies have made it possible to store the entirety of Wikipedia in a test tube and read that information using a handheld sequencing device, although imperfections in writing (synthesis) and reading (sequencing) need to be mitigated for it to be viable as a mainstream storage medium. New sequencing technologies, such as nanopore sequencing, aim to penetrate the consumer world, thanks to their affordability and size. However, the error characteristics of nanopore sequencing are not yet well characterised. DNA Storage Error Simulator models errors that can be introduced in all the phases of DNA storage workflow, including synthesis, storage, PCR for amplification and finally sequencing. The error characteristics for sequencing and synthesis can be configured in all necessary detail or can be chosen from a predefined set of values based on available literature and our own analysis. Availability: DNA Storage Error Simulator can be accessed online from: https://master.dbahb2jho41s4.amplifyapp.com (https://dnastorage.doc.ic.ac.uk/DNA-error-simulator)

PDF Abstract
No code implementations yet. Submit your code now

Tasks


Datasets


  Add Datasets introduced or used in this paper

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods


No methods listed for this paper. Add relevant methods here