Show simple item record

dc.contributor.advisorKashyap, Navin
dc.contributor.authorSamanta, Puspabeethi
dc.date.accessioned2023-08-18T09:42:18Z
dc.date.available2023-08-18T09:42:18Z
dc.date.submitted2023
dc.identifier.urihttps://etd.iisc.ac.in/handle/2005/6192
dc.description.abstractIn DNA-based data storage, the desired information is encoded into the quaternary sequence of synthetic DNA molecules, called oligos. We look at secure communication via information-containing oligos in the usual three-party setting, where Alice and Bob are legitimate communicators and Eve is an eavesdropper. Our scheme for secure DNAbased communication is two-fold: First, we store secret data in synthetic DNA molecules in a properly designed oligo pool of suitably high dilution. The oligo pool comprises information-containing oligos mixed with background genomic DNA cleaved into strands of the same length as the useful oligos. Designing oligos for storing the secret data involves the design of a library of primer pairs and a code book specific to this set of primers, that satisfy constraints on homopolymer run length, GC content, primerprimer dissimilarity, and primer-sequence dissimilarity. The differential knowledge of the designed primer pairs allows Bob to retrieve most of the information-containing DNA molecules after carrying out sufficient rounds of PCR amplification on the diluted oligo pool. In contrast, Eve is not able to access the stored information owing to her lack of any prior knowledge of the primer pairs. In order to improve upon the security of this scheme, we next develop an index-based secrecy coding scheme for the resulting wiretap system, where Bob observes a substantially lower number of erasures in the main channel, as compared to Eve’s channel, which suffers a large number of erasures (loss of DNA molecules). We show that under the conditions of a noise-free setting, proper library preparation, and proper measurement of oligos represented in smaller fractions, this coding scheme achieves the secrecy capacity of a DNA storage wiretap channel model with strong secrecy. An error-correcting code for reliable storage of DNA-based data must adhere to the constraints of GC content and homopolymer run length. In this thesis, we also explore the constraint of GC balance in the context of insertion and deletion error-correcting codes for DNA storage systems. We derive an upper bound on the cardinality of single indel-correcting GC-balanced quaternary codes. This allows us to deduce the minimal number of redundancy bits required for GC-balancing in such codes. We also develop a GC-balanced coding scheme for the correction of a single burst of 2 insertion or 2 deletion errors.en_US
dc.language.isoen_USen_US
dc.relation.ispartofseries;ET00205
dc.rightsI grant Indian Institute of Science the right to archive and to make available my thesis or dissertation in whole or in part in all forms of media, now hereafter known. I retain all proprietary rights, such as patent rights. I also retain the right to use in future works (such as articles or books) all or part of this thesis or dissertationen_US
dc.subjectDNA storage systemsen_US
dc.subjectOligosen_US
dc.subjectGC-balanced coding schemeen_US
dc.subject.classificationResearch Subject Categories::TECHNOLOGY::Electrical engineering, electronics and photonics::Electronicsen_US
dc.titleCoding Schemes for Secure and Reliable DNA-Based Data Storageen_US
dc.typeThesisen_US
dc.degree.nameMTech (Res)en_US
dc.degree.levelMastersen_US
dc.degree.grantorIndian Institute of Scienceen_US
dc.degree.disciplineEngineeringen_US


Files in this item

This item appears in the following Collection(s)

Show simple item record