Construction of Multiple Constrained DNA Codes

DNA sequences are prone to forming secondary structures by folding back on themselves through non-specific hybridization among their nucleotides. Secondary structures make the sequences chemically inactive towards synthesis and sequencing processes. In this letter, our goal is to tackle the problems caused by secondary structures in DNA sequences, together with constraints such as avoiding large homopolymer run lengths. We present families of DNA codes with secondary structures of stem length at most two and homopolymer run length at most four. By mapping error-correcting codes over Z_11 to DNA nucleotides, we obtain DNA codes with rates 0.5765 times the rate of the corresponding code over Z_11, including some new secondary-structure-free and better-performing codes for DNA-based data storage and DNA computing.
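
To make the two constraints concrete, here is a minimal Python sketch of a checker for a single DNA sequence. It is an illustration only, not the construction from the letter. It assumes a simplified definition of a stem: a sequence contains a stem of length k if two non-overlapping substrings of length k are reverse complements of each other (Watson-Crick pairing A-T, C-G). The function names and this stem model are assumptions made for the example.

```python
from itertools import groupby

# Watson-Crick complement pairs.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def max_homopolymer_run(seq: str) -> int:
    """Length of the longest run of identical consecutive nucleotides."""
    return max((len(list(group)) for _, group in groupby(seq)), default=0)


def reverse_complement(seq: str) -> str:
    """Reverse the sequence and complement each nucleotide."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def has_stem(seq: str, stem_len: int) -> bool:
    """True if two non-overlapping substrings of length stem_len are
    reverse complements of each other (simplified stem model, assumed)."""
    for i in range(len(seq) - stem_len + 1):
        target = reverse_complement(seq[i:i + stem_len])
        # Search only to the right of the first arm so the arms do not overlap.
        if seq.find(target, i + stem_len) != -1:
            return True
    return False


def satisfies_constraints(seq: str, max_run: int = 4, max_stem: int = 2) -> bool:
    """Check homopolymer run length <= max_run and no stem longer than max_stem."""
    return max_homopolymer_run(seq) <= max_run and not has_stem(seq, max_stem + 1)
```

With the default parameters, the checker accepts stems of length two and runs of length four, matching the bounds stated above. A real secondary-structure predictor would also account for loop sizes and binding energies, which this sketch ignores.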
