Optimal Codes Detecting Deletions in Concatenated Binary Strings Applied to Trace Reconstruction

04/19/2023
by   Serge Kas Hanna, et al.
0

Consider two or more strings 𝐱^1,𝐱^2,…, that are concatenated to form 𝐱=⟨𝐱^1,𝐱^2,…⟩. Suppose that up to δ deletions occur in each of the concatenated strings. Since deletions alter the lengths of the strings, a fundamental question to ask is: how much redundancy do we need to introduce in 𝐱 in order to recover the boundaries of 𝐱^1,𝐱^2,…? This boundary problem is equivalent to the problem of designing codes that can detect the exact number of deletions in each concatenated string. In this work, we answer the question above by first deriving converse results that give lower bounds on the redundancy of deletion-detecting codes. Then, we present a marker-based code construction whose redundancy is asymptotically optimal in δ among all families of deletion-detecting codes, and exactly optimal among all block-by-block decodable codes. To exemplify the usefulness of such deletion-detecting codes, we apply our code to trace reconstruction and design an efficient coded reconstruction scheme that requires a constant number of traces.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset