DNA sequencing and restoring malformed sequences

Atran · Nov 28, 2020

I was just reading about DNA sequencing. In my view, DNA can be modeled into an ordered sequence of nucleobases, as if the two strands were joined into a single strand (just like in RNA). The first half of the sequence models the first strand. The four nucleobases are numbered from 0 to 3. Hence, a random sequence S equals (0 1 2 0 1 1 0 3 ...). The length of the sequence is on the order of billions.

Assume the same sequence is malformed (0 0 2 0 1 1 3 3 ...). It's malformed at position 1 and 6. Visualizing the two sequences parallel to each other, restoring the malformed sequence would be an easy computational task.

Am I missing something? I looked through this because I heard of sequencing in the context of NP-completeness in my computational complexity class.

Ygggdrasil · Dec 5, 2020

I don't think recognizing mutations is a computationally hard problem (provided the mutation rate is sufficiently low). In the context of genome sequencing, here's a good source that describes one computationally hard problem that had to be addressed in the genome sequencing field (figuring out how to assemble a full DNA sequence from multiple overlapping short fragments of that DNA sequence): http://www.cs.cmu.edu/afs/cs/academic/class/15210-s15/www/lectures/genome-notes.pdf

DNA sequencing and restoring malformed sequences

Similar threads

I need help understanding this quote about recent Natural Selection

Influenza A H5, an avian flu, first time in a human

Magnetoreception in Animals

One of my Favorite Parasites

What causes the asymmetry in a symmetrically developing organism?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers