I have 2x10^12 unique DNA sequences, with an average of 47 copies of each sequence (so 94x10^12 DNA molecules in total).

How many molecules do I need to draw at random to be "confident" (defined as you please) that I have at least 10^10 unique sequences? What about 10^11? 10^12?

I would really like to know how to do this calculation.

Any help would be much appreciated.

Thanks,

Ed

# Probability and random DNA

