I Estimating Gene Mutation Proportion: A & B Approaches

schniefen
Messages
177
Reaction score
4
TL;DR Summary
Two approaches in estimating the proportion of the population with a certain gene mutation.
The proportion of individuals that carry a certain gene mutation in the population is unknown. A research assistant at a medical laboratory wants to estimate this proportion. The research assistant is thinking of two approaches:

A. Take blood samples from all individuals that come to the hospitals ER ward during a month, and test their gene status.

B. Sample people, by reaching them by cell phone or land line phone, and ask them to participate and have their gene status tested.

Suppose in both cases A. and B., all individuals that are asked will agree to have their blood sample taken and gene tested. Can you think of features in approaches A. and B. that may not fit into our setup for estimation?

Since the information in the problem is quite general, it is hard to make any definite conclusion about each approach. For instance, if one would like to use an MLE, one would like the sample to be i.i.d., however, this could probably be accomplished in both approaches. Maybe in A. there may be some issue with independence, since imagine a diseases that causes gene mutation and easily spreads, then every person that gets tested on the day when a person with that disease was present in the ward will likely turn out to also have a gene mutation.
 
Physics news on Phys.org
schniefen said:
Summary:: Two approaches in estimating the proportion of the population with a certain gene mutation.

Can you think of features in approaches A. and B. that may not fit into our setup for estimation?
The main issue is that A will not be a random sample of the population.
 
  • Like
Likes schniefen
Both of these are, at least partially, what are called "self-selecting samples". They have done something that makes them more likely to end up in your selected sample. This is only safe if their actions are independent of the property that you want to test for. If certain gene mutations tend to put more people in the ER, then your results are biased. Likewise, if the gene mutation tends to influence whether they have a phone, your results are biased. I can think of situations where that can occur in either case, so it is best for you to consider it. This is a very treacherous subject and it is easy to overlook dependencies that will bias your results.
 
Last edited:
  • Like
Likes Klystron, jim mcnamara and Dale
Hi all, I've been a roulette player for more than 10 years (although I took time off here and there) and it's only now that I'm trying to understand the physics of the game. Basically my strategy in roulette is to divide the wheel roughly into two halves (let's call them A and B). My theory is that in roulette there will invariably be variance. In other words, if A comes up 5 times in a row, B will be due to come up soon. However I have been proven wrong many times, and I have seen some...
Thread 'Detail of Diagonalization Lemma'
The following is more or less taken from page 6 of C. Smorynski's "Self-Reference and Modal Logic". (Springer, 1985) (I couldn't get raised brackets to indicate codification (Gödel numbering), so I use a box. The overline is assigning a name. The detail I would like clarification on is in the second step in the last line, where we have an m-overlined, and we substitute the expression for m. Are we saying that the name of a coded term is the same as the coded term? Thanks in advance.
Back
Top