JeffJo
- 146
- 27
If the family has two boys, which one is the "the son" that I colored in red? If you can't identify him, you have to consider the chances of the two children together.Dadface said:We know that, regardless of how the information was obtained, the son has one sibling. We further know that there is a fifty percent chance that the sibling is a boy and a fifty percent chance that it is a girl. The answer is 1/2. Where's the paradox?
Essentially, you are treating the problem statement as "I know about one particular child of Mr. Smith's. That child is a boy." When that is the problem statement, the answer is indeed 1/2 because the un-pictured child has a 1/2 chance to be a boy, as you reasoned. But that is assuming information not included in the problem. You are assuming a specific child is identified.
Others treat it as "I know about both of Mr. Smith's children. The pair includes at least one boy." When that is the problem statement, the answer is 1/3 because 3/4 of all the possible families include a boy, and 1/4 include two, so the answer is (1/4)/(3/4)=1/3. But this also assumes information not included in the problem. It assumes you actually know both ngenders: you need to, in order to always find a boy when one is there. It is implicit that you can know about one boy, but not both. It also assumes the intent to look for a boy, something you can't deduce from the statement alone.
Most people who take the second approach accuse those who answer "1/2" of taking the first. That's why it is usually compared to the Mr. Jones question. While many do exactly that - as did you - that isn't the only way to get 1/2. I prefer to think of it as "I know a gender that exists in Mr. Smith's family." This assumes nothing, since all that is really implicit in the statement is that you know a gender. This is more complicated to solve, because you have to allow for the possibility that you might know "girl." In fact, the whole reason conditional probability is unintitive is that you have to allow for things you know didn't happen. But the short of it is that in the 1/2 that have one boy and one girl, it is just as likely to know about the girl as the boy.
An example of "knowing a gender" might be if a new family moves into the three-bedrooom house next door. You might deduce they have a boy by seeing a boy's bicycle in the driveway, but you can't associate it with a specific child.
+++++
But since you are interested in history, I have traced what may (it's speculation, but it seems plausible) be the history of the Tuesday Boy Problem, and how top-level experts can't always agree.
In his 1988 book "Innumeracy," Professor John Allen Paulos of Temple University included a variant of the Two Child Problem as an example, paraphrased: "If Myrtle is a girl from a family of two, what is the probability she has a brother?" His answer was 2/3 (note that he asks for the probability if a mixed family, not two girls).
J. L. Snell (Dartmouth) and R. Vanderbei (Princeton) pointed out in a 1995 article titled "Three Bewitching paradoxes" that, by giving the girl an uncommon name, he inadvertantly changed the problem. The probability Myrtle has a sister should be 2/(4-p), where p is the probability a girl is named Myrtle (this is the same formula that would be used for the Tuesday Boy question, if it asked fro a mixed family). In his 2008 book "The Drunkard's Walk," Leonard Mlodinow (Stanford, and a frequent collaborator with no less than Stephen Hawkings) used the same problem, about a girl named Florida.
Snell and Vanderbei ignored the illogic of having two girls named Myrtle in the same family, and included that case in their count. Mlodinow argued that the factor for it depends on p^2, which is negligibly small compared to the other kinds of families. Which is wrong: compared to the fraction of the famlies he is counting, it depends on p^1.
Giulio D'Agostini (Universit`a “La Sapienza” and INFN, Rome, Italy) attempted to correct for that factor, but did it wrong by disallowing only girls named Myrtle/Florida. He allowed two girls of any other name. But at least he got the right answer, 1/2, which is quite trivial to prove by a different method!
Define the following events: M2 is the event where a family of 2 children includes a girl named Myrtle. MO is the event where she is the older sibling, and in MY she is the younger. Finally, MB is the event where she has a brother. Everybody will agree that the probability Myrtle has a brother, given that she is either the older or younger sibling, is 1/2. Which is the first statwement in this progression:
- P(MB|MO)=1/2
- P(MB|MY)=1/2
- P(MO|M2)=Q (Most will say it is actually 1/2. I use a variable because I don't need to know it, and the error I'll demonstrate makes it a little more than 1/2.)
- P(MY|M2)=1-Q (That is, MO and MY represent all possibilites in M2, and do not overlap)
- P(MB|MO)*P(MO|M2) = P(MB and MO|M2) = Q/2
- P(MB|MY)*P(MY|M2) = P(MB and MY|M2) = (1-Q)/2
- P(MB and MO|M2) + P(MB and MY|M2) = P(MB|M2) = Q/2 + (1-Q)/2 = 1/2.
- QED.
And what is event worse, is that my derivation above is wrong. Specificalky, equation #2 is wrong. #1 is right because the gender of a second child is independent of both the gender and name of the first child, but the name (not gender) of a second child will depend on the name of the first child, if the gender is the same. I won't go into details unless asked, but it turns out that P(Myrtle has a brother) is approximately equal to 1/2+[P(a girl receives the "average" name)-P(a girl gets named Myrtle)]/8. In other words, for uncommon names, the probabiltiy is greater than 1/2!
Last edited: