The Monty Hall paradox/conundrum

sysprog · Jan 5, 2020

Holy X, @cmb, you are clearly evincing befuddlement.

Please trust that no-one here is trying to mislead anyone. If the Mentors/Moderators here were to determine that someone was deliberately doing that, the someone would be not be allowed to persist in doing that on PF.

You start the game with 1/3 chance of choosing the door that conceals the car. We call that door S1, for subset 1.

That leaves the 2 doors, that were not chosen, to contain the other 2 of the 3 possibilities for being the door that conceals the car. We call those 2 doors S2, for subset 2.

We know in advance that no more than 1 member of S2 can conceal the car, because there is only 1 car.
We therefore also know in advance that if the car is behind an S2 door, it's not behind the other S2 door.

There is always either 1 door or 2 doors in S2 that do not conceal a car: 1 if S2 conceals the car, and 2 if S1 conceals the car.

When Monty opens a door, without revealing a car, he thereby reveals nothing that tells you whether there was only 1 door in S2 that he could have opened, or there were 2 doors in S2 that he could have opened.

That means that he has thereby told you exactly nothing that serves to change what you already knew about the car-concealing chances of S1 or S2. You therefore can and should know that S1 remains at 1/3 chance of concealing the car, and that S2 remains at 2/3.

Once S2 has only 1 available member left, i.e., only 1 unopened door, being offered to switch to that door gives you the option to to take the S2 chance instead of the S1 chance. The S2 chance is still 2/3, just as it would have been if Monty had not opened a door from S2, and had given you the option of choosing both doors of S2.

So, after you have chosen, and after a door-opening interlude designed to obfuscate the obvious, you are offered the chance to switch, to the S2 member that at that time contains all of the S2 chance of concealing the car, and that chance is 2/3.

cmb · Jan 5, 2020

sysprog said:

Holy X, @cmb, you are clearly evincing befuddlement.

I am, truly, befuddled on this, no question at all!

Not trying to get anyone else to be befuddled, just interested in their answers to the 'stand-in contestant' scenario.

Is their probability different? If not how? Surely 1 choice in n is 1/n, and that is what they would face?

I know this can be dragged back to 'evidencing' the overall game-plan outcome is 2/3 by tables and diagrams. I totally get that, and I don't seek to doubt 'swapping' as the game plan of choice for the contestant walking into the studio. (And, no, I would not take the counter-bet! ;) )

I am interested in whether the 'fresh choice' the stand-in contestant faces with two doors is, at that moment in time, 50:50 or if not how is that possible?

Yes, very befuddled.

In your example, and comparing it with the 1000 door example, presumably that'd mean picking a door, seeing 998 being opened, then the chance of the one other door having the car behind it is 999/1000? ...

hmmm ... yeah, I think I am beginning to see that ... so if the contestant's choice was totally random but all the others weren't, then that 'other' door was 'chosen' by Monty in full knowledge of where it was, thus much more likely ... i.e. not independent. Yeah, OK, really tricky to see with 3 doors, but I think I am comfortable with this now.

Few... befuddlement indeed!

Thanks.

PeroK · Jan 5, 2020

cmb said:

I am, truly, befuddled on this, no question at all!

Not trying to get anyone else to be befuddled, just interested in their answers to the 'stand-in contestant' scenario.

Is their probability different? If not how? Surely 1 choice in n is 1/n, and that is what they would face?

I know this can be dragged back to 'evidencing' the overall game-plan outcome is 2/3 by tables and diagrams. I totally get that, and I don't seek to doubt 'swapping' as the game plan of choice.

I am interested in whether the 'fresh choice' the stand-in contestant faces with two doors is, at that moment in time, 50:50 or if not how is that possible?

Yes, very befuddled.

The stand-in contestant complicates things because, as I pointed out in an earlier post, you now have a variable amount of information available.

1) Monty knows where the car is, so he could win every time.

2) A contestant who knows which door was originally chosen can win 2/3 of the time.

3) A contestant who doesn't know which door was originally chosen, but is simply presented with a choice of two doors, can only win 1/2 of the time.

The mistake is to see this as some paradox on "absolute" probabilities. What are being calculated here are conditional probabilities based on the knowledge available. The amount of information you have determines how often you can win.

Orodruin · Jan 5, 2020

cmb said:

They know exactly what occurred.

The question is would the probability be the same for the 'stand-in' contestant who's been watching from the curtains or does he get a 50:50 shot at two doors with one win behind one of them?

If they know exactly what occurred, then the probability is 1/3 if they keep their friend’s door and 2/3 if they switch. Information does matter.

Consider the following: You know that there is a diamond in a piece of rock. The rock is cut into two pieces, one twice as big as the other. If you are allowed a choice between piece A and B without being told which piece is the big one your probability of getting the piece with the diamond is 1/2. If you are told which piece is the big piece, your will pick that one for a 2/3 chance. Nothing changed between the physical setups, but your probability of finding the diamond changed only based on additional information given to you.

cmb said:

Is their probability different? If not how? Surely 1 choice in n is 1/n, and that is what they would face?

Given the extra information, it is not a symmetric choose 1 of n.

sysprog · Jan 5, 2020

cmb said:

I am, truly, befuddled on this, no question at all!

Not trying to get anyone else to be befuddled, just interested in their answers to the 'stand-in contestant' scenario.

Is their probability different? If not how? Surely 1 choice in n is 1/n, and that is what they would face?

I know this can be dragged back to 'evidencing' the overall outcome is 2/3 by tables and diagrams. I totally get that, and I don't seek to doubt 'swapping' as the game plan of choice.

I am interested in whether the 'fresh choice' the stand-in contestant faces with two doors is, at that moment in time, 50:50 or if not how is that possible?

Yes, very befuddled.

From the point of view of the audience, the stand-in post-door-opening contestant who is asked choose 1 of the 2 remaining unopened doors, assuming that he is not told which door was the original contestant's choice, or any other special information, has 1/2 chance of choosing the 1/3 chance door, and 1/2 chance of choosing the 2/3 chance door. The average (arithmetic mean) of 1/3 and 2/3 is 1/2, so the stand-in has a 1/2 chance of choosing the door that conceals the car. That squares with what he should suppose if he is told nothing about the situation other than that there are 2 doors, 1 of which conceals a car, and the other of which conceals a goat, and that he can choose a door, and that he can keep whatever is concealed behind the door he chooses.

[Per @Orodruin pointing out that the stand-in was postulated to have been in on the prior corse of play:]

Assuming that the stand-in contestant knows what the first contestant knows, then he can and should know that: the original door chosen still has 1/3 chance of concealing the car, and the other unopened door has 2/3 chance of concealing the car.

cmb · Jan 5, 2020

Orodruin said:

Given the extra information, it is not a symmetric choose 1 of n.

Got it.

Thanks everyone, took a while but happy to withdraw the assertions in my OP. Having just the 3 doors does make this fact tricky to see.

... Will still have to go and think about it a bit more!

Orodruin · Jan 5, 2020

sysprog said:

assuming tat he is not told which door was the original contestant's choice

OP has stated that the stand-in will be told, see #60.

cmb · Jan 5, 2020

Yeah, I can now see 2/3 is true for the stand-in too, so long as he knows which one was picked by the contestant (who didn't know) and which one was (effectively) picked by Monty who did.

If he didn't know that, then, yes, in that case it'd drop to the 1 in 2. But knowing which one was 'chosen' by each makes the difference, as the 1000 door case shows. In the 1000 door show it becomes 'obvious' that the 999th door was 'chosen' by Monty as the last remaining, thus is 999/1000 likely the car.

... phew ... got there in the end. Thanks for bearing with me. I'd like to think I was befuddled than dumb!

Thanks to gmax for showing me a little sympathy! ;)

gmax137 said:

Me too. This Monty Hall thing has always bugged me, so I have a lot of sympathy for the previous posters who are harshly judged by others as having "bad intuition."

I don't know if this is a true story, but

https://files.eric.ed.gov/fulltext/EJ1060344.pdf

I wouldn't criticize anyone too harshly for following Erdos' intuition.

sysprog · Jan 5, 2020

... phew ... got there in the end. Thanks for bearing with me. I'd like to think I was befuddled than dumb!

Thanks to you for being good about being contradicted. Many bright people get this one wrong at first. No-one here calls anyone here dumb. Everyone here on PF is here at least partly to learn. That includes those who do a lot of teaching. Anyone here who is not learning something is missing out on a wealth of opportunity. I'm confident that I've learned a great deal here that I wouldn't have picked up elsewhere.

Dale · Jan 5, 2020

DaveC426913 said:

I can't just eliminate those rows, since I've already stipulated that they're all equally likely.

This is why, if you are interested in looking at outcomes of games (as opposed to probabilities), the proper tool is a Monte Carlo simulation, not a table. With a table or a decision tree you should calculate the probabilities for each row or path. In my tables I also included the impossible rows and calculated their probabilities which, as expected, was 0.

sysprog · Jan 5, 2020

Dale said:

DaveC426913 said:

I can't just eliminate those rows, since I've already stipulated that they're all equally likely.

This is why, if you are interested in looking at outcomes of games (as opposed to probabilities), the proper tool is a Monte Carlo simulation, not a table. With a table or a decision tree you should calculate the probabilities for each row or path. In my tables I also included the impossible rows and calculated their probabilities which, as expected, was 0.

Tables or decision trees that show all the possible outcomes and how they are arrived at can be used to establish probabilities for each variable, and those probabilities can be verified experimentally by means of a Monte Carlo simulation; however, the rules for the simulation and the rules for constructing the table or decision tree are inter-derivable, and both are derivable directly from the rules of the game.

FactChecker · Jan 5, 2020

The critical issue is the screening process that Monte applies in deciding which door to open. He will intentionally avoid opening the door with the car. That changes the odds. If he had 100,000 doors and opened 99,999 goat-doors, one must ask why he avoided the remaining door.

To emphasize the screening process, consider another problem:
Suppose there is a bucket of sand containing one diamond. You take a random pinch of sand from the bucket. It has a tiny chance of containing the diamond. Now suppose that all the sand in the bucket except a small pinch has been dumped out through a screen that will not allow the diamond to pass. You clearly know that there is an overwhelming chance that the small pinch remaining in the bucket has the diamond, while your pinch of sand still has a tiny chance. You should switch.

Orodruin · Jan 5, 2020

FactChecker said:

Now suppose that all the sand in the bucket except a small pinch has been dumped out through a screen that will not allow the diamond to pass. You clearly know that there is an overwhelming chance that the small pinch remaining in the bucket has the diamond, while your pinch of sand still has a tiny chance. You should switch.

I do not think that this is true unless you know that somebody deliberately dumped sand that they knew did not contain the diamond. If you just dumped sand randomly from the remainder, the largest probability is that the diamond ends up in the screen. It would be the equivalent of a game show where Monty opens one of the remaining doors at random, ie, the Monty Fall problem - where the probabilities are known to be 50-50.

Dale · Jan 5, 2020

sysprog said:

Tables or decision trees that show all the possible outcomes and how they are arrived at can be used to establish probabilities for each variable

The problem with that is that e.g. the first version of this table had four entries two representing wins for “stay” and two representing wins for “switch”. So simply counting entries in the table doesn’t give you probabilities. So the table itself must always be augmented with the probabilities. Even in the case where counting entries in the table gives you the probabilities it is because you know that the probabilities for each row are equal.

FactChecker · Jan 5, 2020

Orodruin said:

I do not think that this is true unless you know that somebody deliberately dumped sand that they knew did not contain the diamond.

As I said, the sand was dumped through a screen that did not allow the diamond to pass.

PeroK · Jan 5, 2020

FactChecker said:

As I said, the sand was dumped through a screen that did not allow the diamond to pass.

If you ended up on the Monty Hall show, I'll bet you'd bring a giant magnet and point it at each door and see which one bulges as the car gets pulled towards you. But, that's a different sort of screening process if you do something to find out exactly where something is!

That's nothing to do with probabilities and information. That's to do with physically isolating something.

FactChecker · Jan 5, 2020

PeroK said:

If you ended up on the Monty Hall show, I'll bet you'd bring a giant magnet and point it at each door and see which one bulges as the car gets pulled towards you. But, that's a different sort of screening process if you do something to find out exactly where something is!

That's nothing to do with probabilities and information. That's to do with physically isolating something.

So I should have said "the sand was dumped through a screen that would not allow the diamond to pass if it is in the bucket. " Thank you for the English lesson.

Orodruin · Jan 5, 2020

FactChecker said:

As I said, the sand was dumped through a screen that did not allow the diamond to pass.

This is irrelevant. Somebody would have to pre-screen the sand before the dumping. You would have to explicitly arrange for the diamond to be left in the bucket.

Orodruin said:

It would be the equivalent of a game show where Monty opens one of the remaining doors at random, ie, the Monty Fall problem - where the probabilities are known to be 50-50.

To put that into mathematics, consider the following:
You pick a door and Monty picks a door not to open at random among N > 2 doors. All the other doors are opened and none of them contains the car. Let A being the event that the car is behind your door, B the event that the car is behind Monty's door and C the event that the remaining N-2 doors do not contain a car. Clearly, P(A∪B∪!C) = 1 and A, B, and !C are exclusive. Both P(A) and P(B) are equal to 1/N because without further information to condition the probabilities on, they are both just choosing one out of N doors. This means that P(C) = 1-P(!C) = 1 - (N-2)/N = 2/N.

We seek P(A|C) and P(B|C).

P(A|C) = P(C|A) P(A)/P(C) = 1 * (1/N)/(2/N) = 1/2

Here, P(C|A) = 1 because if the car is behind your door then the remaining N-2 doors will not contain the car. P(A) = 1/N because there is originally a chance of 1/N of your door containing the car. The same logic can be applied to B.

sysprog · Jan 5, 2020

Dale said:

The problem with that is that e.g. the first version of this table had four entries two representing wins for “stay” and two representing wins for “switch”. So simply counting entries in the table doesn’t give you probabilities. So the table itself must always be augmented with the probabilities. Even in the case where counting entries in the table gives you the probabilities it is because you know that the probabilities for each row are equal.

That criticism seems to me to be apropos of either incorrect table construction or incorrect counting. It is clear from the rules of the game that there are 3 unique possible locations of the car, so 4 lines is immediately apparently 1 too many, and examination of the 3 car locations shows that 1 of them is repeated. The repeated car location line should be merged with its repetition, in the table construction or at least in the count, or the repetition should not be counted.

DaveC426913 · Jan 5, 2020

sysprog said:

...there are 3 unique possible locations of the car, so 4 lines is immediately apparently 1 too many

What? No.

'Possible locations of the car' is not what the number of rows is representing. Your idea that there is one too many rows is in error.

sysprog · Jan 5, 2020

DaveC426913 said:

What? No.

'Possible locations of the car' is not what the number of rows is representing. Your idea that there is one too many rows is in error.

In post #55 I included this table:

\begin{array}{|c|c|c|c|c|c|c|c|c|c|}
\hline & \mathtt 1 & \mathtt 2 & \mathtt 3 & \mathtt {st\ 1} & \mathtt {sw\ 1} & \mathtt {st\ 2} & \mathtt {sw\ 2} & \mathtt {st\ 3} & \mathtt {sw\ 3}\\
\hline \mathtt 1 & \mathbf {\underline{car}} & \mathtt {goat} & \mathtt {goat} & \mathbf {\underline{car}} & \mathtt {goat} & \mathtt {goat} & \mathbf {\underline{car}} & \mathtt {goat} & \mathbf {\underline{car}}\\
\hline \mathtt 2 & \mathtt {goat} & \mathbf {\underline{car}} & \mathtt {goat} & \mathtt {goat} & \mathbf {\underline{car}} & \mathbf {\underline{car}} & \mathtt {goat} & \mathtt {goat} & \mathbf {\underline{car}}\\
\hline \mathtt 3 & \mathtt {goat} & \mathtt {goat} & \mathbf {\underline{car}} & \mathtt {goat} & \mathbf {\underline{car}} & \mathtt {goat} & \mathbf {\underline{car}} & \mathbf {\underline{car}} & \mathtt {goat} \\
\hline
\end{array}

There are no rows there that repeat a car location and thereby obfuscate the fact that there are exactly 3 different locations for the car. That the 3 locations are equiprobable at 1/3 each is immediately discernible from the stated game conditions. Counting of the 1 car and 2 goats in each of the stick columns, and of the 2 cars and 1 goat in each of the switch columns, shows that the chance for the car is 1/3 if you stick, and 2/3 if you switch.

On the more complicated tables that have more than 1 row per car location, there is still 1/3 probability per car location; not 1/3 probability per reference.

mfb · Jan 5, 2020

Orodruin said:

This is irrelevant. Somebody would have to pre-screen the sand before the dumping. You would have to explicitly arrange for the diamond to be left in the bucket.

I think the whole bucket is put through the screen which let's everything pass apart from the diamond and a small bit of sand.

DaveC426913 · Jan 5, 2020

sysprog said:

In post #55 I included this table:

Right. OK. That was your table, not the tables that have been mostly at the centre of the discussion. I might have picked that up if my knee hadn't jerked.

Orodruin · Jan 5, 2020

mfb said:

I think the whole bucket is put through the screen which let's everything pass apart from the diamond and a small bit of sand.

The initial statement makes it clear that not all sand left in the bucket is passed through the filter (my boldface):

FactChecker said:

Now suppose that all the sand in the bucket except a small pinch has been dumped out through a screen that will not allow the diamond to pass.

sysprog · Jan 5, 2020

DaveC426913 said:

Right. OK. That was your table, not the tables that have been mostly at the centre of the discussion. I might have picked that up if my knee hadn't jerked.

Those more complicated tables can be examined for a characteristic repetition: when the car is behind the door initially chosen, they list once for each of the 2 doors that Monty is allowed by the rules to possibly open.

When the car is behind a door not initially chosen, there is only 1 door that Monty can open. There are exactly 3 doors and exactly 1 car, so there are exactly 3 possible outcomes that pivotally depend on where the car is. Listing 4 possibilities is therefore suspect.

If you ensure that you count exactly once per car position, you (presumably) won't wind up with the spurious 1/2 probability assessment that the ceremonious door-opening invites, and you'll (presumably) arrive instead at the correct 1/3 for sticking and 2/3 for switching probability distribution.

Dale · Jan 5, 2020

sysprog said:

It is clear from the rules of the game that there are 3 unique possible locations of the car, so 4 lines is immediately apparently 1 too many

The rows of the table indicate more than just the location of the car.

sysprog said:

Listing 4 possibilities is therefore suspect.

There are four possibilities. There is nothing suspect about that.

Not all the possibilities are equally probable, but there is no requirement that all entries in a table must be equally probable.

FactChecker · Jan 5, 2020

Orodruin said:

The initial statement makes it clear that not all sand left in the bucket is passed through the filter (my boldface):

Just to clarify what I meant: All the sand except a pinch is dumped out of the bucket through a filter. The remaining pinch of sand remains in the bucket along with any possible diamond.

mfb · Jan 6, 2020

FactChecker said:

Just to clarify what I meant: All the sand except a pinch is dumped out of the bucket through a filter. The remaining pinch of sand remains in the bucket along with any possible diamond.

Then you don't have the Monty Hall problem. You have the Monty Fall problem, and both pinches of sand are equivalent.

FactChecker · Jan 6, 2020

mfb said:

Then you don't have the Monty Hall problem. You have the Monty Fall problem, and both pinches of sand are equivalent.

That is not true. If the diamond remained in the bucket after you removed a pinch of sand, it will remain in the bucket after the majority of cand has been filtered out. That gives a very high probability that the remaining bit in the bucket has the diamond.

I have to think that we are not talking about the same thing since what I am saying is trivially true, IMHO, but people are arguing about it.

EDIT: I see what was ambiguous in my statements. I meant that as most of the sand is screened and dumped out, any diamond that hits the screen remains in the bucket. It is not removed.

sysprog · Jan 7, 2020

Dale said:

The rows of the table indicate more than just the location of the car.

True, but using an extra row for 1 of the car locations is misleading to many, who would, as you pointed out, then have to include a column, as you did and as @cmb did not, to show the fact that each of the 2 rows has only half of the probability for the 1 car location that they have in common. With only 1 row per car, you don't need a probability column for this problem, because the rows are simply 1/3 likely each.

There are four possibilities. There is nothing suspect about that.

The fact that there are more than 3 possibilities listed, when all that matters are the 3 possibilities for where the car is, in my view, makes the listing of 4 possibilities suffice as grounds to suspect that something that doesn't matter may be being given undue emphasis. Cursory investigation immediately makes apparent the fact that 1 car position is represented by 2 rows, and to my mind, that reinforces the suspicion. I think that the best correction of the overemphasis is to merge the 2 rows for the same car position into 1 row, but I recognize that your correction of including a probability column was also valid, and was well suited to the explanation that accompanied it, although I liked your earlier terser explanation:

Dale said:

Another way to look at it is as follows: the stick strategy wins if you initially pick the car. The switch strategy wins if you initially pick a goat. The probability of initially picking the car is 1/3 and the probability of initially picking a goat is 2/3.

I was about to post something similar, but I was still thinkig about how to word it, and you beat me to it.

Not all the possibilities are equally probable, but there is no requirement that all entries in a table must be equally probable.

I think that using 4 rows when there are only 3 possible car positions, without noticing that 2 of the rows share the same car location possibility and recognizing the implications of that, maps out exactly what misled @cmb in post #1.

The Monty Hall paradox/conundrum

Similar threads

Undergrad Please Explain (actually explain) The Monty Hall Problem

Undergrad A variant of the Monty Hall problem

Undergrad My basic understanding of set theory

High School Onto set mapping is the surjective set mapping, and into injective?

Undergrad How do E[X] and E[|X|] relate?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers