Can't wrap my head around conditional probability

In summary: A→B) ⇒ BSo, the probability space is a kind of logic. The probability of a thing happening, P(A), is equivalent to A being TRUE, or ⊤. The probability of A not happening is the same as saying that A is ⊥, or FALSE. So, when you say P(A ∩ B), you are saying that A and B are both ⊤.In summary, the conversation discusses the difference between P(A\bigcapB) and P(A|B) in the context of a statistics problem. It explains the concept of probability spaces and how conditional probability is related to logic. The conversation also addresses the correct formula for finding the probability of a student who knows statistics well answering a question
  • #1
berdan
32
0
So,I'm studying statistics (for engineers) now,and this is one of them courses that really gives me a headache time after time.
Here for example,I can't seem to get the difference between P(A[itex]\bigcap[/itex]B),and P(A|B).I'l give an example in for of a question given to us in some class.

"70 percent of students know statistics well.The probability that a student who doesn't know statistics well, answers statistics question in exam is 0.2 .
Probability for a student who knows statistics well and answers statistics question correctly is 0.95.

What is the probability of a random student who does know statistics well ,to answer the question wrong?"

So,when they answered the question,they said that A-event of a student who knows statistics well.
B-event that the student answered correctly.

So,what we looking for is P(A[itex]\bigcap[/itex]Bc),as far as I've understood.And yes,in the answer to the question it goes :
P(A[itex]\bigcap[/itex]Bc)=P(Bc|A)*P(A)=0.05*0.7

Why why on Earth why??
And why does P(A[itex]\bigcap[/itex]Bc)=P(A)-P(A[itex]\bigcap[/itex]B) does not work here?
 
Physics news on Phys.org
  • #2
berdan said:
the difference between P(A[itex]\bigcap[/itex]B),and P(A|B)

I guess you've learned that probability is defined on some sort of "space" of events. It's helpful to think of [itex] P(A \cap B) [/itex] as the probability of an event in the "original space" of a problem and [itex] P(A|B) [/itex] as a probablity in a different event space space.
What is the probability of a random student who does know statistics well ,to answer the question wrong?"

Did you quote the question exactly? Note that the question does not ask "What is the probability that a random student will both know statistics well and also answer the question wrong?"

So,when they answered the question,they said that A-event of a student who knows statistics well.
B-event that the student answered correctly.

So,what we looking for is P(A[itex]\bigcap[/itex]Bc

No. [itex] P(A \cap B) [/itex] is an event in the "original" probabiity space, the one that gives probabilities for picking a student at random out of all students, including both those who know statistics well and those who don't.

The question is asking for [itex] P(B^C | A) [/itex]. The event [itex] B^C | A [/itex] is an event in the space of picking a student at random from those who know statistics well. This is not the same probablity space as the original space.

The laws relating conditional probability to probability in the "original" space are unusual because they contain probabilities from two different probability spaces. By contrast a law like [itex] P(A \cup B) = P(A) + P(B) - P(A \cap B) [/itex] only contains probabilities in one probability space.

In a manner of speaking, all probabilities are conditional, in the sense that their space of events is defined by certain given information. In the the context of a given problem, one set of information defines the "unconditional" probability space and we dont' bother to annotate its probabilities by [itex] P(A| Info) [/itex] where [itex] Info[/itex] is information stated in the problem. When the problem defines additional probability spaces by adding more information (such as the information that we assume an event [itex] C [/itex] in the original space did definitely happen, it uses the notation [itex] P(A|C) [/itex] to indicate that we have changed the probability space.

Since the "original" probability space is really a conditional probability, it isn't suprising that laws such as [itex] P(A \cup B) = P(A) + P(B) - P(A \cap B) [/itex] apply to other conditional probability spaces. So, for example [itex] P(A \cup B | S) = P(A| S) + P(B|S) - P(A \cap B | S) [/itex].
 
  • #3
berdan said:
yes,in the answer to the question it goes :
P(A[itex]\bigcap[/itex]Bc)=P(Bc|A)*P(A)=0.05*0.7

That is a correct formula for [itex] P(B^c \cap A) [/itex]. It isn't the correct formula for [itex] P(B^c | A) [/itex].

Why why on Earth why??

Are you asking why the formula is correct for [itex] P(B^c \cap A) [/itex] ? Or are you asking why [itex] P(B^c \cap A) [/itex] is the correct way to translate the question? In the way that you phrased the question, I think it is not the correct way to translate the question into symbols.

And why does P(A[itex]\bigcap[/itex]Bc)=P(A)-P(A[itex]\bigcap[/itex]B) does not work here?

It does work. (.7)(.05) does equal (.7) - (.7)(.95).
 
  • #4
Stephen Tashi said:
Did you quote the question exactly? Note that the question does not ask "What is the probability that a random student will both know statistics well and also answer the question wrong?"

Ok,now as you say it,the question is actually :
"What is the probability that there is a student ,who knows statistics well,and answers the question wrong".
Ok,now I start to see the difference.In this question,they do ask for a student from an original set,P(A [itex]\bigcap[/itex]Bc).Ok,now,I'm getting what you mean.
They didn't said that a student knows probability ,as a given-cause in that case,I would have to look from a different set,and the probability would be P(Bc|A).

Ok,ok,thanks now.It looks like half of answering statistics questions is reading comprehension :)Damn,they didn't bother to explain that to us at all :(.Thanks a bunch ,man!
 
Last edited:
  • #5
Well, consider the trivial case: P(A given A) is trivially one, whereas P(A and A) is trivially equal to P(A).
 
  • #6
berdan said:
So,I'm studying statistics (for engineers) now,and this is one of them courses that really gives me a headache time after time.
Here for example,I can't seem to get the difference between P(A[itex]\bigcap[/itex]B),and P(A|B).I'l give an example in for of a question given to us in some class.

I'm not sure it would be entirely helpful for you, but I'll throw it out here anyway.

Probability admits a certain kind of logic. In logic, you have the operators ⊤ (true) ⊥ (false) ¬ (not) ∧ (and) ∨ (or) and → (implies). These correspond neatly with concepts in probability:

⊤ - An event with 100% certainty
⊥ - An event with 0% certainty.
¬ - The probability of something NOT happening
∧ - The probability of two things happening
∨ - The probability of either thing happening
→ - Conditional probability. (Note the order is reversed with the vertical bar).

The conditional probability law is:

P(B | A) * P(A) = P(A ∩ B)

Which, with a little squinting, corresponds to the logical notion of modus ponens:

(A → B) ∧ A ⇔ A ∧ B

It might not help you develop an intuition, but at least it should help you think of it in a new way.
 
  • #7
Hey berdan and welcome to the forums.

You can think of conditional probability as a probability where the new universal probability space is now a subset of the true space instead of the actual universal set.

This is all that is going on: you are selecting a "slice" of the expanding universal set and finding a probability of an event relative to a particular subset.

Its a lot easier to draw a Venn diagram to see this in action but this is all conditioning is.
 
  • #8
I think the way to think of it is that in Pr(A|B), B is the sample space.

It is quite intuitive to see this when looking at the definition:

Pr(A|B) = Pr(A intersect B) / Pr(B)

So, we consider only the instances of A occurring in the case that B also has occurred. We then divide by the probability of B to "adjust it" because our probability is one of saying "B certainly happened, given this, what is the probability that A happened as well."

You could even write any standard probability as a conditional probability. For example, Pr(A) in some sample space S is exactly Pr(A|S). Using the definition for conditional probability,

Pr(A|S) = Pr(A intersect S)/Pr(S)

We know that A intersect S is just A because A is a subset of S. We also know that Pr(S) = 1, so we just have Pr(A). Conditional probability for some other even B is the same notion, we are "moving" from sample space S to "sample space" B.

You also seem to be trying to relate Pr(A|B) to Pr(A intersect B). There are important relations between the two.

If Pr(A intersect B) = Pr(A|B), then it must be the case that:

Pr(A int B) = Pr(A int B)/Pr(B)

But the only solution to this is that Pr(A int B) has a probability of 0, and thus A and B are disjoint events.

There is also another.

If Pr(A|B) = Pr(A),

Then

Pr(A intersect B)/Pr(B) = Pr(A)
Pr(A intersect B) = Pr(A)Pr(B)

This is the definition of independent events. If the probability of A is the same whether or not B occurred then the probability of A intersect B is equal to the product of the probabilities of A and B.
 

FAQ: Can't wrap my head around conditional probability

1. What is conditional probability?

Conditional probability is the likelihood of an event occurring given that another event has already occurred. It is calculated by dividing the probability of the two events occurring together by the probability of the first event occurring.

2. Why is conditional probability important?

Conditional probability is important because it allows us to make predictions and decisions based on previous information. It is used in fields such as statistics, economics, and machine learning to analyze data and make informed decisions.

3. How is conditional probability different from regular probability?

Regular probability looks at the likelihood of an event occurring without any prior information. Conditional probability takes into account the likelihood of an event occurring given that another event has already occurred. In other words, it takes into consideration additional information that can affect the probability of an event.

4. What is an example of conditional probability?

An example of conditional probability is the probability of drawing two red cards from a deck of cards. If we know that the first card drawn was a red card, the probability of drawing a second red card decreases because there are now fewer red cards in the deck. This is an example of how the probability of an event changes based on prior information.

5. How can I improve my understanding of conditional probability?

To improve your understanding of conditional probability, it is important to practice solving problems and familiarize yourself with the basic principles and formulas. You can also seek out online resources, textbooks, or consult with a tutor or mentor for additional guidance.

Back
Top