Probably a silly question about tree diagrams

etotheipi · Feb 29, 2020

Each node of a tree diagram corresponds to an event (i.e. a subset of the sample space), and the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node.

So considering the following two tree diagrams,

I would have thought the one on the right is correct, and the one on the left is wrong. Since on the left tree diagram, for instance, the top right node doesn't correspond to the event ##B## but the event ##A\cap B##. But I only ever see variants of the diagram on the left. My guess is that on the left only the last set in the compound event is written to save time? Or am I interpreting the tree structure in the wrong way?

On a slightly unrelated (more pedagogical) note, all of the undergrad level textbooks I've looked at don't have any tree diagrams. Is it the case that they're not used that much beyond school, since everything can be done more rigorously just with set notation?

member 587159 · Feb 29, 2020

I feel like the answer to your question will require the precise context where these diagrams came up.

FactChecker · Feb 29, 2020

The tree diagram on the left implies the same thing as on the right. Putting the full expression at every node would accumulate to a long expression in a deep tree. That is not necessary to convey the meaning.

Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram. If the events of A and B are dependent, you would need to use the conditional probabilities like P(B|A) at the second level nodes to make that statement true on the left diagram. The statement is not appropriate at all on the right diagram.

etotheipi · Feb 29, 2020

FactChecker said:

Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram. If the events of A and B are dependent, you would need to use the conditional probabilities like P(B|A) at the second level nodes to make that statement true.

Yes, I should have worded that more carefully! I should have added that each edge represents a probability of the event in the subsequent node occurring given that all the previous events along the path have occurred.

FactChecker said:

The tree diagram on the left implies the same thing as on the right. Putting the full expression at every node would accumulate to a long expression in a deep tree. That is not necessary to convey the meaning.

Alright, so sort of what what I hoped would be the case! So I assume then that we can take each node to represent the event which is the intersection of all of the events up to and including the event labelled on the node. A necessary simplification for large trees especially. Thank you.

Mark44 · Feb 29, 2020

I don't see any difference between the two diagrams. The topmost node in both diagrams represents ##A \cap B##; i.e., to get to that node both A and B must be true. The next node down indicates that A is true but B is false.

Edit: I didn't see @FactChecker's reply when I posted this.

etotheipi · Feb 29, 2020

Mark44 said:

I don't see any difference between the two diagrams. The topmost node in both diagrams represents ##A \cap B##; i.e., to get to that node both A and B must be true. The next node down indicates that A is true but B is false.

Yeah, the logic remains the same in both cases. It's just that, like you said, the top right node represents the set ##A \cap B## and not the set ##B##, so it's a bit of a confusing shortcut to just write ##B##!

Mark44 · Feb 29, 2020

etotheipi said:

Yeah, the logic remains the same in both cases. It's just that, like you said, the top right node represents the set ##A \cap B## and not the set ##B##, so it's a bit of a confusing shortcut to just write ##B##!

It's not confusing to me at all., and IMO, the figure on the left is better. Each node in the tree represents one event. Traversing the edges between nodes gives the probability of the branch.

BTW, as long as we're discussing the probability of events, rather than sets, the notation ##A \wedge B## would be more suitable.

etotheipi · Feb 29, 2020

Mark44 said:

It's not confusing to me at all., and IMO, the figure on the left is better. Each node in the tree represents one event. Traversing the edges between nodes gives the probability of the branch.

BTW, as long as we're discussing the probability of events, rather than sets, the notation ##A \wedge B## would be more suitable.

I suppose there are two ways of looking at it.

If we use Kolmogorov's model of probability, then events are sets and I would be inclined to say that ##A \cap B## is a better fit for the top right node (i.e. the set of outcomes in A and B, which in this case happens to be an elementary set), as opposed to the set ##\overline{A} \cap B##.

If we discard the notion of sets and instead speak of events as "things that happen", then labelling the top right node as ##B## makes sense. Though I'm unaware of how often this latter convention is used, since my text puts forward all of the probability in terms of set theory and Kolmogorov's model.

Mark44 · Feb 29, 2020

I don't have Kolmogorov at hand, so I can't check. Set theory and probability are closely related, so perhaps he's distinguishing between a set A and the event that ##x \in A##.

etotheipi · Feb 29, 2020

Mark44 said:

I don't have Kolmogorov at hand, so I can't check. Set theory and probability are closely related, so perhaps he's distinguishing between a set A and the event that ##x \in A##.

I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes. So for two tosses of a coin, an event could be ##\{ (H,H), (H,T), (T,H) \}##. Though it could also be an elementary event, which has a size of one, like ##\{ (H,H) \}##.

So if ##A## is the event that the first toss is heads (the subset of ##S## containing the outcomes where this is true, that is, ##A = \{ (H,H), (H,T) \})## and ##B## is the event that the second toss is heads ##B = \{ (H,H), (T,H) \})##, then ##A \cap B## is the event ##\{ (H,H) \}##. So in this sense, it doesn't seem right to label the top right node ##B## since it doesn't represent the whole set ##B##, only a subset which is the intersection with ##A##. The other part of ##B## would be contained within ##\overline{A} \cap B## which would be below.

etotheipi · Feb 29, 2020

That is, if we wrote the sets instead of the letters, we'd end up with something like this using the example of an experiment consisting of two flips of a coin:

Stephen Tashi · Feb 29, 2020

etotheipi said:

I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes.

You are correct. Mathematical probability theory deals with assigning a probability to sets in a probability space. The notion of an event as a statement is used in applications of probability theory. An "outcome" in a probability space ##(\Omega, \mathcal{F}, P)## is defined to be an element of a certain set ##\Omega##. As a special case, one can define the members of ##\Omega## to be statements.

In that special case, for a ##A \subset \Omega##, we can identify the symbol "##A##" with a statement that describes the individual statements that constitute the set ##A##. (It is common in math to "abuse notation" by using the same symbol to mean two different things.) The leads to using notation like "P(A and B)" to mean ##P(A \cap B)##. This, in turn, leads to using the symbols employed in formal logic (##\land##, ##\lor##, ##\lnot##) to abbreviate "and", "or", "not".

Mark44 · Feb 29, 2020

etotheipi said:

I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes. So for two tosses of a coin, an event could be ##\{ (H,H), (H,T), (T,H) \}##. Though it could also be an elementary event, which has a size of one, like ##\{ (H,H) \}##.

Yes, this makes sense.

etotheipi said:

So if ##A## is the event that the first toss is heads (the subset of ##S## containing the outcomes where this is true, that is, ##A = \{ (H,H), (H,T) \})## and ##B## is the event that the second toss is heads ##B = \{ (H,H), (T,H) \})##, then ##A \cap B## is the event ##\{ (H,H) \}##. So in this sense, it doesn't seem right to label the top right node ##B## since it doesn't represent the whole set ##B##, only a subset which is the intersection with ##A##. The other part of ##B## would be contained within ##\overline{A} \cap B## which would be below.

It seems to me to make more sense to me to label each node as H or T, as in this drawing. Labeling the nodes with A and B, etc., adds an unnecessary level of complexity, IMO.

The top two edges represent the event (H, H). The upper branch represent the events (H, H) and (H, T). If I define A as the event "at least one head occurs in two throws of the coin" can be represented as the set ##A = \{(H, H), (H, T), (T, H)\}##. As a compound event, this would be ##(H, H) \vee (H, T) \vee (T, H)##.

etotheipi · Feb 29, 2020

We could perhaps also let ##H_{1}## be the event that a head occurs on the first throw, ##H_2## that a head occurs on the second throw, etc.

Stephen Tashi said:

In that special case, for a ##A \subset \Omega##, we can identify the symbol "##A##" with a statement that describes the individual statements that constitute the set ##A##.

With this, events like ##H_{1}## do a double duty as (formally) sets and (informally) statements / logical propositions. We can translate "the event that a head occurs on the first throw" as "the set of outcomes where a head occurs on the first throw". I might draw it like this,

I think there are lots of different ways of denoting the same maths; I seem to find it simpler to keep the idea of events as sets at the forefront, since it means that the notation meshes together quite nicely.

For instance, ##P(H_2 \cap T_1) = P(T_1) \times P(H_2 | T_1)##

which is consistent with

etotheipi said:

each edge represents a probability of the event in the subsequent node occurring given that all the previous events along the path have occurred.

etotheipi · Feb 29, 2020

FactChecker said:

Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram ... The statement is not appropriate at all on the right diagram.

@FactChecker I'm having difficulty understanding this part, why would this not apply to the second diagram?

If the diagram on the left implies the one on the right, shouldn't they both operate in the same manner?

FactChecker · Feb 29, 2020

etotheipi said:

@FactChecker I'm having difficulty understanding this part, why would this not apply to the second diagram?

If the diagram on the left implies the one on the right, shouldn't they both operate in the same manner?

Sorry if my statement was confusing. Both can be right if the probability calculation of the second level uses the conditional on the top level, like P(A)*P(A|B). I will leave it to you to decide if the two diagrams are equivalent in terms of how difficult it is to translate the diagram node labels to the correct probability calculation.

etotheipi · Mar 1, 2020

Alright, awesome. To me at least it still makes more sense to draw the tree as branching into smaller and smaller subsets; i.e. if a node represents an event, the set must be unambiguously specified. And we might mentally substitute the intersection with all of the previous nodes (i.e. '##B##' ##\implies B\cap A##) if we do abbreviate the expression in the interest of clarity.

Although there is a close correspondence between set theory and logic theory (e.g. some have mentioned the ##\wedge## to ##\cap## isomorphism for acting on statements and events (sets) accordingly), it seems easier to treat probability theory entirely in terms of set theory, since this seems to be how it is formulated usually.

Probably a silly question about tree diagrams

Similar threads

Undergrad Please Explain (actually explain) The Monty Hall Problem

Undergrad A variant of the Monty Hall problem

High School How Rare Is Low Smartphone Usage Among Metro Travelers in Japan?

High School Onto set mapping is the surjective set mapping, and into injective?

Undergrad How do E[X] and E[|X|] relate?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers