Probably a silly question about tree diagrams

In summary, tree diagrams correspond to events in a sample space and the probability of an event occurring at any node is equal to the product of the probabilities along the path from the root to that node. The diagram on the left, which shows the intersection of events, is often used as a simpler representation of the diagram on the right. However, the statement that the probability of an event at any node is equal to the product of the probabilities along the path is not always true, as it depends on the independence of events. Additionally, there is a difference in notation between using set theory and speaking of events as "things that happen".
  • #1
etotheipi
Each node of a tree diagram corresponds to an event (i.e. a subset of the sample space), and the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node.

So considering the following two tree diagrams,

1582983830312.png


I would have thought the one on the right is correct, and the one on the left is wrong. Since on the left tree diagram, for instance, the top right node doesn't correspond to the event ##B## but the event ##A\cap B##. But I only ever see variants of the diagram on the left. My guess is that on the left only the last set in the compound event is written to save time? Or am I interpreting the tree structure in the wrong way?

On a slightly unrelated (more pedagogical) note, all of the undergrad level textbooks I've looked at don't have any tree diagrams. Is it the case that they're not used that much beyond school, since everything can be done more rigorously just with set notation?
 
Physics news on Phys.org
  • #2
I feel like the answer to your question will require the precise context where these diagrams came up.
 
  • Like
Likes etotheipi
  • #3
The tree diagram on the left implies the same thing as on the right. Putting the full expression at every node would accumulate to a long expression in a deep tree. That is not necessary to convey the meaning.

Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram. If the events of A and B are dependent, you would need to use the conditional probabilities like P(B|A) at the second level nodes to make that statement true on the left diagram. The statement is not appropriate at all on the right diagram.
 
  • Like
Likes sysprog and etotheipi
  • #4
FactChecker said:
Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram. If the events of A and B are dependent, you would need to use the conditional probabilities like P(B|A) at the second level nodes to make that statement true.

Yes, I should have worded that more carefully! I should have added that each edge represents a probability of the event in the subsequent node occurring given that all the previous events along the path have occurred.

FactChecker said:
The tree diagram on the left implies the same thing as on the right. Putting the full expression at every node would accumulate to a long expression in a deep tree. That is not necessary to convey the meaning.

Alright, so sort of what what I hoped would be the case! So I assume then that we can take each node to represent the event which is the intersection of all of the events up to and including the event labelled on the node. A necessary simplification for large trees especially. Thank you.
 
  • #5
I don't see any difference between the two diagrams. The topmost node in both diagrams represents ##A \cap B##; i.e., to get to that node both A and B must be true. The next node down indicates that A is true but B is false.

Edit: I didn't see @FactChecker's reply when I posted this.
 
  • Like
Likes sysprog and etotheipi
  • #6
Mark44 said:
I don't see any difference between the two diagrams. The topmost node in both diagrams represents ##A \cap B##; i.e., to get to that node both A and B must be true. The next node down indicates that A is true but B is false.

Yeah, the logic remains the same in both cases. It's just that, like you said, the top right node represents the set ##A \cap B## and not the set ##B##, so it's a bit of a confusing shortcut to just write ##B##!
 
  • #7
etotheipi said:
Yeah, the logic remains the same in both cases. It's just that, like you said, the top right node represents the set ##A \cap B## and not the set ##B##, so it's a bit of a confusing shortcut to just write ##B##!
It's not confusing to me at all., and IMO, the figure on the left is better. Each node in the tree represents one event. Traversing the edges between nodes gives the probability of the branch.

BTW, as long as we're discussing the probability of events, rather than sets, the notation ##A \wedge B## would be more suitable.
 
  • #8
Mark44 said:
It's not confusing to me at all., and IMO, the figure on the left is better. Each node in the tree represents one event. Traversing the edges between nodes gives the probability of the branch.

BTW, as long as we're discussing the probability of events, rather than sets, the notation ##A \wedge B## would be more suitable.

I suppose there are two ways of looking at it.

If we use Kolmogorov's model of probability, then events are sets and I would be inclined to say that ##A \cap B## is a better fit for the top right node (i.e. the set of outcomes in A and B, which in this case happens to be an elementary set), as opposed to the set ##\overline{A} \cap B##.

If we discard the notion of sets and instead speak of events as "things that happen", then labelling the top right node as ##B## makes sense. Though I'm unaware of how often this latter convention is used, since my text puts forward all of the probability in terms of set theory and Kolmogorov's model.
 
  • #9
I don't have Kolmogorov at hand, so I can't check. Set theory and probability are closely related, so perhaps he's distinguishing between a set A and the event that ##x \in A##.
 
  • #10
Mark44 said:
I don't have Kolmogorov at hand, so I can't check. Set theory and probability are closely related, so perhaps he's distinguishing between a set A and the event that ##x \in A##.

I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes. So for two tosses of a coin, an event could be ##\{ (H,H), (H,T), (T,H) \}##. Though it could also be an elementary event, which has a size of one, like ##\{ (H,H) \}##.

So if ##A## is the event that the first toss is heads (the subset of ##S## containing the outcomes where this is true, that is, ##A = \{ (H,H), (H,T) \})## and ##B## is the event that the second toss is heads ##B = \{ (H,H), (T,H) \})##, then ##A \cap B## is the event ##\{ (H,H) \}##. So in this sense, it doesn't seem right to label the top right node ##B## since it doesn't represent the whole set ##B##, only a subset which is the intersection with ##A##. The other part of ##B## would be contained within ##\overline{A} \cap B## which would be below.
 
Last edited by a moderator:
  • #11
That is, if we wrote the sets instead of the letters, we'd end up with something like this using the example of an experiment consisting of two flips of a coin:

1582998574763.png
 
  • #12
etotheipi said:
I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes.

You are correct. Mathematical probability theory deals with assigning a probability to sets in a probability space. The notion of an event as a statement is used in applications of probability theory. An "outcome" in a probability space ##(\Omega, \mathcal{F}, P)## is defined to be an element of a certain set ##\Omega##. As a special case, one can define the members of ##\Omega## to be statements.

In that special case, for a ##A \subset \Omega##, we can identify the symbol "##A##" with a statement that describes the individual statements that constitute the set ##A##. (It is common in math to "abuse notation" by using the same symbol to mean two different things.) The leads to using notation like "P(A and B)" to mean ##P(A \cap B)##. This, in turn, leads to using the symbols employed in formal logic (##\land##, ##\lor##, ##\lnot##) to abbreviate "and", "or", "not".
 
Last edited:
  • Like
Likes sysprog and etotheipi
  • #13
etotheipi said:
I could be very wrong (!) since I'm still not too familiar with the formalisation, however as far as I'm aware an "event" is defined as a subset of the sample space, whose elements are outcomes. So for two tosses of a coin, an event could be ##\{ (H,H), (H,T), (T,H) \}##. Though it could also be an elementary event, which has a size of one, like ##\{ (H,H) \}##.
Yes, this makes sense.
etotheipi said:
So if ##A## is the event that the first toss is heads (the subset of ##S## containing the outcomes where this is true, that is, ##A = \{ (H,H), (H,T) \})## and ##B## is the event that the second toss is heads ##B = \{ (H,H), (T,H) \})##, then ##A \cap B## is the event ##\{ (H,H) \}##. So in this sense, it doesn't seem right to label the top right node ##B## since it doesn't represent the whole set ##B##, only a subset which is the intersection with ##A##. The other part of ##B## would be contained within ##\overline{A} \cap B## which would be below.
It seems to me to make more sense to me to label each node as H or T, as in this drawing. Labeling the nodes with A and B, etc., adds an unnecessary level of complexity, IMO.
tree.png


The top two edges represent the event (H, H). The upper branch represent the events (H, H) and (H, T). If I define A as the event "at least one head occurs in two throws of the coin" can be represented as the set ##A = \{(H, H), (H, T), (T, H)\}##. As a compound event, this would be ##(H, H) \vee (H, T) \vee (T, H)##.
 
  • Like
Likes etotheipi
  • #14
We could perhaps also let ##H_{1}## be the event that a head occurs on the first throw, ##H_2## that a head occurs on the second throw, etc.
Stephen Tashi said:
In that special case, for a ##A \subset \Omega##, we can identify the symbol "##A##" with a statement that describes the individual statements that constitute the set ##A##.

With this, events like ##H_{1}## do a double duty as (formally) sets and (informally) statements / logical propositions. We can translate "the event that a head occurs on the first throw" as "the set of outcomes where a head occurs on the first throw". I might draw it like this,

1583005525965.png


I think there are lots of different ways of denoting the same maths; I seem to find it simpler to keep the idea of events as sets at the forefront, since it means that the notation meshes together quite nicely.

For instance, ##P(H_2 \cap T_1) = P(T_1) \times P(H_2 | T_1)##

which is consistent with

etotheipi said:
each edge represents a probability of the event in the subsequent node occurring given that all the previous events along the path have occurred.
 
Last edited by a moderator:
  • #15
FactChecker said:
Your statement: "the probability of the event at any node occurring equals the product of the probabilities along the path from the root to that node. " is not always true in either diagram ... The statement is not appropriate at all on the right diagram.

@FactChecker I'm having difficulty understanding this part, why would this not apply to the second diagram?

If the diagram on the left implies the one on the right, shouldn't they both operate in the same manner?
 
Last edited by a moderator:
  • #16
etotheipi said:
@FactChecker I'm having difficulty understanding this part, why would this not apply to the second diagram?

If the diagram on the left implies the one on the right, shouldn't they both operate in the same manner?
Sorry if my statement was confusing. Both can be right if the probability calculation of the second level uses the conditional on the top level, like P(A)*P(A|B). I will leave it to you to decide if the two diagrams are equivalent in terms of how difficult it is to translate the diagram node labels to the correct probability calculation.
 
  • Like
Likes etotheipi
  • #17
Alright, awesome. To me at least it still makes more sense to draw the tree as branching into smaller and smaller subsets; i.e. if a node represents an event, the set must be unambiguously specified. And we might mentally substitute the intersection with all of the previous nodes (i.e. '##B##' ##\implies B\cap A##) if we do abbreviate the expression in the interest of clarity.

Although there is a close correspondence between set theory and logic theory (e.g. some have mentioned the ##\wedge## to ##\cap## isomorphism for acting on statements and events (sets) accordingly), it seems easier to treat probability theory entirely in terms of set theory, since this seems to be how it is formulated usually.
 
Last edited by a moderator:

1. What is a tree diagram?

A tree diagram is a visual representation of a set of possible outcomes or events. It is often used in probability and statistics to show the different paths or branches that can occur in a given situation.

2. How do you read a tree diagram?

To read a tree diagram, start at the top and follow the branches down to the end. Each branch represents a possible outcome, and the probability of that outcome is shown on the branch. The probabilities of all the branches should add up to 1.

3. What is the purpose of a tree diagram?

The purpose of a tree diagram is to help visualize and understand the possible outcomes and probabilities of a situation. It can be used to make predictions or decisions based on the likelihood of different events occurring.

4. Can a tree diagram have more than two branches at each level?

Yes, a tree diagram can have any number of branches at each level. The number of branches represents the number of possible outcomes or events at that point in the diagram.

5. How is a tree diagram different from a Venn diagram?

A tree diagram shows the different possible outcomes or events in a sequential manner, while a Venn diagram shows the relationships between different sets of data. Additionally, a tree diagram is often used for probability and statistics, while a Venn diagram is used for data analysis and set theory.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
18
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
12
Views
5K
  • Set Theory, Logic, Probability, Statistics
Replies
10
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
18
Views
893
Replies
3
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
11
Views
3K
  • Special and General Relativity
Replies
6
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
2K
  • Engineering and Comp Sci Homework Help
Replies
5
Views
2K
Back
Top