jeremy22511 said:
Everything was fine but then a definition of Conditional Probability P[A|B] = \frac{P[AB]}{P} appeared
Yes, the definition of conditional probability is a conceptual leap.
I'll assume you have studied the concept of a "probability space" (perhaps by a different name). It consists of a set ##\Omega## whose elements are called "outcomes" and a "probability measure" ##P## defined on a collection ##\Sigma## of subsets of ##\Omega##. The subsets are called "events". In an advanced course, there is the requirement that the collection of events is a "sigma algebra" of sets.
The laws of probability first taught, such as ##P(A \cup B) = P(A) + P(B) - P(A \cap B)## refer to a single probability space and single probability measure ##P##.
When students are told about conditional probabilities like ##P(A|B)## they often don't understand that conditional probability involves
two different probability spaces.
On can attempt to explain the distinction between ##P(A|B)## and ##P(A \cap B)## by focusing on the meaning of the English words "given" and "and". This type of explanation doesn't make it clear that the notations "##P(A|B)##" and "##P(A \cap B)##" refer to different probability measures applied to the same set of outcomes. In fact, students are liable to think that "##A|B##" and "##A \cap B##" denote different events ( i.e. different subsets of outcomes) and that the same probability measure ##P## is applied to them.
The correct way to look at conditional probability is that "##P(...|B)##" refers to a probability measure
different than ##P##. From the viewpoint of pure math, it would clearer to use notation like:
Definition of conditional probability: Given a probability space ##(\Omega, \Sigma,P) ##The conditional probability ##P(A|B)## is defined to be the probability of the event ##A\cap B## in the probability space ##(\Omega, \Sigma, Q)## where the probability measure ##Q## is given by ##Q(X) = P(X \cap B)/ P(B)##
Using the "##P(A|B)##" type notation, you should think of the "##P(...|B)##" part of it as notation for a probability measure ##Q## that is different than ##P##. Think of it that way instead of thinking of "##A|B##" and ##A \cap B## is different events that are assigned values by the same probability measure ##P##.