What motivates Bayes' Theorem?

  • Context: High School 
  • Thread starter Thread starter Agent Smith
  • Start date Start date
  • Tags Tags
    Revolution Theorem
Click For Summary

Discussion Overview

The discussion revolves around the motivations and implications of Bayes' Theorem, exploring its role in probability theory, its revolutionary aspects, and its applications in various contexts, including Bayesian statistics and frequentist interpretations. Participants examine the theorem's ability to reverse conditional probabilities and its significance in scientific reasoning.

Discussion Character

  • Exploratory
  • Debate/contested
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • Some participants describe Bayes' theorem as a means to update probabilities based on new evidence, suggesting it enhances probability estimation.
  • Others differentiate between Bayes' Theorem and Bayesian statistics, noting that the former is a fundamental component of probability theory while the latter extends its application.
  • A participant mentions that Bayes' Theorem allows for the calculation of reverse conditional probabilities, which is significant in scientific contexts.
  • Some argue that Bayesian methods can yield probabilities from limited or no evidence, contrasting with frequentist approaches that require more substantial data.
  • Concerns are raised about the meaningfulness of probabilities derived from Bayesian interpretations, particularly in contexts like cosmology.
  • Participants discuss the implications of the prosecutor's fallacy in relation to Bayes' Theorem, emphasizing the importance of distinguishing between probabilities and likelihoods.
  • There is a mention of the historical context of Bayes' Theorem, including its association with figures like Bayes and Laplace, and its purported revolutionary impact on probability theory.
  • Some express uncertainty about the preferred forms of Bayes' Theorem and the derivation process, indicating a desire for clarity in understanding its applications.
  • One participant questions the implications of having a zero probability in the denominator of Bayes' Theorem, referencing a caveat noted in external sources.

Areas of Agreement / Disagreement

Participants express a mix of agreement and disagreement, particularly regarding the interpretations of Bayesian versus frequentist approaches. There is no consensus on the meaningfulness of Bayesian probabilities in all contexts, and discussions about the implications of the theorem remain unresolved.

Contextual Notes

Some participants highlight limitations in understanding the derivation of Bayes' Theorem and its applications, indicating that certain assumptions and definitions may not be universally accepted.

  • #31
Agent Smith said:
A frequentist takes a sample (the least). A Bayesian ___? 🤔
Sometimes there is no sample. So a Bayesian can still choose a prior that represents their uncertainty.
 
  • Like
Likes   Reactions: Agent Smith
Physics news on Phys.org
  • #32
PeroK said:
There is only one universe, so the sample size is 1.
PeroK said:
The universe is either flat or it isn't. Probability doesn't apply.
This isn’t correct. Long term frequency doesn’t apply, but frequency isn’t probability.

What defines probability is the Kolomgorov axioms. Anything that fulfills those axioms is probability. Frequency is one example, and uncertainty is another.

There is no population of universes to draw a large sample from and form frequencies, but there is uncertainty about the universe’s flatness. So probability does apply.
 
  • Like
Likes   Reactions: Agent Smith
  • #33
Dale said:
This isn’t correct. Long term frequency doesn’t apply, but frequency isn’t probability.

What defines probability is the Kolomgorov axioms. Anything that fulfills those axioms is probability. Frequency is one example, and uncertainty is another.

There is no population of universes to draw a large sample from and form frequencies, but there is uncertainty about the universe’s flatness. So probability does apply.
The Kolmogorov axioms are a pure mathematical construction. Whether you can apply them to a given physical scenario is the question. The universe cannot, by itself, satisfy mathematical axioms. The universe is not a measure space.

If you conclude, for example, that the probability that the universe is flat is 90%, then (as a frequentist) I don't know what to make of that. It's a number and it may have some meaning, but I cannot relate it to what I understand as a probability.

Uncertainty, by itself, does not imply probabilities. One example is the Two-Envelope Problem:

https://en.wikipedia.org/wiki/Two_envelopes_problem

If you assume that there must be probabilities, then you end up with contradictory calculations and inconsistencies. Until, you realise that there must be a distribution. Without specifying the distribution, the numbers you calculate are not probabilities. To have probabilities relating to the universe, you must specify the distribution which applies to the universe(s). Without it, the numbers you calculate are not probabilities.
 
  • #34
PeroK said:
The Kolmogorov axioms are a pure mathematical construction.
Yes. But they are the mathematical construction that defines probability.

PeroK said:
The universe cannot, by itself, satisfy mathematical axioms
Nobody claimed or even implied otherwise.

PeroK said:
If you conclude, for example, that the probability that the universe is flat is 90%, then (as a frequentist) I don't know what to make of that. It's a number and it may have some meaning, but I cannot relate it to what I understand as a probability.
Which is precisely why I mention the axioms. A lot of frequentists make the same mistake, but it is a mistake.

A lot of frequentists mistakenly believe that probability is defined by long-run frequencies, but that is not correct. Probability is defined by the axioms. Long run frequencies are an example of probability because long run frequencies satisfy the axioms.

This is similar to vectors. Many students are introduced to vectors as little arrows with a magnitude and direction. Later, they are surprised to learn that polynomials are also vectors. We don’t think of polynomials as arrows, but they satisfy the axioms of vectors. Little arrows are the easiest example of vectors, but not the only example. Similarly frequencies are the easiest example of probability, but not the only example.

So a statement that the probability that the universe is flat is 90% cannot reasonably be understood as a frequency, but it certainly can be understood as an uncertainty. And since uncertainties are also probabilities, it is a valid statement.
 
  • #35
Dale said:
Yes. But they are the mathematical construction that defines probability.

Nobody claimed or even implied otherwise.

Which is precisely why I mention the axioms. A lot of frequentists make the same mistake, but it is a mistake.

A lot of frequentists mistakenly believe that probability is defined by long-run frequencies, but that is not correct. Probability is defined by the axioms. Long run frequencies are an example of probability because long run frequencies satisfy the axioms.

This is similar to vectors. Many students are introduced to vectors as little arrows with a magnitude and direction. Later, they are surprised to learn that polynomials are also vectors. We don’t think of polynomials as arrows, but they satisfy the axioms of vectors. Little arrows are the easiest example of vectors, but not the only example. Similarly frequencies are the easiest example of probability, but not the only example.

So a statement that the probability that the universe is flat is 90% cannot reasonably be understood as a frequency, but it certainly can be understood as an uncertainty. And since uncertainties are also probabilities, it is a valid statement.
That, as I understand it, is the Bayesian interpretation. That the frequentist interpretation is a mistake is pushing your luck. At the very least, there is a Bayesian prior the the frequentist interpretation is not a mistake with nonzero probability!
 
  • #36
PeroK said:
That the frequentist interpretation is a mistake is pushing your luck
That is not what I said was a mistake. What is a mistake is the belief that the frequentist interpretation defines probability. Long run frequencies and uncertainties are both valid examples of probabilities. But what defines probability is the axioms.
 
  • #37
Dale said:
Sometimes there is no sample. So a Bayesian can still choose a prior that represents their uncertainty.
Choose as opposed to compute? This would matter I feel.
 
  • #38
Dale said:
Which is precisely why I mention the axioms. A lot of frequentists make the same mistake, but it is a mistake.
So frequentism (I hope that's the correct term) is just one version of probability.
 
  • #39
Agent Smith said:
Choose as opposed to compute? This would matter I feel.
Yes. Choose. With no data how would you compute it? This is one of the best things about Bayesian statistics, and it is also one of the things that most bothers people learning about it. It allows you to include nebulous things like expert opinion that may not be something that you can compute.

I dive into this topic a bit in one of my insights articles, but I don’t remember which one

Agent Smith said:
So frequentism (I hope that's the correct term) is just one version of probability.
Yes. Just like an arrow is one version of a vector.
 
  • Like
Likes   Reactions: Agent Smith
  • #40
@Dale I'm a bit lost here, Bayes' theorem is ##\text{P(Hypothesis|Evidence)}##. Isn't the evidence a measurement with/without computation, a/the data? However I can see how the prior probability can be chosen (it's just a guess???). The posterior probability would be the result of a computation, using Bayes' Theorem.
 
  • #41
Also @Dale , can we reduce Bayesian uncertainty to a frequentism, give it a frequentist interpretation? So if I'm 90% certain the earth is flat, it would "mean" 9 out 10 scenarios I'm right and 1 out 10 I'm wrong.
 
  • #42
Agent Smith said:
@Dale I'm a bit lost here, Bayes' theorem is ##\text{P(Hypothesis|Evidence)}##. Isn't the evidence a measurement with/without computation, a/the data? However I can see how the prior probability can be chosen (it's just a guess???). The posterior probability would be the result of a computation, using Bayes' Theorem.
So in this context Bayes theorem is $$P(Hypothesis|Evidence)=\frac{P(Evidence|Hypothesis) \ P(Hypothesis)}{P(Evidence)}$$ What is chosen is ##P(Hypothesis)##. This is called the prior. It represents the uncertainty before looking at the evidence. It can be based on any prior studies and any prior data, but if there really is not any evidence then it can be based on things like expert opinion or rough estimation. Whatever your knowledge is, from whatever source you have, before looking at your new data, the prior is chosen to reflect that knowledge.

Agent Smith said:
Also @Dale , can we reduce Bayesian uncertainty to a frequentism, give it a frequentist interpretation? So if I'm 90% certain the earth is flat, it would "mean" 9 out 10 scenarios I'm right and 1 out 10 I'm wrong.
In situations where frequentist probabilities exist, they will match Bayesian probabilities. There reverse is not necessarily true. There are situations that there is no reasonable definition of frequentist probability that the Bayesian probability models just fine.

Also, Bayesian statistical tests are usually equivalent to frequentist statistical tests when the Bayesian test is performed using an uninformative prior.
 
  • Like
Likes   Reactions: Agent Smith
  • #43
@Dale muchas gracias.
 
  • Like
Likes   Reactions: Dale
  • #44
PeroK said:
The universe is either flat or it isn't. Probability doesn't apply.
@Dale

Not intending to start a fight, but in what way does Bayesian thinking (statistics/probability) inform Quantum Physics? Last I checked, from the very layman's discussions I've had, for a particle it isn't the case that it is "either flat or it isn't". Some say it's both here and there (position-wise). The (Bayesian) uncertainty is a an actual feature of the world of particles I believe.
 
  • #45
Agent Smith said:
in what way does Bayesian thinking (statistics/probability) inform Quantum Physics?
I don’t know. I am not a big quantum mechanics guy. My physics knowledge runs to classical physics, relativity, and biomedical engineering.

I do know that there is a quantum Bayesian interpretation called qbism, but I don’t know the details. I tend to be interpretation-agnostic in many things, so I probably would not be a particularly strong adherent of it.
 
  • #46
@Dale , but per the axioms of probability (gracias for that), the sum of the probabilities of a particle's position must add up to ##1##. Doesn't that mean they're mutually exclusive? Perhaps you can answer from a mathematician's point of view.
 
  • #47
Agent Smith said:
@Dale , but per the axioms of probability (gracias for that), the sum of the probabilities of a particle's position must add up to ##1##.
Yes. This means that the probability that the particle is somewhere is 1.

Agent Smith said:
Doesn't that mean they're mutually exclusive? Perhaps you can answer from a mathematician's point of view.
Again, I am not a big QM guy, but what are you referring to by mutually exclusive?
 
  • Like
Likes   Reactions: Agent Smith
  • #48
Dale said:
Yes. This means that the probability that the particle is somewhere is 1.

Again, I am not a big QM guy, but what are you referring to by mutually exclusive?
It's ok, I'll try and work it out on my own. Gracias,
 

Similar threads

  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 19 ·
Replies
19
Views
2K
  • · Replies 7 ·
Replies
7
Views
933
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 47 ·
2
Replies
47
Views
5K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 23 ·
Replies
23
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K