Discussion Overview
The discussion revolves around the computation of the probability ##\mathbb{P}(p_1 \geq p_2)## for two series of independent Bernoulli experiments with unknown success probabilities ##p_1## and ##p_2##. Participants explore various statistical approaches and assumptions related to this problem.
Discussion Character
- Exploratory
- Technical explanation
- Debate/contested
Main Points Raised
- One participant questions whether the problem is equivalent to the two distributions being equal, suggesting that if the ratios are equal, then the expected value of the difference would be zero.
- Another participant proposes a two-sample test with a null hypothesis of ##H_0: p_1 = p_2## and an alternative hypothesis of ##H_a: p_1 > p_2##, suggesting a one-tailed T test using pooled variance.
- A participant asserts that computing ##\mathbb{P}(p_1 \geq p_2)## is not possible without prior assumptions about the probabilities, indicating that the assumption of ##p_1 \geq p_2## is not specific enough for computation.
- In contrast, another participant explains that Bayes' theorem can be used to compute ##\mathbb{P}(p_1 \geq p_2)##, emphasizing the need for a joint prior probability distribution and detailing the process of deriving the posterior distribution based on observed successes.
- This latter participant notes that the result will depend on the chosen prior distribution and provides a mathematical formulation for the likelihood of the observed data.
Areas of Agreement / Disagreement
Participants express differing views on the feasibility of computing ##\mathbb{P}(p_1 \geq p_2)## without prior assumptions, with some advocating for Bayesian approaches while others highlight the limitations of the assumptions involved. No consensus is reached regarding the best method to approach the problem.
Contextual Notes
The discussion highlights the dependence on prior distributions in Bayesian statistics and the implications of different statistical tests. There are unresolved questions about the equivalence of the distributions and the assumptions necessary for computation.