Combining probabilities / results

  • Context: High School 
  • Thread starter Thread starter pinball1970
  • Start date Start date
  • Tags Tags
    Probabilities
Click For Summary

Discussion Overview

The discussion revolves around the combination of probabilities related to COVID-19 testing, specifically focusing on the interpretation of test results and the application of probability theory to assess the likelihood of having the virus based on test outcomes. Participants explore concepts such as test sensitivity, specificity, and the prevalence of the disease in the population.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant presents a scenario involving a lateral flow test with 75% accuracy and a 20% prevalence of COVID-19 in the UK, questioning the interpretation of a negative result and how to combine results from multiple tests.
  • Another participant outlines the factors affecting test results, including disease prevalence, test sensitivity, and specificity, and introduces a probability tree to analyze the situation.
  • Subsequent posts elaborate on the probability tree, detailing the calculations for the likelihood of having the virus after one or two negative tests, while noting the assumption of independence between test results.
  • Some participants express skepticism about the independence of tests, arguing that a negative result does not necessarily provide conclusive evidence of not having the virus, as it may not detect antigens present in the body.
  • There is a discussion about the construction of the probability tree and its advantages over algebraic methods, with one participant seeking clarification on how to build the tree correctly.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the implications of negative test results or the independence of tests. While some agree on the mathematical approach to combining probabilities, others challenge the assumptions made regarding test independence and the interpretation of results.

Contextual Notes

Limitations include assumptions about test independence, the accuracy of the test, and the specific conditions under which tests are administered. The discussion also highlights the complexity of interpreting test results in the context of varying sensitivity and specificity.

pinball1970
Gold Member
2025 Award
Messages
3,697
Reaction score
5,863
TL;DR
Six lateral flow tests and counting all negative. I started thinking if I took one test right after another how do I combine those probabilities correctly?
I had a LFT test this week which gives a 75% accuracy result.Negative.I also wanted to stick in some prior knowledge if I can, 20% of the UK currently has Covid say.Vanadium 50 illustrated this to me a while back so I hope I don’t mess it up
20False pos
No Covid in UK80%60Neg
Covid in UK20%5False Neg
15Pos
So if I am negative I am either positive with 5% probability or negative with 60% probability is this right?

If I had another test which is also negative how much confidence can I have now?

I am not sure how to combine the results, the “and” “or” x and add rule does not work I end up with probabilities over one
 
Physics news on Phys.org
There are three factors: the prevalence of the disease in the population (##d##); the sensitivity (##p##) of the test (which is the probability of a positive test given you have the virus); and, the specificity (##q##) of the test (which is the probability of a negative test given you do not have the virus).

The simplest way to analyse this is to use a probability tree:

##dp## - Virus and positive test
##d(1-p)## - Virus and negative test (false negative)

##(1-d)q## - No virus and negative test
##(1-d)(1-q)## - No virus and positive test (false positive)

If you test negative, then the probability you do not have the virus is:$$P_0 = \frac{(1-d)q}{(1-d)q + d(1-p)}$$
I had to post this to get the Latex to render. Notes about second test to follow.
 
  • Like
Likes   Reactions: sysprog and pinball1970
Doing a second test (if the first is negative) has the following effect:

##d(1-p)p## - Virus, negative first test, positive second test.
##d(1-p)(1-p)## - Virus and two negative tests

##(1-d)q^2## - No virus and two negative tests
##(1-d)q(1-q)## - No virus, negative first test and positive second test

If you have two negative tests, then the probability you do not have the virus is:$$P_1 = \frac{(1-d)q^2}{(1-d)q^2 + d(1-p)^2}$$
Note that assuming that the test is sufficently sensitive, then ##(1-p)^2## is very small compared to ##q^2## and ##P_1 \approx 1##.

(This assumes that the test results are independent and there is nothing specific about an individual that would imply repeated false negatives.)
 
  • Informative
  • Like
Likes   Reactions: sysprog and pinball1970
PeroK said:
Doing a second test (if the first is negative) has the following effect:

##d(1-p)p## - Virus, negative first test, positive second test.
##d(1-p)(1-p)## - Virus and two negative tests

##(1-d)q^2## - No virus and two negative tests
##(1-d)q(1-q)## - No virus, negative first test and positive second test

If you have two negative tests, then the probability you do not have the virus is:$$P_1 = \frac{(1-d)q^2}{(1-d)q^2 + d(1-p)^2}$$
Note that assuming that the test is sufficently sensitive, then ##(1-p)^2## is very small compared to ##q^2## and ##P_1 \approx 1##.

(This assumes that the test results are independent and there is nothing specific about an individual that would imply repeated false negatives.)
I will try this and feedback thanks
 
  • Like
Likes   Reactions: sysprog
PeroK said:
(This assumes that the test results are independent and there is nothing specific about an individual that would imply repeated false negatives.)
And therein lies the problem.

Assuming you are swabbing correctly then a negative test simply shows that the test has not detected antigens in your saliva/nasal mucus. It is perfectly possible that you have COVID-19 but do not currently have antigens in your saliva/nasal mucus*; you gain no information about this situation by doing another lateral flow test.
 
Last edited:
pbuk said:
And therein lies the problem.

Assuming you are swabbing correctly then a negative test simply shows that the test has not detected antigens in your saliva/nasal mucus. It is perfectly possible that you have COVID-19 but do not currently have antigens in your saliva/nasal mucus; you gain no information about this situation by doing another lateral flow test.
In general, the independence of tests on the same subject is another factor. That's a clinical question!
 
pbuk said:
And therein lies the problem.

Assuming you are swabbing correctly then a negative test simply shows that the test has not detected antigens in your saliva/nasal mucus. It is perfectly possible that you have COVID-19 but do not currently have antigens in your saliva/nasal mucus*; you gain no information about this situation by doing another lateral flow test.
I put this in maths rather medical as I am genuinely interested in probability side of it.
All the numbers I have got so far make sense, If I increase/decrease the sensitivity for instance.

I am assuming “all else being equal” for the medical side.

I am not 100% on how the tree is built.
 
pinball1970 said:
I am not 100% on how the tree is built.
For a simple case like this, the tree has an initial branch into ##d## (has the virus) and ##1-d## (does not have the virus). Then, each of these branches according to negative and positive tests.

You then look at the end of each branch and decide whether it's in your sample space - in this case a negative test - or not. Then you pick out the cases you are interested in (does not have the disease) from this reduced sample space This gives you the probability of not having the disease given a negative test.

This is a good alternative to using the algebraic version of Bayes' theorem, as the theorem is emedded in the tree structure.

In the case where you have two negative tests, you simply extend the tree by branching each of the first negative tests again.

One of the great advantages is that if we complicate the situation with more options and more levels, then using the raw Bayes' theorem becomes algebraically unworkable compared to the tree representation.
 
  • Like
  • Informative
Likes   Reactions: sysprog and pinball1970

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 27 ·
Replies
27
Views
3K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 9 ·
Replies
9
Views
6K
  • · Replies 41 ·
2
Replies
41
Views
6K