False Positive Rate of 1:1.5M Sampling Process

karamand · Aug 17, 2014

I have a sampling process of a very large population in which all items are of type A or type B. I have an analysis of the sampled objects which classifies type A and gives the wrong identification (a false positive) 1 in 1.5 million times.
I take a sample of 1 million and find 1 'hit' i.e classified as type A. What is the probability that it is a false positive?

I like Serena · Aug 18, 2014

philpq said:

I have a sampling process of a very large population in which all items are of type A or type B. I have an analysis of the sampled objects which classifies type A and gives the wrong identification (a false positive) 1 in 1.5 million times.
I take a sample of 1 million and find 1 'hit' i.e classified as type A. What is the probability that it is a false positive?

Hi philpq! Welcome to MHB! :)

Without more information, any 'hit' of type A has a probability of $\frac{1}{1.5\cdot 10^6} \approx 6.7 \cdot 10^{-5}$ of being a false positive.
We will still know basically nothing about the other 999999 observations without more information.

karamand · Aug 18, 2014

I like Serena said:

Hi philpq! Welcome to MHB! :)

Without more information, any 'hit' of type A has a probability of $\frac{1}{1.5\cdot 10^6} \approx 6.7 \cdot 10^{-5}$ of being a false positive.
We will still know basically nothing about the other 999999 observations without more information.

Thanks for your help. I suppose the answer is obvious when I think about it. The sample size is irrelevant. The probability of anyone 'hit' being a false positive is 1 in 1.5 million as stated :)

Jameson · Aug 18, 2014

Usually questions about false positives use Bayes' Theorem and for that you need a lot more information.

$$P(+|\text{ (actually negative)})=\frac{P(\text{(actually negative)}|+) \cdot P(+)}{P(\text{actually negative})}$$

In the above, $+$ means "reads positive". However, you already have this probability so the above isn't necessary to calculate. I'm just pointing out that these topics are very often related. Here is an example "false positive" question you can read on Wikipedia.

False Positive Rate of 1:1.5M Sampling Process

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Undergrad Please Explain (actually explain) The Monty Hall Problem

Undergrad A variant of the Monty Hall problem

Undergrad My basic understanding of set theory

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

High School Onto set mapping is the surjective set mapping, and into injective?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers