You test positive: what are the chances that you actually have the disease?

VarlosZ · September 17, 2006, 2:10am

This is a poll of your intuitive response to the given scenario. Don’t do any calculations in your head, and don’t read the thread before deciding on an answer – just say what you’d guess the answer to be if you only had a few seconds to think. If you’ve come across similar questions in the past and are thus able to give a more accurate guess, please mention that.
You go to the hospital and are randomly screened for a serious but rare disease with which one person in a thousand is infected. The test for the disease has a false positive rate of 5% and false negative rate of 1%. That is, if you’ve got the disease, the test will come out ‘positive’ 99% of the time, while if you don’t have the disease, the test will come out ‘negative’ 95% of the time.

Your test comes back positive – what is the likelihood that you actually have the disease?

Dunderman · September 17, 2006, 2:12am

Just quick, no calculations: one in fifty.

Frank · September 17, 2006, 2:13am

97%

Do I win the audition?

susan · September 17, 2006, 2:15am

If I had any of the risk factors for exposure, I’d assume I had it.

I know this because even when I had minimal HIV risk exposures, I was still plenty nervous waiting for the test results.

ultrafilter · September 17, 2006, 2:22am

Probably somewhere around 25%, just based on the answers I’ve gotten for this problem before. Those usually use a 1% disease rate, though, so this might be lower.

I cheated and did the calculations. For those who are curious, the probability is in the spoiler box below.

The probability is roughly .02. This is a standard homework problem from an introductory probability class involving something known as Bayes’ theorem.

Frank · September 17, 2006, 2:31am

Obviously, I need to take a statistics class.

jackelope · September 17, 2006, 2:47am

I’d read this in one of John Allen Paulos’s books; I think it was A Mathematician Reads the Newspaper. I remembered the results were surprising, so I got out the calculator:

[spoiler]Take a random group of 1,000,000 people. 1,000 of them are infected with the disease.

Of those 1,000, 99%, or 990, will get a positive result.

Of the 999,000 who are NOT infected, 5%, or 49,950, will get a false positive result.

So 50,940 people will get positive results. Of those, 49,950 will NOT have the disease.

If you get a positive result, the odds are thus 98% that you do NOT have the disease. Weird.

Unless my math is wrong, in which case my only excuse is having been an English major.[/spoiler]

susan · September 17, 2006, 3:09am

Ah. I gave my “intuitive response” rather than attempting to think through the statistical probability. However, this highlights one of the problems in using statistics in relation to a single individual: I’m just a speck in the datapool from an objective perspective, but when you ask me what I intuit *my own *position in a data set, I’ve got all sorts of information that makes me non-random in terms of my own assessment of my reisk (which in this sample you’d call “error variance”), such as a knowledge of my risk factors for exposure to or development of the disease. I may have a statistically low chance of having the disease in your sample, but that chance is a function of my place in a random sample, and would differ if your pool weren’t random (e.g., if you screened out everyone who hadn’t eaten disease-bearing candy within the week, and only tested those of us who had).

On top of that, I imagine that our ability to intuit our own odds depends in part on our relative pessimism, versus our ability to intuit the general odds for a random person unknown to us.

Garfield226 · September 17, 2006, 3:18am

This is very similar to a question given as homework in my Economics and Business Statistics class (we’re on the ‘Probability’ chapter). The numbers are different, but the question is identical.

The instructor did the calculations for your question as an example, and then (using the same numbers) posed two more to us for homework: What proportion of all tests are positive? and If the test result is negative, what is the probability you do not have the disease?

Putting it in probability terms …

P(D) represents the probability of having the disease. P(D’) represents the probability of not having the disease.
P(T) represents the probability of a positive test. P(T’) represents the probability of a negative test.

You’re asking for P(D|T), or the probability of having the disease, given a positive test result. My teacher’s additional questions are asking for 1. P(T|D) + P(T|D’), or the probability of a positive test given the person has the disease plus the probability of a positive test given the person does not have the disease, and 2. P(D’|T’), or the probability of not having the disease given a negative test result.

I think.

VarlosZ · September 17, 2006, 4:24pm

I just copied the problem from Wikipedia’s article on Bayes’ Theorem. I wouldn’t be able to use Bayes’ Theorem off the top of my head, but I ran across a problem like this in college and thought it was interesting. And, of course, most people will get the question very wrong, giving an answer in the 90s.
Shoshana: And that, presumably, is why they don’t do random screening for very rare diseases: the flood of false positives would obscure the true positives.

jackelope: Heh, nice common sense solution. I don’t think it would have occurred to me to solve the problem like that.

Eureka · September 17, 2006, 4:58pm

I think I’ve heard a variation of this problem as a Car Talk Puzzler (or something like that) and so suspected that the answer was that it is a lot less likely that you have the disease than it seems like it should be. I mean, positive test result means you have the disease, doesn’t it?

cerberus · September 17, 2006, 8:51pm

There are four possible combinations of Disease and Test:

DandT ~ Disease Present and Screen Shows Positive
DandT* ~ Disease Present and Screen Shows Negative

DandT ~ Disease Absent and Screen Shows Positive
DandT* ~ Disease Absent and Screen Shows Negative

Pr{D|T} ~ Probability that a subject has the disease, given a positive test
Pr{D*|T} ~ Probability that a subject lacks the disease, given a positive test

Pr{D*|T*} ~ Probability that a subject lacks the disease, given a negative test
Pr{D|T*} ~ Probability that a subject has the disease, given a negative test

What we have is that

So, Pr{T|D*} = “False Positive” = .05, so then Pr{T*|D*} = .95
And Pr{T*|D} = “False Negative” = .01, so then Pr{T|D} = .99

What we seek is Pr{D|T}, the probability that disease is present given a positive screen.

Pr{D|T} =
Pr{DandT}/Pr{T} =
Pr{T|D}*Pr{D}/Pr{T} =
Pr{T|D}Pr{D}/(Pr{TandD} + Pr{TandD}) =
Pr{T|D}*Pr{D}/(Pr{T|D}Pr{D} + Pr{T|D}Pr{D})

Plugging in what we know:

Pr{D|T} =
(.99)Pr{D}/(.99Pr{D} + .05Pr{D}) =
(.99)Pr{D}/(.99Pr{D} + .05*(1-Pr{D})

In general, the answer depends on the true prevalence of disease, Pr{D}, which we have in this case as Pr{D} = 1/1000 = .001.

So then

Pr{D|T} =
(.99)(.001)/(.99(.001) + .05*(.999)) = .00099/(.00099+.04995), which is something like 1.9% …

Or not…

Long_Time_First_Time · September 17, 2006, 11:25pm

This is a very nice explanation as to why screening tests can be controversial, even though everyone agees that catching the disease you are screening against is a good thing.

For a screening test to be of much value, 99% sensitivity and specificity is pretty mucha given.

cerberus · September 18, 2006, 2:40am

Here’s the other side of interest.

What is Pr{D*|T*}, the probability that disease is absent given a negative screen?

Pr{D*|T*} =
Pr{DandT}/Pr{T*} =
Pr{T*|D*}Pr{D}/Pr{T*} =
Pr{T*|D*}Pr{D}/(Pr{TandD} + Pr{TandD)) =
Pr{T|D*}Pr{D}/(Pr{T*|D*}Pr{D} + Pr{T*|D}*Pr{D})

That is,

Pr{D*|T*} = Pr{T*|D*}Pr{D}/(Pr{T*|D*}Pr{D} + Pr{T*|D}*Pr{D})

We need the prevalence of disease (Pr{D}), Pr{T*|D*} = 1 - Pr{T|D*}, where Pr{T|D*} is the false-positive rate, and Pr{T*|D}, the false negative rate. Recall that Pr{D*) = 1 - Pr{D}.

Suppose that we are given that Pr{T|D*} = “False Positive” = .05, Pr{T*|D} = “False Negative” = .01 and Pr{D} = .001. Then Pr{T|D*} = “False Positive” = .05, so then Pr{T*|D*} = .95 and Pr{T*|D} = “False Negative” = .01, so then Pr{T|D} = .99

Plugging in what we know:

Pr{D*|T*} = (.95)(.999)/( (.95)(.999) + (.01)*(.001)) = .9999+. So in this case, approximately 99.99% of the negative tests actually indicate absence of disease.

DoctorJ · September 18, 2006, 3:24am

What they’re asking for here is called positive predictive value. While sensitivity and specificity (the chances of someone with and without the disease to have a positive or negative test, respectively) are inherent characteristics of a test, PPV and NPV depend on the prevalence of the disease. So an HIV test has the same sensitivity and specificity wherever you are, but its PPV and NPV change depending on the population.

Not really. A screening test needs to be positive in everyone with the disease, so ideally sensitivity is up around 100%. It would be nice if it were also 100% specific, but it doesn’t really have to be–that’s what the confirmatory tests are for.

For instance, lots of women without cervical cancer have abnormal Pap smears, and lots of women without breast cancer have abnormal mammograms. (In fact, neither of those is close to 99% sensitivity, either, but you get the point.)

Nava · September 18, 2006, 7:37am

Depends on how the test works, not just its probabilities. Many quick tests look for antibodies (which show that you’ve been in contact with the vector) and not for the vector that produces the antibodies.

40% of my 8th grade classmates tested positive for tuberculine; none had it, but the actual tuberculosis test isn’t the tuberculine… it’s the chest XRays.

Cicero · September 18, 2006, 9:59am

Hell, it took me a while to work out Jackalope. Cerberus really is a dog from Hades for trying to get me to work that out.

MysteryFellow63427 · September 22, 2006, 3:45pm

My intuitive response is 99%, which I of course know to be wrong.

davenportavenger · September 22, 2006, 3:48pm

Despite the fact that I research this kind of stuff for a living, I would for sure be freaking out and convinced I had it. That’s just my personality, which I know is not grounded in any actual reality.

cerberus · September 23, 2006, 3:36am

Pr{D} Pr{T|D*} Pr{T*|D} Pr{T|D} Pr{T*|D*} Pr{D|T} Pr{D*|T*}
.0000001 0.050 0.010 0.950 0.990 .0000019 1.000000
.0000010 0.050 0.010 0.950 0.990 .0000190 .9999999
.0000100 0.050 0.010 0.950 0.990 .0001900 .9999995
.0001000 0.050 0.010 0.950 0.990 .0018966 .9999949
.0010000 0.050 0.010 0.950 0.990 .0186640 .9999494
.0100000 0.050 0.010 0.950 0.990 .1610169 .9994901
.0000001 0.010 0.010 0.990 0.990 .0000099 1.000000
.0000010 0.010 0.010 0.990 0.990 .0000990 1.000000
.0000100 0.010 0.010 0.990 0.990 .0009890 .9999999
.0001000 0.010 0.010 0.990 0.990 .0098039 .9999990
.0010000 0.010 0.010 0.990 0.990 .0901639 .9999899
.0100000 0.010 0.010 0.990 0.990 .5000000 .9998980
.0000001 0.005 0.005 0.995 0.995 .0000199 1.000000
.0000010 0.005 0.005 0.995 0.995 .0001990 1.000000
.0000100 0.005 0.005 0.995 0.995 .0019861 .9999999
.0001000 0.005 0.005 0.995 0.995 .0195136 .9999995
.0010000 0.005 0.005 0.995 0.995 .1661102 .9999950
.0100000 0.005 0.005 0.995 0.995 .6677852 .9999492
.0000001 0.001 0.001 0.999 0.999 .0000999 1.000000
.0000010 0.001 0.001 0.999 0.999 .0009980 1.000000
.0000100 0.001 0.001 0.999 0.999 .0098913 1.000000
.0001000 0.001 0.001 0.999 0.999 .0908347 .9999999
.0010000 0.001 0.001 0.999 0.999 .5000000 .9999990
.0100000 0.001 0.001 0.999 0.999 .9098361 .9999899

Topic		Replies	Views
You selfish fuck! (Man with possible HIV cure in his veins refuses to undergo tests) The BBQ Pit	67	5173	November 25, 2005
Is the logic behind this HIV myth actually correct? Factual Questions	11	2863	December 24, 2012
Why would my own serological test be affected by total percentage of infected? The Quarantine Zone covid-quarantine	14	840	April 15, 2020
medical test result - can we find a better answer than M. vos Savant? Factual Questions	24	2630	February 10, 2010
What does 13% false positives on a medical test mean (not Covid-19 related)? Factual Questions	14	1045	February 11, 2021

You test positive: what are the chances that you actually have the disease?

Related topics