Why not alter the specificity?
Revised: 2001-10-28
Question: How is positive etiologic predictive value different from altering the specificity to predict post-test probability for disease? Could it be that altering the specificity makes Etiologic Predictive Value unnecessary?
Consider the following example: A conventional throat culture has been obtained during a summer period from 36 children of age 3-15 years having a sore throat possibly caused by group A beta-hemolytic streptococci (GABHS)1. Among those 36 cultures a throat culture indicated presence of the bacterium GABHS in 11 (31%).
However, some of the children might be ill due to a virus as well as carrying GABHS. To investigate this phenomenon we may collect data from healthy children. During the same period of time throat cultures were obtained from 290 healthy children of age 3-15 living in the same geographical area, and showing no signs indicating possible GABHS-caused tonsillopharyngitis1. Among those 290 cultures a throat culture indicated presence of GABHS in 37 (13%).
How well does the outcome of the throat culture predict whether the sore throat is caused by the bacterium GABHS? Let us compare three different approaches to this question:
If the sensitivity of throat culture to find presence of GABHS in the throat is assumed to be 90%2, 3, 4, 5, 6, 7 and the specificity to be 97%5, 6 then the predictive values in respect of presence of GABHS in the throat are estimated so (see also Appendix 1):
Table I - Calculating predictive
values - Marker |
|||
Gold standard |
|||
GABHS |
No GABHS |
||
Positive test (T+) |
10.26 |
0.74 |
11 |
Negative test (T-) |
1.14 |
23.86 |
25 |
|
11.40 |
24.60 |
36 |
PPV = 10.26/11 = 93.3% |
NPV = 23.86/25 = 95.4% |
||
(M denotes a Marker. In this case the marker is GABHS. One must also remember that the purpose of these calculations is to elucidate differences between different statistical approaches. In reality it would of course be difficult to have 10.26 individuals. If we round the numbers to avoid parts of individuals we get PPV 90.9% and NPV 96.0%). |
There are two alternatives to alter specificity to compensate for presence of carriers (we will later discuss if it is appropriate to do this).
a | One
alternative is to state that the specificity to predict absence of disease
is the same as the specificity to predict absence of the etiologic marker
(for example the bacteria) minus the probability for a healthy person to
have a positive test (and thus being a carrier). This approach can be
described as:
|
b | The other
alternative is to state that the specificity to predict absence of disease
is the same as 1 minus the probability for a healthy person to have a
positive test (and thus being a carrier). This approach can be described as:
|
Compensating for
carriers - lowering the specificity
In our example 30.6%
of the children with a sore throat had growth of GABHS. At the same time 12.8%
of healthy children had growth of GABHS. Let us compensate for carriers by
lowering the specificity (97%) with 12.8% (We will later discuss if it is
appropriate to do this).
The predictive
values in respect of presence of GABHS in the throat are estimated so
(see also Appendix
2):
Table II - Calculating predictive
values - Disease |
|||
Gold standard |
|||
Disease |
No Disease |
||
Positive test (T+) |
6.44 |
4.56 |
11 |
Negative test (T-) |
0.72 |
24.28 |
25 |
|
7.16 |
28.84 |
36 |
PPV = 6.44/11 = 58.5% |
NPV = 24.28/25 = 97.1% |
||
(M denotes a Marker. In this case the marker is GABHS.) |
Compensating for
carriers - specificity is 1 minus carriers
In our example 30.6%
of the children with a sore throat had growth of GABHS. At the same time 12.8%
of healthy children had growth of GABHS. Let us, as some authors do8, compensate for carriers by
letting the specificity to find absence of disease be 100% minus 12.8% (We will
later discuss if it is appropriate to do this).
The predictive
values in respect of presence of GABHS in the throat are estimated so
(see also Appendix 3):
Table III - Calculating predictive
values - Disease |
|||
Gold standard |
|||
Disease |
No Disease |
||
Positive test (T+) |
7.45 |
3.55 |
11 |
Negative test (T-) |
0.83 |
24.17 |
25 |
|
8.28 |
27.72 |
36 |
PPV = 7.45/11 = 67.7% |
NPV = 24.17/25 = 96.7% |
||
(M denotes a Marker. In this case the marker is GABHS.) |
As mentioned on other pages EPV will compensate for carriers. In the above mentioned example positive EPV is 67.9% and negative EPV is 96.7%.
As we can see (Table IV) the different approaches will yield quite different post-test probability for disease.
Table IV - Post-test probabilities for the disease a sore throat caused by GABHS if the test a throat culture is positive and post-test probabilities for absence of
this disease if
test is negative: |
||
Positive test | Negative test | |
|
93.3% | 95.4% |
|
58.5% 67.7% |
97.1% 96.7% |
|
67.9% | 96.7% |
It is easy to understand that ignoring carriers will yield a higher positive predictive value than if we consider carriers. However, it is interesting to see that compensating for carriers by altering the specificity might yield another estimate of predictive values than using EPV (Table IV). To understand this we must clarify the situation.
We can look at the relation between test outcome and presence or absence of the etiologic agent (Table V).
Table V - Relation between test outcome (T+, T-) and presence (M+) or absence (M-) of specified bacterium in patients having a sore throat | ||||
A sore throat caused by.... | ||||
....GABHS (D+) | ....other than GABHS (D-) | |||
GABHS present (M+) |
(GABHS not present) (M-) |
GABHS present (M+) |
GABHS not present (M-) |
|
Positive test (T+) |
True positive | ----- | True positive | False positive |
Negative test (T-) |
False negative | ----- | False negative | True negative |
Since the disease is caused by M then M must be present in the patient at some stage of the disease. Thus, the second column will have no patients (se also the page defining markers).
We can also look at the relation between test outcome and the presence or absence of disease (Table VI).
Table VI - Relation between test outcome (T+, T-) and presence (D+) or absence (D-) of specified disease in patients having a sore throat | ||||
A sore throat caused by.... | ||||
....GABHS (D+) | ....other than GABHS (D-) | |||
GABHS present | (GABHS not present) | GABHS present | GABHS not present | |
Positive test (T+) |
True positive | ----- | False positive* | False positive |
Negative test (T-) |
False negative | ----- | True negative** | True negative |
*The positive test does not represent true
disease
(D+) and is therefore considered to be
false positive. However, it indicates that the patient is a carrier
of GABHS. **Although the patient carry GABHS a negative test will correctly identify the patient as not having the disease, thus being (D-). |
From Table V and VI we can understand that the specificity for the test to correctly identify absence of the marker (GABHS) and the specificity for the test to correctly identify absence of the disease a sore throat caused by GABHS are not the same. The first may be denoted as P(T-|M-) and the second as P(T-|S+D-) where S+ denotes patients with a sore throat. The second specificity is only of interest in patients having a sore throat. The first specificity may be of interest in both patients having a sore throat and in healthy individuals.
Is it appropriate to
lower the specificity?
Using altered specificity, as in the
first alternative in the second approach,
will
mix the specificity for the test to correctly identify absence of the marker
(GABHS) and the specificity for the test to correctly identify absence of the
disease a sore throat caused by GABHS. This may be described as
where S+ denotes patients with a sore throat and S- denotes healthy individuals (not having a sore throat). Since the disease a sore throat caused by GABHS is impossible to have if the individual has no sore throat then P(T+|S-) is the same as P(T+|S-D-). Furthermore, if theta is 1 we will find that P(T+|S-D-) is the same as P(T+|S+D-). P(T+|S+D-) is the same as 1-P(T-|S+D-). Thus, the above expression may be rewritten as
and this expression can be simplified to
The conclusion is that this procedure is correct only if both theta and the specificity for the test to correctly identify absence of the marker (GABHS) are 1. In all other circumstances this procedure will result in a false estimate of the specificity to predict absence of disease. Thus, our estimate of 58.5% is not correct.
Is it appropriate to
assume specificity=1-carriers?
This
approach is similar to the above and may be described as
Simplifying this formula in a similar way as described above will result in that 1=1, which of course is true. The conclusion is that this procedure is correct if theta is 1. The specificity for the test to correctly identify absence of the marker (GABHS) is no longer needed. Thus, our estimate of 67.7% will be correct under the assumption that theta is 1. If we want to include the possibility of altering theta and have confidence intervals we will end up with EPV. Thus, this is an alternative deduction that also will lead to EPV.
The small difference seen in Table IV between this approach and EPV is due to that we have calculated our estimation in Table IV with a limited number of decimals. If we had used more decimals in every step (also when we calculate P(T+|S-)) then our estimate of the probability for the disease a sore throat caused by GABHS would be 67.8652% which is exactly the same as the prediction provided by positive EPV.
Other pages with subjects that might be of interest is:
(You can click on these links to quickly see the pages)
Table VII - Relation between T and M among patients |
|||
GABHS present (M+) |
(GABHS not present) (M-) |
||
Positive test (T+) |
0.9 x A |
0.03 x B |
11 |
Negative test (T-) |
0.1 x A |
0.97 x B |
25 |
|
A |
B |
36 |
We can see that
![]() |
Formula 1 |
and also that
![]() |
Formula 2 |
If we replace A in formula 2 with formula 1 we get
As we know B then the remaining cells in Table VII is easily calculated and we end up with Table I.
If we lower the specificity to predict absence of disease to adjust for carriers we may for example have a specificity of 97%-12.8% = 84.2%. Note that column four and five in Table VI has similar denotations in the rows for T+ and T-. Thus, these columns can be merged. Our example with children having a sore throat may then be described as (Table VIII)
Table VIII - Relation between test outcome (T+, T-) and presence (D+) or absence (D-) of specified disease in patients having a sore throat | |||||
A sore throat caused by.... | |||||
....GABHS (D+) | ....other than GABHS (D-) | ||||
GABHS present | (GABHS not present) | GABHS present | GABHS not present | ||
Positive test (T+) |
0.9 x A | ----- |
0.158 x B |
11 | |
Negative test (T-) |
0.1 x A |
----- | 0.842 x B | 25 | |
A | 0 | B | 36 |
In a similar way as presented in Appendix 1 we would find that
and thus B=28.84. As we know B then the remaining cells in Table VIII is easily calculated and we end up with Table II.
If we use the second alternative to lower the specificity to adjust for carriers we will in our example have a specificity of 100%-12.8% = 87.2%. As mentioned in Appendix 2 the fourth and fifth column will be merged. Our example with children having a sore throat may then be described as (Table IX)
Table IX - Relation between test outcome (T+, T-) and presence (D+) or absence (D-) of specified disease in patients having a sore throat | |||||
A sore throat caused by.... | |||||
....GABHS (D+) | ....other than GABHS (D-) | ||||
GABHS present | (GABHS not present) | GABHS present | GABHS not present | ||
Positive test (T+) |
0.9 x A | ----- | 0.128 x B | 11 | |
Negative test (T-) |
0.1 x A |
----- |
0.872 x B |
25 | |
A | 0 | B | 36 |
In a similar way as presented in Appendix 1 we would find that
and thus B=27.72. As we know B then the remaining cells in Table IX is easily calculated and we end up with Table III.
Ronny Gunnarsson MD
PhD
Department of Primary Health Care
Göteborg University
SWEDEN