Sensitivity, Specificity, PPV, and NPV

Understanding Diagnostic Test Performance

Download PDF

Sensitivity, Specificity, PPV, and NPV — and When They Lie

Understanding Diagnostic Test Performance

Every diagnostic test has a fixed sensitivity and specificity — properties of the test itself. But PPV and NPV are not fixed. They shift with every change in disease prevalence. Understanding why is the difference between ordering tests wisely and ordering them reflexively.

Part 1 — The 2×2 Table: What Every Test Produces

Order any diagnostic test and you get one of four outcomes. Every measure of test performance derives from these four cells.

  Disease Present Disease Absent
Test Positive True Positive (TP) False Positive (FP)
Test Negative False Negative (FN) True Negative (TN)

TP — test positive, disease present. You caught it.   FP — test positive, disease absent. False alarm.   FN — test negative, disease present. You missed it.   TN — test negative, disease absent. Correctly reassured.

Part 2 — The Four Metrics: Formulas and Plain English

Metric Formula Plain English
SensitivityTP / (TP + FN)Of all patients WITH the disease, how many did the test correctly identify?
SpecificityTN / (TN + FP)Of all patients WITHOUT the disease, how many did the test correctly clear?
PPVTP / (TP + FP)If the test is positive, what is the probability the patient actually has the disease?
NPVTN / (TN + FN)If the test is negative, what is the probability the patient is truly disease-free?
Accuracy(TP + TN) / TotalOf all patients tested, how many were classified correctly?
The mnemonic — SnNout / SpPin:

SnNout: High Snsitivity → Negative result rules OUT the diagnosis.
SpPin: High Specificity → Positive result rules IN the diagnosis.

A highly sensitive test misses very few cases — so a negative is reassuring.
A highly specific test rarely cries wolf — so a positive is meaningful.

Part 3 — Why PPV and NPV Depend on Prevalence

Sensitivity and specificity are fixed properties of the test. PPV and NPV are not — they depend on how common the disease is in the population you are testing.

Example: a test with 99% sensitivity and 95% specificity applied to two populations.

High Prevalence (50%) Low Prevalence (1%)
Population tested1,000 patients1,000 patients
Disease present50010
Disease absent500990
True positives495 (99% of 500)9.9 (99% of 10)
False positives25 (5% of 500)49.5 (5% of 990)
PPV495 / 520 = 95%9.9 / 59.4 = 17%
Cardiology example — troponin in the ED:

A high-sensitivity troponin is ~99% sensitive. In a chest pain unit where prevalence of true NSTEMI is 15–20%, a positive result is highly meaningful. In a low-risk outpatient with atypical chest pain (prevalence <1%), the same positive result is more likely a false alarm — demand, myocarditis, PE, CKD, sepsis — than true ACS.

Same test. Same result. Different clinical meaning. Pretest probability changes everything.

Stress testing follows the same logic. Exercise stress ECG has ~68% sensitivity and ~77% specificity for obstructive CAD. Order it in a 55-year-old male with typical exertional chest pain (pre-test probability ~65%) and a positive result is actionable. Order it in a 35-year-old woman with atypical chest pain (pre-test probability ~5%) and a positive result is more likely a false positive than true disease. The ACC/AHA appropriateness criteria exist precisely because of this math.

Part 4 — Likelihood Ratios: The Clinically Superior Tool

Likelihood ratios (LR) are better than PPV/NPV because they remain stable across populations with different prevalence. Use them to move from pretest to post-test probability.

Measure Formula Interpretation
LR+Sensitivity / (1 − Specificity)How much more likely is a positive result in someone WITH the disease vs. without it?
LR−(1 − Sensitivity) / SpecificityHow much more likely is a negative result in someone WITHOUT the disease vs. with it?
LR ValueClinical Meaning
LR+ > 10Large shift toward disease. Strong rule-in.
LR+ 5–10Moderate shift. Useful in intermediate pretest probability.
LR+ 2–5Small shift. Limited diagnostic value alone.
LR− < 0.1Large shift away from disease. Strong rule-out.
LR− 0.1–0.2Moderate shift. Useful but not definitive.

Worked example: High-sensitivity troponin with 99% sensitivity and 95% specificity. LR+ = 0.99 / (1 − 0.95) = 19.8 — strong rule-in when positive. LR− = (1 − 0.99) / 0.95 = 0.01 — near-definitive rule-out when negative.

Clinical Rule

High sensitivity rules OUT (SnNout). High specificity rules IN (SpPin). But PPV and NPV depend on prevalence — a “99% sensitive” test still produces mostly false positives when the disease is rare. Know the pretest probability before you order.

This is one of 13 free reference sheets from the APP Cardiology Academy — no account required.

Browse all resources See the full curriculum