0% found this document useful (0 votes)

49 views

Hani Tamim, PHD Associate Professor Department of Internal Medicine American University of Beirut

Here are some general guidelines: - 0.90-1 = Excellent (highly accurate) - 0.80-0.90 = Good - 0.70-0.80 = Fair - 0.60-0.70 = Poor - 0.50-0.60 = Fail (no better than chance) So in summary, the closer the AUC is to 1, the more accurate the test. An AUC >0.7 is generally considered acceptable discrimination.

Uploaded by

mitochondri

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views

Hani Tamim, PHD Associate Professor Department of Internal Medicine American University of Beirut

Uploaded by

mitochondri

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 44

Hani Tamim, PhD

Associate Professor
Department of Internal Medicine
American University of Beirut
 A graphical plot which illustrates the performance of a
binary classifier system as its discrimination threshold
is varied (3).

 Effective method of evaluating the performance of

diagnostic tests (3).

 ROC analysis is related in a direct and natural way to

cost/benefit analysis of diagnostic decision making (3).
 First developed by electrical engineers and radar
engineers during World War II for detecting enemy
objects in battlefields (3).

 ROC analysis since then has been used in medicine,

radiology, biometrics, and other areas for many decades
and is increasingly used in machine learning and data
mining research (3).
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos
Neg
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos TP ☺

Neg
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos FP 

Neg
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos
Neg FN 
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos
Neg TN ☺
Suppose we have a test statistic for predicting the
presence or absence of disease.

Test criterion
True disease status
Pos Neg
Pos TP FP

Neg FN TN

P N P+N
Test criterion
True disease status
Pos Neg
Pos TP FP

Neg PN TN

P N P+N
Accuracy = Probability that the test yields a
correct result (2).
= (TP+TN) / (P+N)
Test criterion
True disease status
Pos Neg
Pos TP FP
Neg FN TN

P N P+N
Sensitivity = Probability that a true case will test
positive.
= TP / P
Also referred to as True Positive Rate (TPR)
or True Positive Fraction (TPF).
Test criterion
True disease status
Pos Neg
Pos TP FP
Neg FN TN

P N P+N
Specificity = Probability that a true negative will test
negative.
= TN / N
Also referred to as True Negative Rate (TNR)
or True Negative Fraction (TNF).
Test criterion
True disease status
Pos Neg
Pos TP FP
Neg FN TN

P N P+N
1- Specificity = Probability that a true negative will test
positive.
= FP / N
Also referred to as False Positive Rate (FPR)
or False Positive Fraction (FPF).
Test criterion
True disease status
Pos Neg
Pos TP FP
FN TN
Neg
P N P+N
Positive Predictive = Probability that a positive test
Value (PPV) will truly have disease.
= TP / (TP+FP)
Test criterion
True disease status
Pos Neg
Pos TP FP

Neg FN TN

P N P+N
Negative Predictive = Probability that a negative test
Value (NPV) will truly be disease free.
= TN / (TN+FN)
Test criterion
True disease status
Pos Neg
Pos 27 173 200
Neg 73 727 800

1000
Se = 27/100 = .27
Sp = 727/900 = .81
FPF = 1- Sp = .19
Acc = (27+727)/1000 = .75
PPV = 27/200 = .14
NPV = 727/800 = .91
 Of these properties, only Se and Sp (and hence FPR)
are considered invariant test characteristics.

 Accuracy, PPV, and NPV will vary according to the

underlying prevalence of disease.

 Se and Sp are thus “fundamental” test properties and

hence are the most useful measures for comparing
different test criteria, even though PPV and NPV are
probably the most clinically relevant properties.
 Now assume that our test statistic is no longer binary , but
takes on a series of values (for instance how many of five
distinct risk factors a person exhibits).

 Clinically we make a rule that says the test is positive if the

number of risk factors meets or exceeds some threshold (#RF
> x).

 Suppose our previous table resulted from using x = 4.

 Let’s see what happens as we vary x.

Test criterion
True disease status
Pos Neg
Pos 245
45 200
Neg 755
55 700
1000
Se = 27/100 = .45
Sp = 727/900 = .78
FPF = 1- Sp = .22
Acc = (27+727)/1000 = .75
PPV = 27/200 = .18
NPV = 727/800 = .93
Se , Sp , and interestingly both PPV and NPV .
‘‘-’’
‘‘+’’

without the disease

with the disease
‘‘-’’
‘‘+’’

without the disease

with the disease
Threshold TPR FPR As we relax our threshold for
6 0.00 0.00 defining “disease,” our true positive
rate(sensitivity) increases, but so
5 0.10 0.11 does the false positive rate
(FPR).
4 0.27 0.19

3 0.45 0.22
The ROC curve is a way to visually
2 0.73 0.27 display this information.
1 0.98 0.80

0 1.00 1.00
Threshold TPR FPR

6 0.00 0.00

5 0.10 0.11

4 0.27 0.19

3 0.45 0.22

2 0.73 0.27

1 0.98 0.80

0 1.00 1.00

What might an even better The diagonal line shows what we would expect
ROC curve look like? from simple guessing (i.e., pure chance).
Threshold TPR FPR

6 0.00 0.00

5 0.10 0.11

4 0.27 0.19

3 0.45 0.22

2 0.73 0.27

1 0.98 0.80

0 1.00 1.00

Note the immediate sharp rise in

sensitivity. Perfect accuracy is
represented by upper left corner
 The ROC curve allows us to see, in a simple visual
display, how sensitivity and specificity vary as our
threshold varies (2).

 The shape of the curve also gives us some visual clues

about the overall strength of association between the
underlying test statistic (in this case #RFs that are
present) and disease status (2).
 The ROC methodology
easily generalizes to test
statistics that are continuous
(such as lung function or a
blood gas). We simply fit a
smoothed ROC curve
through all observed data
points (2).
 The total area of the grid
represented by an ROC
curve is 1, since both
TPR and FPR range from 0
to 1.

 The portion of this total area

that falls below the ROC
curve is known as the area
under the curve,or AUC.
 The AUC serves as a quantitative summary of the
strength of association between the underlying test
statistic and disease status (2).

 An AUC of 1.0 would mean that the test statistic could

be used to perfectly discriminate between cases and
controls (2).

 An AUC of 0.5 (reflected by the diagonal 45° line) is

equivalent to simply guessing (2).
 The AUC can be shown to equal the Mann- Whitney U
statistic, or equivalently the Wilcoxon rank statistic, for
testing whether the test measure differs for individuals
with and without disease (2).

 It also equals the probability that the value of our test

measure would be higher for a randomly chosen case
than for a randomly chosen control (2).
AUC~ 0.540
FPR
ROC curve
AUC~0.95
FPR
ROC curve
 What defines a “good” AUC?
 Opinions vary
 Probably context specific
 What may be a good AUC for predicting COPD may be
very different than what is a good AUC for predicting
prostate cancer
 .90-1.0 = excellent
 .80-.90 = good
 .70-.80 = fair
 .60-.70 = poor
 .50-.60 = fail
Remember that <.50 is worse than guessing (4).
 .97-1.0 = excellent
 .92-.97 = very good
 .75-.92 = good
 .50-.75 = fair (5).
 Suppose we have two candidate test statistics to use to
create a binary decision rule. Can we use ROC curves
to choose an optimal one?
 We can formally compare AUCs for two competing test
statistics, but does this answer our question?

 AUC speaks to which measure, as a continuous

variable, best discriminates between cases and
controls?

 It does not tell us which specific cutpoint to use, or

even which test statistic will ultimately provide the
“best” cutpoint.
2 methods:
 The first method assumes that the best cut-off point for
balancing the sensitivity and specificity of a test is the point on
the curve closest to the (0, 1) point. In this method, optimal
sensitivity and specificity are defined as those yielding the
minimal value for (1 − sensitivity)2 + (1 − specificity)2. The
cut-off point corresponding to these sensitivity and specificity
values is the one closest to the (0, 1) point and is taken to be
the cut-off point that best differentiates between people with
disease and those without disease (1).
 The second method that may be used to determine the optimal
cut-off point for a test is the Youden index (J) . J is defined as
the maximum vertical distance between the ROC curve and
the diagonal or chance line and is calculated as J = maximum
(sensitivity + specificity −1). Using this measure, the cut-off
point on the ROC curve which corresponds to J, that is, at
which (sensitivity + specificity − 1) is maximized, is taken to
be the optimal cut-off point. An intuitive interpretation of J is
that it corresponds to the point on the curve farthest from
chance (1).
 Open the excel file.
 The first column is the patient coding number that you do not really need.
The second column gives the age as a continuous variable The third
column is a categorical variable about having thrombosis (1) and not
having thrombosis (0)
 Exercise:

1. Calculate the sensitivity and 1-specificity Considering that age is the test
variable and thrombosis is the state variable. Calculate the ROC curves for
and plot it. Comment on the obtained results.
 Open the excel file.
 The first column is the patient age, The second column gives the sensitivity
and the column 2 the 1-specificity.
 Exercise:

1. Calculate the youden index for each age and choose according to it the
optimal cut off point.
 1- Akobeng AK. Understanding diagnostic tests 3: Receiver operating
characteristic curves. Acta Paediatrica 2007, vol 96, issue 5.

 2- Kaiser permanente center for health research. Receiver operating

characteristic (ROC) curves: assessing the predictive properties of a test
statistic – Decision theory 2009.

 3- http://en.wikipedia.org/wiki/Receiver_operating_characteristic

 4- http://gim.unmc.edu/dxtests/roc3.htm

 5- www.childrens-mercy.org/stats/ask/roc.asp

Test For Certificate - Coursera - Passed
100% (2)
Test For Certificate - Coursera - Passed
1 page
Test1 For Certificate - Coursera
0% (2)
Test1 For Certificate - Coursera
1 page
PS4 PDF
No ratings yet
PS4 PDF
10 pages
Test For Certificate - Coursera1
0% (1)
Test For Certificate - Coursera1
1 page
Module 2 - Intelligent Systems
100% (1)
Module 2 - Intelligent Systems
7 pages
Validity of Screening Test
No ratings yet
Validity of Screening Test
12 pages
Bayes Theorem and The Paradox of Medical Tests
No ratings yet
Bayes Theorem and The Paradox of Medical Tests
10 pages
Screening: To Sort Out Apparently Well Persons Who Probably Have A Disease From Those Who Probably Do Not."
No ratings yet
Screening: To Sort Out Apparently Well Persons Who Probably Have A Disease From Those Who Probably Do Not."
24 pages
Epid 600 Class 11 Screening
100% (1)
Epid 600 Class 11 Screening
65 pages
10 Screening (2)
No ratings yet
10 Screening (2)
33 pages
Accuracy of Observations and Measurements JRJ
No ratings yet
Accuracy of Observations and Measurements JRJ
24 pages
09 Screening
No ratings yet
09 Screening
49 pages
3 Screening in Public Health
No ratings yet
3 Screening in Public Health
32 pages
Diagnostic Talk 2012
No ratings yet
Diagnostic Talk 2012
28 pages
Measures of Validity1
No ratings yet
Measures of Validity1
21 pages
07.Screening
No ratings yet
07.Screening
37 pages
Summary of Diagnostic Test Accuracy For BR PDF
No ratings yet
Summary of Diagnostic Test Accuracy For BR PDF
33 pages
EBM Diagnosis Slide
0% (1)
EBM Diagnosis Slide
28 pages
Diagnostic Tests
No ratings yet
Diagnostic Tests
5 pages
Part II - Dignostic Research
No ratings yet
Part II - Dignostic Research
107 pages
Decision Analysis: Matthew Scotch, PHD, MPH
No ratings yet
Decision Analysis: Matthew Scotch, PHD, MPH
29 pages
Screening
No ratings yet
Screening
20 pages
Chapter 4
No ratings yet
Chapter 4
3 pages
Wa0013.
No ratings yet
Wa0013.
9 pages
A03 Screening
No ratings yet
A03 Screening
37 pages
Group 6 Validation Studies
No ratings yet
Group 6 Validation Studies
3 pages
Screening
No ratings yet
Screening
55 pages
10. ADRPLEXUS ONE LAST PUSH - PSM - typed
No ratings yet
10. ADRPLEXUS ONE LAST PUSH - PSM - typed
6 pages
SESSION 6. Assessing Value of Tests
No ratings yet
SESSION 6. Assessing Value of Tests
15 pages
Interpreting Diagnostic Tests: Ian Mcdowell Department of Epidemiology & Community Medicine January 2010
No ratings yet
Interpreting Diagnostic Tests: Ian Mcdowell Department of Epidemiology & Community Medicine January 2010
30 pages
Measures of Diagnostic Accuracy: Basic Definitions: Ana-Maria Šimundić
No ratings yet
Measures of Diagnostic Accuracy: Basic Definitions: Ana-Maria Šimundić
9 pages
Epi Lec 5
No ratings yet
Epi Lec 5
40 pages
Statics MRCGP 2015
No ratings yet
Statics MRCGP 2015
22 pages
Reglas de Predicción Clínica
No ratings yet
Reglas de Predicción Clínica
7 pages
Screening Tests by Basta
No ratings yet
Screening Tests by Basta
14 pages
1603 - EvaluatingDiagnosis - PDF Version 1
No ratings yet
1603 - EvaluatingDiagnosis - PDF Version 1
5 pages
Chemical Pathology Workshop II - Diagnostic Theory in Chemical Pathology (2017.11.14)
100% (1)
Chemical Pathology Workshop II - Diagnostic Theory in Chemical Pathology (2017.11.14)
57 pages
EPI 1.04 Accuracy of Observations
No ratings yet
EPI 1.04 Accuracy of Observations
5 pages
Diagnosis Studies: Levels of Evidence
No ratings yet
Diagnosis Studies: Levels of Evidence
5 pages
MM20802 Notes
No ratings yet
MM20802 Notes
15 pages
FA 2022 Small Size Export
No ratings yet
FA 2022 Small Size Export
2 pages
Screening - Master - 2022
No ratings yet
Screening - Master - 2022
57 pages
Measures of Diagnostic Accuracy by Preetam
No ratings yet
Measures of Diagnostic Accuracy by Preetam
24 pages
STT034 - 7 Conditional Probability
No ratings yet
STT034 - 7 Conditional Probability
3 pages
The Concept of Sensitivity and Specificity in Relation To Two Types of Errors and Its Application in Medical Research
No ratings yet
The Concept of Sensitivity and Specificity in Relation To Two Types of Errors and Its Application in Medical Research
6 pages
PH1700 Session 1 - Stu - Bayes and Screening Tests
No ratings yet
PH1700 Session 1 - Stu - Bayes and Screening Tests
27 pages
Measuring The Accuracy of Diagnostic Systems
No ratings yet
Measuring The Accuracy of Diagnostic Systems
9 pages
Test Statistics: Condition Present Absent
No ratings yet
Test Statistics: Condition Present Absent
3 pages
Test Statistics: Condition Present Absent
No ratings yet
Test Statistics: Condition Present Absent
3 pages
Screening For Disease: Dr.K.Arulanandem Lecturer/Coordinator
No ratings yet
Screening For Disease: Dr.K.Arulanandem Lecturer/Coordinator
40 pages
Idris M. Usman ( FMCB Presentation)
No ratings yet
Idris M. Usman ( FMCB Presentation)
26 pages
Sensitivity and Specificity
No ratings yet
Sensitivity and Specificity
3 pages
Clinical Approach To Patients
No ratings yet
Clinical Approach To Patients
22 pages
Sensitivity Vs Specificity
No ratings yet
Sensitivity Vs Specificity
16 pages
Disease Screening
No ratings yet
Disease Screening
41 pages
accuracy of test
No ratings yet
accuracy of test
26 pages
Receiver Operating Characteristic
No ratings yet
Receiver Operating Characteristic
19 pages
Screening of Diseases
100% (1)
Screening of Diseases
50 pages
Westgard1983 - Power Curves
No ratings yet
Westgard1983 - Power Curves
8 pages
Diagnostic Test ARt - Magister
No ratings yet
Diagnostic Test ARt - Magister
58 pages
Screening test
No ratings yet
Screening test
37 pages
Binary Diagnostic Tests - Single Sample
No ratings yet
Binary Diagnostic Tests - Single Sample
6 pages
Screening For Disease PPT Hadeel
No ratings yet
Screening For Disease PPT Hadeel
10 pages
Screening - DR Heba Mahmoud
No ratings yet
Screening - DR Heba Mahmoud
45 pages
Complementary and Alternative Medical Lab Testing Part 17: Oncology
From Everand
Complementary and Alternative Medical Lab Testing Part 17: Oncology
Ronald Steriti
No ratings yet
Test For Certificate - Coursera - Answers
0% (1)
Test For Certificate - Coursera - Answers
1 page
Letter of Recommendation
No ratings yet
Letter of Recommendation
1 page
Practice Quiz 1-8
No ratings yet
Practice Quiz 1-8
1 page
Process of Strategic Management-2
No ratings yet
Process of Strategic Management-2
12 pages
Chap7 KNN
No ratings yet
Chap7 KNN
15 pages
S&DM - Assignment - Group 9 - Section B
No ratings yet
S&DM - Assignment - Group 9 - Section B
2 pages
Steps To Be Followed For Subscribing To Mega Sports Complex Facilities
No ratings yet
Steps To Be Followed For Subscribing To Mega Sports Complex Facilities
5 pages
Ops Startegy
No ratings yet
Ops Startegy
2 pages
Course Oultine - International Finance
No ratings yet
Course Oultine - International Finance
5 pages
20220910171232HCTAN008C8-Topic 8-Risk Return
No ratings yet
20220910171232HCTAN008C8-Topic 8-Risk Return
105 pages
PID-SMC Controller For A 2-DOF Planar Robot: February 2019
No ratings yet
PID-SMC Controller For A 2-DOF Planar Robot: February 2019
6 pages
Dawak 2024
No ratings yet
Dawak 2024
15 pages
Analytics For Customer Engagement
No ratings yet
Analytics For Customer Engagement
16 pages
Mws Gen Pde TXT Parabolic
No ratings yet
Mws Gen Pde TXT Parabolic
27 pages
Assignment - 10: Shyam Shankar H R EE15B127 November 9, 2017
No ratings yet
Assignment - 10: Shyam Shankar H R EE15B127 November 9, 2017
15 pages
Simulation Language For Alternative Modeling
No ratings yet
Simulation Language For Alternative Modeling
6 pages
Developmental Dyslexia Detection Using Machine Lea
No ratings yet
Developmental Dyslexia Detection Using Machine Lea
7 pages
Draw A Net Work Diagram of Activities For Project
No ratings yet
Draw A Net Work Diagram of Activities For Project
2 pages
OCT-23 - Sathya QSTN Paper
No ratings yet
OCT-23 - Sathya QSTN Paper
4 pages
Business Math
No ratings yet
Business Math
21 pages
AMLFv1 EN PDF M02 SG
No ratings yet
AMLFv1 EN PDF M02 SG
55 pages
Lecture 1
No ratings yet
Lecture 1
35 pages
Numerical Methods Versus Bjerksund and Stensland Approximations For American Options Pricing
No ratings yet
Numerical Methods Versus Bjerksund and Stensland Approximations For American Options Pricing
9 pages
Final Exam-14122020
No ratings yet
Final Exam-14122020
2 pages
Tutorial Sheet 1
No ratings yet
Tutorial Sheet 1
1 page
CMP 409 2021 - 2022
No ratings yet
CMP 409 2021 - 2022
2 pages
Semester Project Controls System Lab
No ratings yet
Semester Project Controls System Lab
6 pages
Index: Practical Genetic Algorithms, Second Edition, by Randy L. Haupt and Sue Ellen Haupt
No ratings yet
Index: Practical Genetic Algorithms, Second Edition, by Randy L. Haupt and Sue Ellen Haupt
3 pages
Statistics and Probability
No ratings yet
Statistics and Probability
15 pages
Ahp Mathematical Models
No ratings yet
Ahp Mathematical Models
10 pages
Karnaugh Maps
No ratings yet
Karnaugh Maps
10 pages
Crashing of Network: Presented By
100% (2)
Crashing of Network: Presented By
16 pages
Module 5: Design of Sampled Data Control Systems: Lecture Note 7
No ratings yet
Module 5: Design of Sampled Data Control Systems: Lecture Note 7
7 pages
A-3 Ai Print
No ratings yet
A-3 Ai Print
6 pages
Power Method For EV
No ratings yet
Power Method For EV
37 pages
Data Mining Essen, Als 2: Data Mining in Prac, Ce, With Python
No ratings yet
Data Mining Essen, Als 2: Data Mining in Prac, Ce, With Python
31 pages
Graph Theory and Applications: Paul Van Dooren Université Catholique de Louvain Louvain-la-Neuve, Belgium
No ratings yet
Graph Theory and Applications: Paul Van Dooren Université Catholique de Louvain Louvain-la-Neuve, Belgium
110 pages