Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 7-Day Trial for You or Your Team.

Learn More →

DeepDyve requires Javascript to function. Please enable Javascript on your browser to continue.

Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve

Pepe, Margaret Sullivan; Cai, Tianxi; Longton, Gary 2006-03-01 00:00:00 Summary No single biomarker for cancer is considered adequately sensitive and specific for cancer screening. It is expected that the results of multiple markers will need to be combined in order to yield adequately accurate classification. Typically, the objective function that is optimized for combining markers is the likelihood function. In this article, we consider an alternative objective function—the area under the empirical receiver operating characteristic curve (AUC). We note that it yields consistent estimates of parameters in a generalized linear model for the risk score but does not require specifying the link function. Like logistic regression, it yields consistent estimation with case–control or cohort data. Simulation studies suggest that AUC‐based classification scores have performance comparable with logistic likelihood‐based scores when the logistic regression model holds. Analysis of data from a proteomics biomarker study shows that performance can be far superior to logistic regression derived scores when the logistic regression model does not hold. Model fitting by maximizing the AUC rather than the likelihood should be considered when the goal is to derive a marker combination score for classification or prediction. http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Biometrics Oxford University Press http://www.deepdyve.com/lp/oxford-university-press/combining-predictors-for-classification-using-the-area-under-the-rFoLLcsfpZ

Combining Predictors for Classification Using the Area under the Receiver Operating Characteristic Curve

Pepe, Margaret Sullivan; Cai, Tianxi; Longton, Gary

Biometrics , Volume 62 (1) – Mar 1, 2006

Read Article

Download PDF

Share Full Text for Free

9 pages

Loading next page...

References (24)

M. Pepe, H. Janes, G. Longton, W. Leisenring, P. Newcomb (2004)
Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker.
American journal of epidemiology, 159 9
(1999)
Early detection research
R. Prentice, R. Pyke (1979)
Logistic disease incidence models and case-control studies
Biometrika, 66
M. Pepe, Ruth Etzioni, Z. Feng, J. Potter, M. Thompson, M. Thornquist, M. Winget, Y. Yasui (2001)
Phases of biomarker development for early detection of cancer.
Journal of the National Cancer Institute, 93 14
Stefano Parodi, Alberto Izzotti, Marco Muselli (2005)
Re: The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer.
Journal of the National Cancer Institute, 97 3
P. Qiu (2005)
The Statistical Evaluation of Medical Tests for Classification and Prediction
Journal of the American Statistical Association, 100
(2004)
Revised March 2005
J. Copas, P. Corbett (2002)
Overestimation of the receiver operating characteristic curve for logistic regression
Biometrika, 89
Srivastava Srivastava (1999)
Early detection research network
Disease Markers, 15
Xiao-Hua Zhou, N. Obuchowski, D. McClish (2002)
Statistical Methods in Diagnostic Medicine
T. Hastie, R. Tibshirani, J. Friedman (2001)
The Elements of Statistical Learning
Y. Yasui, D. McLerran, B. Adam, M. Winget, M. Thornquist, Ziding Feng (2003)
An Automated Peak Identification/Calibration Procedure for High-Dimensional Protein Measures From Mass Spectrometers
Journal of Biomedicine and Biotechnology, 2003
S. Srivastava (2002)
The Early Detection Research Network Second Annual Scientific Workshop 14–16 October 2001, Seattle, Washington, USA
Disease Markers, 18
M. Pepe, M. Thompson (2000)
Combining diagnostic test results to increase accuracy.
Biostatistics, 1 2
J. Neyman, E. Pearson (1933)
On the Problem of the Most Efficient Tests of Statistical Hypotheses
Philosophical Transactions of the Royal Society A, 231
S. Baker (2000)
Identifying Combinations of Cancer Markers for Further Study as Triggers of Early Intervention
Biometrics, 56
M. McIntosh, M. Pepe (2002)
Combining Several Screening Tests: Optimality of the Risk Score
Biometrics, 58
R. Sherman (1993)
The Limiting Distribution of the Maximum Rank Correlation Estimator
Econometrica, 61
Baker (2003)
The central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer
Journal of the National Cancer Institute, 95
M. Pepe, H. Janes, G. Longton, W. Leisenring, P. Newcomb (2004)
Limitations of the Odds Ratio in Gauging the Performance of a Diagnostic or Prognostic Marker
Ker-Chau Li, N. Duan (1989)
Regression Analysis Under Link Violation
Annals of Statistics, 17
S. Eguchi, J. Copas (2002)
A class of logistic‐type discriminant functions
Biometrika, 89
Aaron Han (1987)
Non-parametric analysis of a generalized regression model: the maximum rank correlation estimator
Journal of Econometrics, 35
D. Green, J. Swets (1966)
Signal detection theory and psychophysics

Publisher: Oxford University Press
Copyright: Copyright © 2006 Wiley Subscription Services, Inc., A Wiley Company
ISSN: 0006-341X
eISSN: 1541-0420
DOI: 10.1111/j.1541-0420.2005.00420.x
pmid: 16542249
Publisher site: See Article on Publisher Site

There are no references for this article.