Access the full text.
Sign up today, get DeepDyve free for 14 days.
J. Hanley, B. McNeil (1982)
The meaning and use of the area under a receiver operating characteristic (ROC) curve.Radiology, 143 1
M. Pepe, H. Janes, G. Longton, W. Leisenring, P. Newcomb (2004)
Limitations of the odds ratio in gauging the performance of a diagnostic, prognostic, or screening marker.American journal of epidemiology, 159 9
M. Vijver, Yudong He, L. Veer, H. Dai, A. Hart, D. Voskuil, G. Schreiber, J. Peterse, C. Roberts, M. Marton, M. Parrish, D. Atsma, A. Witteveen, A. Glas, L. Delahaye, Tony Velde, H. Bartelink, S. Rodenhuis, E. Rutgers, S. Friend, René Bernards (2002)
A gene-expression signature as a predictor of survival in breast cancer.The New England journal of medicine, 347 25
C. Wu (2008)
JACKKNIFE , BOOTSTRAP AND OTHER RESAMPLING METHODS IN REGRESSION ANALYSIS ' BY
Pepe Pepe, Feng Feng, Gu Gu (2008)
Comments on ‘Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond’ by Pencina et al.Statistics in Medicine, 27
J. Ware (2006)
The limitations of risk factors as prognostic tools.The New England journal of medicine, 355 25
F Harrell, Kerry Lee, D. Mark (2005)
Prognostic/Clinical Prediction Models: Multivariable Prognostic Models: Issues in Developing Models, Evaluating Assumptions and Adequacy, and Measuring and Reducing Errors
L. George (2003)
The Statistical Analysis of Failure Time DataTechnometrics, 45
J. Yukich (1992)
Weak Convergence of Smoothed Empirical ProcessesScandinavian Journal of Statistics, 19
D. Bamber (1975)
The area above the ordinal dominance graph and the area below the receiver operating characteristic graphJournal of Mathematical Psychology, 12
Kathleen Kerr, R. McClelland, E. Brown, T. Lumley (2011)
Evaluating the incremental value of new biomarkers with integrated discrimination improvement.American journal of epidemiology, 174 3
M. Pencina, R. Agostino, R. Agostino, R. Vasan (2008)
Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyondStatistics in Medicine, 27
H. Uno, T. Cai, M. Pencina, R. D’Agostino, Lee-Jen Wei (2011)
On the C‐statistics for evaluating overall adequacy of risk prediction procedures with censored survival dataStatistics in Medicine, 30
H. Uno, T. Cai, L. Tian, Lee-Jen Wei (2011)
Graphical Procedures for Evaluating Overall and Subject‐Specific Incremental Values from New Predictors with Censored Event Time DataBiometrics, 67
Patrick Heagerty, Yingye Zheng (2005)
Survival Model Predictive Accuracy and ROC CurvesBiometrics, 61
M. Pencina, R. D'Agostino (2004)
Overall C as a measure of discrimination in survival analysis: model specific population value and confidence interval estimationStatistics in Medicine, 23
O. Borgan (2003)
The Statistical Analysis of Failure Time Data (2nd Ed.). John D. Kalbfleisch and Ross L. PrenticeJournal of the American Statistical Association, 98
J. Kalbfleisch, R. Prentice (2002)
The Statistical Analysis of Failure Time Data: Kalbfleisch/The Statistical
S. Cheng, Lee-Jen Wei, Z. Ying (1995)
Analysis of transformation models with censored dataBiometrika, 82
Harrell Harrell, Lee Lee, Mark Mark (1996)
Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errorsStatistics in Medicine, 15
(2000)
Asymptotic Statistics
(2005)
Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival.Proceedings of the National Academy of Sciences of the United States of America, 102 10
M. Pencina, R. D'Agostino, R. D'Agostino, R. Vasan (2008)
Comments on ‘Integrated discrimination and net reclassification improvements—Practical advice’Statistics in Medicine, 27
H. Wynn, S. Ogbonmwan (1986)
Discussion: Jackknife, Bootstrap and Other Resampling Methods in Regression AnalysisAnnals of Statistics, 14
T. Cai, S. Cheng (2007)
Robust combination of multiple diagnostic tests for classifying censored event times.Biostatistics, 9 2
N. Cook, J. Buring, P. Ridker (2006)
The Effect of Including C-Reactive Protein in Cardiovascular Risk Prediction Models for WomenAnnals of Internal Medicine, 145
M. Pencina, R. D'Agostino, E. Steyerberg (2011)
Extensions of net reclassification improvement calculations to measure usefulness of new biomarkersStatistics in Medicine, 30
D., R., Cox
Regression Models and Life-Tables
N. Hjort (1992)
On inference in parametric survival data modelsInternational Statistical Review, 60
Olga Demler, M. Pencina, R. D'Agostino (2012)
Misuse of DeLong test to compare AUCs for nested modelsStatistics in Medicine, 31
L. Veer, H. Dai, M. Vijver, Yudong He, A. Hart, M. Mao, H. Peterse, K. Kooy, M. Marton, A. Witteveen, G. Schreiber, R. Kerkhoven, C. Roberts, P. Linsley, R. Bernards, S. Friend (2002)
Gene expression profiling predicts clinical outcome of breast cancerNature, 415
D. Pollard (1990)
Empirical Processes: Theory and Applications
L. Chambless, C. Cummiskey, G. Cui (2011)
Several methods to assess improvement in risk prediction models: Extension to survival analysisStatistics in Medicine, 30
T. Cai, Lu Tian, H. Uno, S. Solomon, Lee-Jen Wei (2010)
Calibrating parametric subject-specific risk estimation.Biometrika, 97 2
M. Pepe, Ziding Feng, J. Gu (2008)
Comments on ‘Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond’ by M. J. Pencina et al., Statistics in Medicine (DOI: 10.1002/sim.2929)Statistics in Medicine, 27
E. Mammen (1997)
The Bootstrap and Edgeworth Expansion
N. Cook, P. Ridker (2009)
Advances in Measuring the Effect of Individual Predictors of Cardiovascular Risk: The Role of Reclassification MeasuresAnnals of Internal Medicine, 150
P. Greenland, P. O'Malley (2005)
When is a new prediction marker useful? A consideration of lipoprotein-associated phospholipase A2 and C-reactive protein for stroke risk.Archives of internal medicine, 165 21
(1997)
Measures for evaluating model performance
Risk prediction procedures can be quite useful for the patient's treatment selection, prevention strategy, or disease management in evidence‐based medicine. Often, potentially important new predictors are available in addition to the conventional markers. The question is how to quantify the improvement from the new markers for prediction of the patient's risk in order to aid cost–benefit decisions. The standard method, using the area under the receiver operating characteristic curve, to measure the added value may not be sensitive enough to capture incremental improvements from the new markers. Recently, some novel alternatives to area under the receiver operating characteristic curve, such as integrated discrimination improvement and net reclassification improvement, were proposed. In this paper, we consider a class of measures for evaluating the incremental values of new markers, which includes the preceding two as special cases. We present a unified procedure for making inferences about measures in the class with censored event time data. The large sample properties of our procedures are theoretically justified. We illustrate the new proposal with data from a cancer study to evaluate a new gene score for prediction of the patient's survival. Copyright © 2012 John Wiley & Sons, Ltd.
Statistics in Medicine – Wiley
Published: Jun 30, 2013
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.