Predicting risk of cardiovascular death in the high-dimensional cohort follow-up data in the presence of competing events: a guide for building a modeling pipeline
Abstract
Predictive models driven by time-to-event data are commonly used in survival analysis. Owing to the availability of high-dimensional epidemiological cohorts, there is a need for models and learning algorithms capable of utilizing hundreds or even thousands of predictors. Advanced machine learning tools with embedded variable selection are being modified for use with time-to-event data in the presence of competing risks and censoring. In this study, random survival forests were compared to...