SYNthetic Prediction Models (SYNPM) and Validation dataset

A list containing: (1) information on some (synthetic) existing prediction models (representing those available/published, which we want to validate in another independent dataset); and (2) a synthetic dataset that we wish to validate/update the models on.

Usage

SYNPM

Format

A list with six elements.

The first element is a data frame with the information about three existing binary (logistic regression) models for a binary outcome at one year
The second element is a data frame with the information about three existing time-to-event (Cox) models for the time-to-event outcome
The third, fourth and fifth elements are the cumulative baseline hazard information for the three time-to-event model
The sixth element is the (synthetic) validation dataset on which we want to validate the existing models. The dataset has 20000 rows and 8 variables:
Age

The age of the individual at baseline

SexM

The sex of the individual (1 = male; 0 = female)
Smoking_Status

Indicates whether the individual was or is a smoker (1=previous/ current smoker, 0=non-smoker)

Diabetes

Indicates whether the individual has diabetes (1=diabetic, 0=not diabetic)
Creatinine

The Creatinine value for the individual (mg/dL)
ETime

The time from baseline until either the event or censoring
Status

Indicator of whether the patient experienced the event or was censored at ETime
Y

Binary indicator of whether the individual experienced the event by 1 time-unit

Source

Simulated Data; see https://github.com/GlenMartin31/predRupdate