A simulated dataset containing demographic, clinical, and outcome variables for 500 individuals. Designed for demonstrating table creation and diagnostic testing analysis.
Format
A data frame with 500 rows and 19 variables:
- id
Unique patient identifier
- age
Age in years (Numeric)
- sex
Biological sex (Female, Male)
- bmi
Body Mass Index in kg/m² (Numeric, contains NAs)
- smoking
Smoking status (Never, Former, Current)
- exercise
Physical activity level (Low, Moderate, High)
- education
Educational attainment (High School, Some College, College+)
- income
Annual household income (<30k, 30-60k, 60k+)
- disease
Disease status - primary outcome (No, Yes)
- rapid_test
Result of rapid diagnostic test (Negative, Positive)
- lab_confirmed
Laboratory confirmation - gold standard (No, Yes)
- comorbidity_score
Score 0-5 based on medical history
- outcome1
Count of primary care visits in past year
- outcome2
Count of specialist visits in past year
- outcome3
Count of emergency department visits in past year
- hospitalized
Hospitalized in past year (No, Yes)
- systolic_bp
Systolic blood pressure in mmHg
- cholesterol
Total cholesterol in mg/dL
- region
Geographic region (North, South, East, West)
