Introduction
Methods
Data collection
Measures
Predictors
Outcome
Multiple imputation
Model development
Logistic regression
eXtreme gradient-boosting (XGBoost)
Support vector machine (SVM)
Artificial neural network (ANN)
Statistical analysis and calibration
Internal–external cross-validation
Results
Patient characteristics
Variable | Colorectal cancer—PROCORE (N = 316; N%) | Gynaecological cancer (N = 101; N%) | Prostate cancer (N = 586; N%) | Colorectal cancer—PLCRC (N = 1941; N%) | Bladder cancer (N = 216; N%) |
---|---|---|---|---|---|
Age at questionnaire, median (IQR) | 67.00 (14.00) | 65.00 (13.00) | 69.00 (9.00) | 66.06 (13.06) | 70.00 (11.00) |
Education level | |||||
Lower/Primary | 29 (9.2) | 9 (8.9) | 28 (4.8) | 89 (4.6) | 19 (8.8) |
Secondary (High school) | 70 (22.2) | 81 (80.2) | 192 (32.8) | 628 (32.4) | 42 (19.4) |
Secondary (Vocational) | 129 (40.8) | 11 (10.9) | 156 (26.6) | 432 (22.3) | 90 (41.7) |
Vocational/University | 88 (27.8) | 0 (0.0) | 210 (35.8) | 792 (40.8) | 65 (30.1) |
Marital status, N (%) | |||||
Married/ Cohabitating | 264 (83.5) | 84 (83.2) | 525 (89.6) | 1,609 (82.9) | 176 (81.5) |
Single/Divorced | 52 (16.5) | 17 (16.8) | 61 (10.4) | 332 (17.1) | 40 (18.5) |
Smoking history, N (%) | |||||
Never | 97 (30.7) | 53 (52.5) | 293 (50.0) | 657 (33.8) | 37 (17.1) |
Previous smoker | 185 (58.5) | 36 (35.6) | 255 (43.5) | 1186 (61.1) | 145 (67.1) |
Current smoker | 34 (10.8) | 12 (11.9) | 38 (6.5) | 98 (5.0) | 34 (15.7) |
Alcohol use, N (%) | |||||
Never or previous | 55 (17.4) | 46 (45.5) | 55 (9.4) | 1574 (29.6) | 23 (10.6) |
Current | 261 (82.6) | 55 (54.5) | 531 (90.6) | 1367 (70.4) | 193 (89.4) |
Sex | |||||
Male | 196 (62.0) | 0 (0.0) | 586 (100.0) | 1234 (63.6) | 173 (80.1) |
Female | 120 (38.0) | 101 (100.0) | 0 (0.0) | 707 (36.4) | 43 (19.9) |
BMI, median (IQR) | 26.37 (4.58) | 26.84 (7.41) | 26.25 (2.36) | 25.83 (4.67) | 26.11 (4.74) |
Comorbidities, N (%) | |||||
Zero | 87 (27.5) | 39 (38.6) | 167 (28.5) | 722 (37.2) | 58 (26.9) |
One | 104 (32.9) | 23 (22.8) | 185 (31.6) | 589 (30.3) | 59 (27.3) |
Two or more | 125 (39.6) | 39 (38.6) | 234 (39.9) | 630 (32.5) | 99 (45.8) |
Cancer type, N (%) | |||||
Ovarian Cancer | 63 (62.4) | ||||
Endometrial Cancer | 38 (37.6) | ||||
Colorectal Cancer | 316 (100.0) | 1941 (100.0) | |||
Bladder Cancer | 216 (100.0) | ||||
Prostate Cancer | 586 (100.0) | ||||
Cancer stage2, N (%) | |||||
I | 109 (34.5) | 73 (72.3) | 305 (52.0) | 523 (26.9) | 153 (70.8) |
II | 89 (28.2) | 7 (6.9) | 158 (27.0) | 518 (26.7) | 43 (19.9) |
III | 118 (37.3) | 21 (20.8) | 123 (21.0) | 900 (46.4) | 20 (9.3) |
Time since diagnosis (in days), median (IQR) | 20.00 (12.00) | 73.05 (36.53) | 40.00 (11.00) | 32.00 (23.00) | 42.03 (16.01) |
Receiving treatment at Tbaseline questionnaire, N (%) | 12 (3.8) | 42 (41.6) | 60 (10.8) | 561 (28.9) | 131 (60.6) |
Underwent systemic treatment, N (%) | 94 (29.7) | 31 (30.7) | 1 (0.2) | 247 (12.7) | 14 (6.5) |
Underwent radiotherapy, N (%) | 41 (13.0) | 21 (20.8) | 205 (35.0) | 414 (21.3) | 22 (10.2) |
Underwent surgery, N (%) | 312 (98.7) | 99 (98.0) | 214 (36.5) | 1,855 (95.6) | 47 (21.8) |
Quality of life, median (IQR) | 83.33 (16.67) | 75.00 (25.00) | 83.01 (15.63) | 83.33 (16.67) | 83.33 (16.67) |
Physical functioning (median, IQR) | 100.00 (13.33) | 86.67 (20.00) | 100.00 (13.33) | 100.00 (13.33) | 93.33 (26.67) |
Role functioning, median (IQR) | 100.00 (16.67) | 66.66 (33.33) | 100.00 (16.67) | 100.00 (33.33) | 100.00 (33.33) |
Emotional functioning, median (IQR) | 83.33 (25.00) | 83.33 (33.33) | 91.67 (25.00) | 83.33 (25.00) | 83.33 (33.33) |
Cognitive functioning, median (IQR) | 100.00 (16.67) | 83.33 (33.33) | 100.00 (16.67) | 100.00 (16.67) | 100.00 (16.67) |
Social functioning, median (IQR) | 100.00 (16.67) | 83.33 (33.33) | 100.00 (0.00) | 100.00 (33.33) | 100.00 (16.67) |
Fatigue, median (IQR) | 11.11 (33.33) | 33.33 (33.33) | 0.00 (22.22) | 22.22 (33.33) | 22.22 (33.33) |
Nausea/vomiting, median (IQR) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) |
Pain, median (IQR) | 0.00 (116.67) | 16.67 (33.33) | 0.00 (16.67) | 0.00 (33.33) | 16.67 (33.33) |
Dyspnoea,, median (IQR) | 0.00 (0.00) | 0.00 (33.33) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) |
Sleeping disturbance,, median (IQR) | 20.50 (33.33) | 33.33 (66.67) | 0.00 (33.33) | 0.00 (33.33) | 0.00 (33.33) |
Appetite loss, median (IQR) | 0.00 (0.00) | 0.00 (33.33) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) |
Constipation, median (IQR) | 0.00 (0.00) | 0.00 (33.33) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) |
Diarrhoea, median (IQR) | 0.00 (33.33) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (33.33) | 0.00 (0.00) |
Financial difficulties, median (IQR) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) | 0.00 (0.00) |
Has clinically-relevant fatigue (Tendpoint), N (%) | 36 (11.4) | 26 (25.7) | 53 (9.0) | 225 (11.6) | 37 (17.1) |
Predictor variable at Tbaseline | Clinically-relevant fatigue at Tendpoint (N = 377) | Not clinically-relevant fatigue at Tendpoint (N = 2,783) | P-value |
---|---|---|---|
Age at questionnaire, median (IQR) | 67.00 (15.03) | 67.06 (12.00) | < 0.01* |
BMI, median (IQR) | 26.81 (5.45) | 25.95 (3.93) | < 0.01* |
Days since diagnosis, median (IQR) | 35.00 (22.03) | 35.00 (22.00) | < 0.01* |
Days between diagnosis and treatment start, median (IQR) | 29.00 (18.00) | 30.06 (21.00) | < 0.01* |
Quality of life, median (IQR) | 66.67 (25.00) | 83.33 (25.00) | < 0.01* |
Physical functioning, median (IQR) | 80.00 (26.67) | 100.00 (13.33) | < 0.01* |
Role functioning, median (IQR) | 66.67 (66.67) | 100.00 (33.33) | < 0.01* |
Emotional functioning, median (IQR) | 75.00 (33.33) | 83.33 (25.00) | < 0.01* |
Cognitive functioning, median (IQR) | 83.33 (33.33) | 100.00 (16.67) | < 0.01* |
Social functioning, median (IQR) | 83.33 (33.33) | 100.00 (16.67) | < 0.01* |
Fatigue, median (IQR) | 44.44 (44.44) | 11.11 (33.33) | < 0.01* |
Nausea/vomiting, median (IQR) | 0.00 (16.67) | 0.00 (0.00) | < 0.01* |
Pain, median (IQR) | 16.67 (50.00) | 0.00 (16.67) | < 0.01* |
Dyspnoea, median (IQR) | 0.00 (33.33) | 0.00 (0.00) | < 0.01* |
Sleeping disturbance,, median (IQR) | 33.33 (66.67) | 0.00 (33.33) | < 0.01* |
Appetite loss, median (IQR) | 0.00 (33.33) | 0.00 (0.00) | < 0.01* |
Constipation, median (IQR) | 0.00 (33.33) | 0.00 (0.00) | < 0.01* |
Diarrhoea, median (IQR) | 0.00 (33.33) | 0.00 (33.33) | < 0.01* |
Financial difficulties, median (IQR) | 0.00 (0.00) | 0.00 (0.00) | < 0.01* |
Education level, N (%) | < 0.01* | ||
Lower/Primary | 36 (9.5) | 138 (4.9) | |
Secondary (High school) | 140 (37.1) | 870 (31.3) | |
Secondary (Vocational) | 102 (27.1) | 721 (25.9) | |
Vocational/ University | 99 (26.3) | 1054 (37.9) | |
Marital status, N (%) | < 0.01* | ||
Married/ Cohabitating | 268 (71.1) | 2217 (79.7) | |
Single/Divorced | 109 (28.9) | 566 (20.3) | |
Smoking history, N (%) | < 0.01* | ||
Never | 110 (29.2) | 1026 (36.9) | |
Previous smoker | 235 (62.3) | 1573 (56.5) | |
Current smoker | 32 (8.5) | 184 (6.6) | |
Alcohol use, N (%) | < 0.01* | ||
Yes | 173 (45.9) | 1372 (49.3) | |
No | 204 (54.1) | 1411 (50.7) | |
Sex | < 0.01* | ||
Male | 233 (61.8) | 1956 (70.3) | |
Female | 144 (38.2) | 827(29.7) | |
Comorbidities, N (%) | < 0.01* | ||
0 | 76 (20.2) | 998 (35.9) | |
1 | 103 (27.3) | 856 (30.8) | |
> 1 | 198 (52.5) | 929 (33.4) | |
Cancer type, N (%) | < 0.01* | ||
Ovarian Cancer | 13 (3.4) | 50 (1.8) | |
Endometrial Cancer | 13 (3.4) | 25 (0.9) | |
Colorectal Cancer | 261 (69.2) | 1996 (71.7) | |
Bladder Cancer | 37 (9.8) | 179 (6.4) | |
Prostate Cancer | 53 (14.1) | 533 (19.2) | |
Cancer stage, N (%) | 0.60 | ||
I | 131 (34.7) | 1032 (37.1) | |
II | 97 (25.7) | 718 (25.8) | |
III | 149 (39.5) | 1033 (37.1) | |
Currently undergoing treatment, N (%) | < 0.01* | ||
Yes | 119 (31.6) | 687 (24.7) | |
No | 258 (69.4) | 2,096 (76.3) | |
Underwent systemic treatment, N (%) | 0.08 | ||
Yes | 57 (15.1) | 330 (11.9) | |
No | 320 (84.9) | 2453 (88.1) | |
Underwent radiotherapy, N (%) | 0.12 | ||
Yes | 96 (25.5) | 607 (21.8) | |
No | 281 (74.5) | 2176 (78.2) | |
Underwent surgery, N (%) | 0.13 | ||
Yes | 290 (76.9) | 2237 (80.4) | |
No | 87 (23.1) | 546 (19.6) |
Predicting clinically relevant future fatigue at Tendpoint
Metric | Logistic Regression | XGBoost | SVM | ANN |
---|---|---|---|---|
Accuracy | 0.861 (0.011) | 0.863 (0.008) | 0.838 (0.015) | 0.825 (0.021) |
Balanced Accuracy | 0.652 (0.023) | 0.585 (0.014) | 0.627 (0.025) | 0.622 (0.043) |
Precision | 0.420 (0.049) | 0.366 (0.043) | 0.322 (0.043) | 0.308 (0.071) |
Sensitivity | 0.373 (0.045) | 0.220 (0.026) | 0.354 (0.049) | 0.353 (0.026) |
Specificity | 0.929 (0.010) | 0.949 (0.007) | 0.904 (0.010) | 0.891 (0.742) |
F1-score | 0.395 (0.044) | 0.273 (0.030) | 0.344 (0.053) | 0.347 (0.061) |
C-statistic | 0.769 (0.024) | 0.744 (0.024) | 0.728 (0.028) | 0.742 (0.034) |
R-squared | 0.035 (0.069) | -0.027 (0.135) | -0.148 (0.094) | -0.092 (0.137) |
Calibration Slope | 0.686 (0.077) | 0.458 (0.080) | 0.489 (0.067) | 0.553 (0.139) |
Internal–external cross-validation
Metric | Logistic Regression Output | Bladder Cancer | Colorectal Cancer | Prostate Cancer—PROCORE | Prostate Cancer—ProZib | Gynaecological cancer |
---|---|---|---|---|---|---|
Accuracy | 0.861 (0.011) | 0.796 (0.003) | 0.851 (< 0.001) | 0.836 (0.002) | 0.913 (0.003) | 0.757 (0.005) |
Balanced Accuracy | 0.652 (0.023) | 0.630 (0.002) | 0.619 (0.002) | 0.693 (0.007) | 0.611 (0.007) | 0.665 (0.009) |
Precision | 0.420 (0.049) | 0.399 (0.008) | 0.345 (0.001) | 0.345 (0.006) | 0.541 (0.038) | 0.532 (0.011) |
Sensitivity | 0.373 (0.045) | 0.378 (< 0.001) | 0.316 (0.004) | 0.511 (0.014) | 0.243 (0.014) | 0.2473 (0.019) |
Specificity | 0.929 (0.010) | 0.882 (0.004) | 0.921 (0.001) | 0.875 (< 0.001) | 0.979 (0.003) | 0.856 (0.006) |
F1-score | 0.395 (0.044) | 0.371 (0.002) | 0.329 (0.003) | 0.400 (0.002) | 0.347 (0.011) | 0.487 (0.017) |
C-statistic | 0.769 (0.024) | 0.776 (0.001) | 0.745 (0.001) | 0.818 (0.001) | 0.787 (0.003) | 0.736 (0.004) |