Ga naar de hoofdinhoud
Top

Predicting health-related quality of life two years post-diagnosis across seven cancer types: using machine learning to identify vulnerable patients

  • Open Access
  • 01-03-2026
Gepubliceerd in:

Abstract

Purpose

Cancer survivors often experience long-term consequences affecting their Health-Related Quality of Life (HRQoL). Sociodemographic factors, clinical characteristics, and health-related behaviours influence HRQoL, making some individuals vulnerable to adverse HRQoL. This study develops linear regression and machine learning models to predict HRQoL two-year post-diagnosis and to identify key vulnerability factors.

Methods

This longitudinal study included data of survivors of seven cancer types. Nineteen predictor variables were derived from questionnaires completed within three months post-diagnosis (baseline) from the Netherlands Cancer Registry. Linear regression, random forest, XGBoost, neural network, and Support Vector Machine (SVM) regressors were employed to predict the EORTC QLQ-C30 summary score 1.5–2.5 years post-diagnosis. Permutation testing assessed vulnerability factors.

Results

The analyses included 4,538 individuals. All models achieved similar R2 (0.3) and RMSE (9) scores. Linear regression, random forest, XGBoost, and SVM models identified lower physical, cognitive, and emotional functioning at diagnosis, along with more comorbidities, cancer type (especially endometrial), and higher BMI as the top vulnerability factors. Treatment, age, and education were not associated with vulnerability. All models tended to overestimate low HRQoL which might be due to the limited number of observations with low HRQoL values.

Conclusions

The predictors used in this analysis explained only 30% of the variation in long-term HRQoL. Similar to previous studies predicting HRQoL in cancer, these predictors miss crucial information. Baseline functioning, comorbidities, cancer type and BMI appeared to be the key vulnerability factors. Future studies should prioritize accurate prediction of low HRQoL scores.

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1007/s11136-026-04165-4.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Introduction

Globally, the number of cancer survivors is expected to increase in the upcoming years due to an increase in cancer incidence, and improvements in treatment, diagnostics and the organisation of healthcare [1]. The majority of these survivors are at risk for a number of adverse consequences following a cancer diagnosis and treatment including fatigue, pain, emotional symptoms, appetite loss and dyspnoea [24]. These consequences can influence an individual’s well-being and health-related quality of life (HRQoL) [5].
Research has demonstrated that HRQoL is influenced by sociodemographic and clinical factors as well as health-related behaviours, indicating that certain individuals may be more vulnerable to experiencing adverse HRQoL outcomes. For instance, illiteracy, unemployment, being female, being single, lacking social interactions, older age, comorbidities, African American ethnic background, and impaired mental health have been shown to contribute to increased vulnerability for impaired HRQoL among cancer survivors [610]. Certain health-related behavioural factors, such as alcohol consumption and smoking were associated with HRQoL in head and neck cancer patients [11]. Patient-reported functioning at diagnosis is linked to overall survival in both young adults with cancer [12] and lung cancer patients undergoing chemotherapy [13]. Because baseline functioning influences overall survival, it might also affect long-term HRQoL. Determining which factors and behaviours influence HRQoL serves a crucial role in identifying individuals who are vulnerable to adverse HRQoL outcomes and who might benefit from interventions.
As the complexity and volume of data on cancer survivors continues to grow, machine learning has emerged as a powerful tool in cancer research. Machine learning is predominantly applied to aid in assessing patient susceptibility for cancer prevention [15], facilitating (early) diagnosis [16], predicting patient prognosis regarding disease recurrence [17] and survival [18] as shown in recent reviews [1921]. In contrast, applications of ML to predict long-term patient-reported outcomes, such as HRQoL, remain limited. Existing studies have generally focused on a single cancer type and often lack integration of behavioral and psychosocial predictors, which are crucial to understanding HRQoL [22, 23]. This suggests that there is a research gap regarding the application of machine learning to predict the long-term consequences of cancer across various types of cancer.
This study aimed to develop linear regression and machine learning models to predict long-term HRQoL outcomes in cancer survivors. In particular, we aimed to predict the HRQoL at two years after diagnosis for survivors of ovarian, endometrial, colon, rectal, bladder, prostate, and breast cancer, which are among the most prevalent solid tumors with relatively high survival rates, and involve diverse clinical characteristics and treatment modalities. The models will incorporate demographic factors, clinical characteristics, health-related behaviours and functioning at time of diagnosis across various cancer types and compare these with a simple reference model. Additionally, we aimed to identify cancer survivors who are vulnerable to adverse HRQoL outcomes by identifying the most important factors contributing to poor HRQoL. These insights will aid clinicians in personalizing HRQoL prognosis and identifying vulnerable cancer survivors who may benefit from additional early-stage clinical support [24, 25].

Methods

Study design

We used data of multiple prospective studies within the Patients Reported Outcomes Following Initial treatment and Long term Evaluation of Survivorship (PROFILES) registry [26]. Clinical and sociodemographic data of the Netherlands Cancer Registry (NCR) were linked to these studies. For the patient-reported outcome (PRO) data collection, patients received information and were invited to participate via a letter by their (former) treating physician [2631]. We made a selection from the PROFILES registry to obtain colon and rectal cancer survivors from the Prospective Dutch Colorectal Cancer (PLCRC) cohort [27] and the Patient-Reported Outcomes in Colorectal Cancer (PROCORE) cohort [32]; ovarian and endometrial cancer survivors from the Registration system Oncological GYnaecology (ROGY) cohort [28]; bladder cancer survivors from the Insight into Bladder Cancer Care (BlaZIB) cohort [29]; prostate cancer survivors from the Insight into Prostate Cancer Care (ProZIB) cohort [30]; and finally breast cancer survivors from the Utrecht cohort for Multiple BReast cancer intErvention studies and Long-term evaLuAtion (UMBRELLA [31]) (Appendix Table 1). In this study, cancer survivor was defined as an individual being alive after a cancer diagnosis, regardless of whether they were cured of cancer [33].

Patient selection

However, for the purposes of the study, we selected the individuals who had completed a questionnaire within three months of diagnosis and one other between 1.5 and 2.5 years after diagnosis. So, each included patient completed two questionnaires. As such, our study population consists of individuals who survived at least 1.5 years post-diagnosis. Finally, we excluded individuals that were deceased within 2.5 years after diagnosis and individuals with metastatic cancer at initial diagnosis (stage IV). Our study does not capture long-term outcomes beyond 2.5 years.

Study measures

Predictor variables

Sociodemographic characteristics included highest completed educational level (lower/primary education, secondary education (high school), secondary vocational school, vocational education or university), and partner status (married/living together or single/divorced), which were self-reported via the questionnaires. Age at diagnosis (in years) and sex (male or female) were collected through the NCR.
Health-related behaviours including smoking habits (never, former or current smoker), alcohol consumption (never, former or current drinker) and Body Mass Index (BMI; weight in kg/ height in m2) were self-reported via the questionnaires.
Clinical characteristics included cancer stage according to the TNM classification [34], treatment (systemic therapy, surgery and radiotherapy (yes or no); whether the individual was undergoing treatment at the time of completing the baseline questionnaire (yes or no)) and cancer type (colon, rectal, ovarian, endometrial, bladder, prostate, and breast cancer) were collected through the NCR. The number of comorbidities (no-, one- or more than one) were self-reported within three months after diagnosis, using a modified version of the Charlson Comorbidity Index [35].
HRQoL was measured via EORTC QLQ-C30 [36]. This is a 30-item questionnaire that consists of 15 scales: five multi-item functional scales (physical, role, social, emotional, and cognitive functioning), nine multi- and single-item symptom scales (fatigue, pain, nausea or vomiting, dyspnoea, insomnia, appetite loss, constipation, diarrhoea, and financial difficulties), and a two-item global health status/QoL scale [37]. Raw scores for each multi-item scale are linearly transformed, where higher scores represent better functioning or worse symptoms [36]. As this resulted in a finite set of possible values between 0 and 100, the scales were treated as discrete variables (Appendix Table 2). For this study, we included the functional scales reported within three months after diagnosis, where higher scores indicate better functioning.

Response variable

The response variable was the EORTC QLQ-C30 summary score reported between 1.5- and 2.5-years post-diagnosis (ranging from 0 to 100, higher score indicates better HRQoL) [36]. Known group comparison analyses showed good validity and responsiveness to change over time has found to be good [38, 39]. The summary score was calculated as the average of the functional and symptom scales, excluding financial difficulties and global health status/QoL scores [4042]. We considered the summary score as a continuous variable because it can take on more than 1,000 distinct values.

Data exploration and preprocessing

Initially a backward filling imputation method was employed to estimate BMI, comorbidities, highest educational level, partner status, alcohol consumption, and smoking habits in case of missing values. This means that an individual’s baseline variable was imputed using the first available non-missing variable value from a subsequent questionnaire. The remainder of the missing values needed to be estimated for which we employed the random forest based MissRanger imputation algorithm [43]. This algorithm considers the relationships between all predictor variables (except for the response variable) to predict the missing data. We used Predictive Mean Matching (PMM) in missRanger for imputation to ensure realistic and plausible values. To evaluate the imputation method, analyses were also performed on complete case data, i.e., a subset of data with only observations without any missing values. All data selection and preprocessing steps were done using R 4.2.2 (2022-10-31) [44].

Model development

Multiple models were developed to predict the QLQ-C30 summary score. First, we built a linear regression model as an easy-to-interpret reference model. Furthermore, we explored tree-based algorithms, including random forest regressor [45] and eXtreme Gradient Boosting regressor. Subsequently, we implemented a Support Vector Machine (SVM) for regression. Finally, we developed a feedforward neural network [46]. Additionally, for each model we included the possibility to synthetically increase the few individuals with low HRQoL values in the training dataset, by oversampling them using the Synthetic Minority Oversampling Technique (SMOTE [47]) for regression [48], the artificial instances are created based on real data points. This study was conducted and reported in accordance with the TRIPOD-AI guidelines [49]. These models were developed using Python 3.11.1 [50] with the sklearn [51], XGBoost [52], and TensorFlow [53] libraries.
How well a model’s prediction matches unseen data is defined as the generalizability of the model [54]. To assess the generalizability and performance of the models, we used shuffle split cross-validation with 100 iterations for all predictions (Appendix Fig. 1). For each iteration, data were randomly divided, with 80% allocated to the training dataset and the remaining 20% to the testing dataset. The training data were used directly to train the random forest and XGBoost regressors. Because the other methods are sensitive to data standardization, with standardization typically improving model performance [55, 56], the training data were standardized beforehand for the linear model, the SVM, and the neural network.
Hyperparameters are external parameters of machine learning methods that need to be set manually. Nested cross-validation using a randomized grid search (Appendix Table 3) was used for this [57]. This involved splitting the total dataset in multiple folds into training (80%) and testing (20%) data. The training dataset was further split into training (80%) and validation (20%) subsets. The inner training subset was used to fit the models under different hyperparameter settings, and the validation subsets were used for their performance. The outer testing data was used for evaluating the final model’s performance. This nested cross-validation ensured that the validation data for tuning was distinct from the testing data. To tune the parameter that dictates the maximum number of training cycles without improvement before halting the training process in the neural network model, we included an additional step. In this step, 20% of the training dataset were allocated as validation data. Then, when the model performance on the validation data stopped improving, the training process was terminated.

Model evaluation

We evaluated model performance by plotting model predictions against true values and by quantification of prediction errors. For the quantitative analyses we employed Root Mean Squared Error (RMSE) and R-squared (R2) on the test dataset. The RMSE measures differences between model predictions and observed (true) values on the same scale as the response variable. When the RMSE is 0, predictions perfectly match observed values, and there is no upper limit to its value. To put the RMSE value in perspective, we also calculate the relative error R2, i.e., the variance explained by the model compared to a null model that simply returns the average for each test data point. R2 quantifies the proportion of variance in the response variable that can be explained by the model. When it has a value of 1, it means that predictions and true values correlate perfectly, while when it has a value of 0, there is no correlation between the two. In simple linear regression models, on training data, the R2 value is identical to the square of the correlation between observed and predicted values. Because the metrics are calculated on test data alone, the model could perform worse than the null model, in which case negative values of R2 would emerge. To ensure reliability of our evaluation, all metrics were calculated across all iterations of the cross-validation procedure and then averaged to obtain overall performance.

Feature importance

Feature importance was examined for the reference linear regression model, the random forest, XGBoost regressors and the SVM model as these models and the most-promising machine learning models using permutation testing [58]. The values of a single feature were shuffled in the test data. The model’s drop in R2 after shuffling the feature is an indication of its importance for the predictions. Permutation testing was conducted for each feature in every iteration. The results were then gathered and combined across the 100 iterations without summarizing to preserve iteration-specific variability allowing its analysis. The final linear regression model was trained on the entire dataset and the beta coefficients were used to determine the direction of the topmost important features, as identified through permutation testing. Model development and feature importance was done using Python 3.11.1 [50], including the sklearn library [51].

Results

Survivors’ characteristics

Approximately 45% of the initial population was included in the analyses (Appendix Fig. 2). Exclusions were related to the absence of a baseline questionnaire within the specified timeframe (30% of the total), missing data regarding the response variable (23% of the total), or death within 2.5 years since diagnosis. This resulted in a final study population of 4,538 survivors of 7 cancer types.
Survivors were on average 66 years old (9.58 SD), and half (51%) of the individuals had colon or rectal cancer (Table 1; Appendix Fig. 3). Appendix Table 4 contains a full breakdown by tumour type.
Table 1
Survivors’ characteristics including the imputed data
Characteristic
Survivors (n = 4538)
Age at diagnosis, min; mean ± SD; max
24; 66.17 ± 9.58; 94
Sex, No. (%)
 
 Female
1510 (33)
 Male
3028 (67)
BMI, mean (SD)
26.61 (4.58)
Education, No. (%)
 
 Lower/primary education
256 (6)
 Secondary education (high school)
1408 (31)
 Secondary vocational education
1219 (27)
 Higher vocational education, university
1655 (36)
Partner, No. (%)
 
 Married/living together
3762 (83)
 Single/divorced
776 (17)
Smoking, No. (%)
 
 Never
1295 (29)
 Previous
2572 (57)
 Current
671 (15)
Alcohol consumption, No. (%)
 
 Never/former drinker
2129 (47)
 Current drinker
2409 (53)
Comorbidities, No. (%)
 
 No comorbidities
1552 (34)
 One comorbidity
1372 (30)
 More than one comorbidity
1614 (36)
Tumor type, No. (%)
 
 Ovarian cancer
30 (1)
 Endometrial cancer
62 (1)
 Colon cancer
1389 (31)
 Rectal cancer
930 (20)
 Bladder cancer
761 (17)
 Prostate cancer
928 (20)
 Breast cancer
438 (10)
Stage (TNM), No. (%)
 
 I
1984 (44)
 II
1234 (27)
 III
1320 (29)
 Under treatment at baseline, No. (%)
1327 (29)
Treatment received, No. yes (%)
 
 Systemic therapy (chemotherapy,
targeted therapy and/or immunotherapy)
558 (12)
 Received surgery
3270 (72)
 Radiotherapy
1312 (29)
Functional scales at baseline, mean (SD)
 
 Role functioning
80.43 (27.42)
 Emotional functioning
81.89 (18.27)
 Cognitive functioning
88.91 (16.47)
 Physical functioning
89.35 (15.01)
 Social functioning
86.13 (20.46)
QLQ-C30 summary score
 
 Baseline, median (IQR), min–max
90.21 (80.98–95.73), 19.62–100
 Two years post-diagnosis, median (IQR), min–max
92.74 (85.13–97.44), 26.67–100

Model performance in predicting HRQoL two years post-diagnosis

The reported results incorporate hyperparameters tuning (Appendix Table 3). The reference linear regression model achieved a testing RMSE score of 8.9 and a testing R2 of 0.3 (Table 2). The XGBoost model performed best in terms of both RMSE and R2, but the performance metrics were fairly similar across the various models, with average testing RMSEs ranging from 8.9 and 9.1, and average testing R2 values between 0.28 and 0.30. Although there are some minor differences in performances of the models, when the 95% confidence intervals are taken into consideration, that show substantial overlap, these differences are not statistically significant. Overall, hyperparameter tuning mostly fixed overfitting of the random forest and XGBoost regressors. Analyses were conducted using both the imputed dataset and the complete case dataset to assess the validity of the imputation method. The results were highly similar across the two datasets (Table 2 and Appendix Table 5). The subset with complete case data resulted in 3528 individuals, thus 1010 individuals had missing values in their data. The similar performance indicates that imputation preserved the underlying structure in the data. Across all models, most data points occurred at high values for both the observed and the predicted HRQoL (Fig. 1). These high-density regions fell slightly below the perfect calibration line, suggesting that the models tended to underestimate high true values. To investigate this, we visualized predictions for discrete groups of the response variable, where the response variable was discretised into eight quantile-based groups, each containing approximately the same number of data points (Fig. 2). The groups consisted of increasingly smaller ranges of values, indicating a higher concentration of outcome values at high HRQoL levels, thus suggesting that most outcome values were skewed toward the higher end. The group with observed HRQoL values from 27 to 77, were estimated too high by all the models, while the models underestimated the maximum (100) observed values. In the case of the linear regression model (Fig. 2A), the neural network (Fig. 2D) and the SVM model (Fig. 2E) some predictions exceeded the maximum HRQoL score of 100. This is because these models typically do not enforce any specific output constraints and are thus not bounded by a maximum. The synthetically generated individuals with low HRQoL by SMOTE did not improve the models’ performances (Appendix Table 6). The random forest and XGBoost regressors (Fig. 2B and C) exhibited the worst underestimations. Overall, all models tended to predict within a narrow range of values across all eight groups.
Table 2
Average and standard deviation of training and testing performance metrics for each model over shuffle split cross-validation with 100 iterations for predicting HRQoL two years post-diagnosis across seven tumour types
Model
Test RMSE
Train RMSE
Test R2
Train R2
Linear regression
8.968 ± 0.298
8.662 ± 0.074
0.303 ± 0.031
0.345 ± 0.008
Random forest
9.042 ± 0.340
8.074 ± 0.081
0.297 ± 0.029
0.430 ± 0.008
XGBoost
8.897 ± 0.322
8.685 ± 0.077
0.304 ± 0.023
0.344 ± 0.007
Neural network
8.998 ± 0.32
8.393 ± 0.153
0.288 ± 0.029
0.387 ± 0.019
SVM
9.124 ± 0.370
8.950 ± 0.090
0.278 ± 0.028
0.301 ± 0.008
R2 values correspond to the fraction of variance in the response variable that is explained by the model. It has a maximum value of 1, indicating perfect correlation between predicted and true values. For simple linear regression R2 values for training data correspond to the square of the correlation between predicted and true values, e.g., if in simple regression the correlation between observed and predicted values is 0.55 for the training data, the R2 value would be 0.303
Fig. 1
Observed vs. predicted HRQoL values two years post-diagnosis across seven tumour types. Kernel density estimations are shown based on shuffle split cross-validation with 100 iterations. Colours indicate the density of data points across combinations of observed and predicted HRQoL values, with darker shades representing higher concentrations. Each plot corresponds to a different model: A Linear regression model, B Random forest regressor, C XGBoost regressor, D Neural network, and E SVM
Afbeelding vergroten
Fig. 2
Predicted vs. observed discrete HRQoL values two years post-diagnosis across seven tumour types. Model predictions are shown for each discretised observed HRQoL group based on shuffle split cross-validation with 100 iterations. The discretisation was done using quantile-based grouping to ensure an equal number of observations per group. For each group the observed HRQoL values are represented with jittered points. Each plot corresponds to a different model: A Linear regression model, B Random forest regressor, C XGBoost regressor, D Neural network, and E SVM
Afbeelding vergroten

Feature importance

The top four features were consistent across the reference linear regression model (Figs. 3A, 4A) and the machine learning models (Fig. 3B, D), namely lower physical functioning, emotional functioning, cognitive functioning, and more comorbidities at the time of diagnosis predict poorer HRQoL. For the reference linear regression model (Figs. 3A, 4A), higher BMI predicts poorer HRQoL as the fifth most important feature, which was also true for both tree-based models (Fig. 3B and C), the fifth most important feature for the SVM model was cancer type (Fig. 3D). Endometrial cancer exhibited the strongest negative association with HRQoL compared with colon cancer (Fig. 4B). For the remaining features, permutation testing resulted in a change in R2 of close to zero (overview in Appendix Table 7).
Fig. 3
The top 10 most important features over shuffle split cross-validation with 100 iterations for predicting HRQoL two years post-diagnosis across seven tumour types. Feature importance was determined based on permutation testing. Each box plot displays the distribution of changes in R2 after permutation testing per feature across 100 iterations of shuffle split cross validation. The central line inside each box represents the median. The boxes represent the interquartile range with the boxes’ edges representing the 25th and 75th percentile of the data. The whiskers extend to the minimum and maximum values within a range of 1.5 times the interquartile range. The black points beyond this range are outliers. Each plot corresponds to a different model: A Linear regression model, B Random forest regressor, C XGBoost regressor, D SVM regressor
Afbeelding vergroten
Fig. 4
Beta coefficients from the linear regression model for each feature used to predict HRQoL two years post-diagnosis across seven tumour types. The coefficients are calculated relative to a reference category for the categorical predictors. The zero line reflects this reference category. The reference category includes the lowest available functioning scores, absence of comorbidities, not receiving treatment at baseline, no radiotherapy, surgery, or systemic therapy, stage 1 cancer, never or former alcohol drinker, never smoked, male sex, married or living together with a partner, and lower/primary education level. Panel A presents the beta coefficients for all available predictors; color indicates the feature value, where red shows higher values of the predictors and blue lower (e.g., cognitive functioning has seven discrete levels, each depicted as a dot (relative to the reference level, i.e., the lowest functioning score). For example, the first blue dot for the feature cognitive functioningindicates that a low value of cognitive functioning (as indicated by the blue color) is associated with a negative beta coefficient (location on the x-axis), and thus a negative effect on HRQoL two year post-diagnosis). Panel B focuses on the beta coefficients for each tumour type with colon cancer as reference category.
Afbeelding vergroten

Discussion

This prospective study demonstrated that only about 30% of the variance in HRQoL two years post-diagnosis could be predicted for individuals with any of seven different cancer types, based on their characteristics at diagnosis. Notably, the machine learning models did not outperform the simpler linear regression model. Across all models, key vulnerability factors consistently included lower physical, cognitive and emotional functioning, as well as more comorbidities. However, all models tended to overestimate the low observed HRQoL values, indicating their current limitations for use in clinical practice.
Our performance metric results showed that the models performed very similar to each other with minor differences in metrics. Within the RMSE scores, the difference between the models’ performances was no more than 0.2 and within the R2 scores no more than 0.03.
The similar results between the analysis on the imputed dataset and the complete case data suggests that the imputation method did not alter the relationships within the data or introduce bias, this implies that the imputation methods were effective. Given the minimal impact of the imputation on the results, using the complete case dataset could be preferred to preserve the original dataset. However, by using the imputed dataset more data is retained.
The reference linear regression model achieved similar performance scores to the machine learning models. This suggests that this simpler model might be preferred because it is easier to interpret. Several studies find similar results, where the more recently developed machine learning algorithms performed similarly as the classical regression methods [5962]. However, other studies have reported that machine learning algorithms can outperform the classical regression methods under certain conditions [6365]. The limited explanatory power of the predictor variables included in this study may have restricted all models' ability to accurately predict HRQoL. Future research should revisit the selection of predictor variables, incorporating variables such as cancer recurrence, illiteracy [8], unemployment [8], ethnic background [10], and coping mechanisms [66], which a linear model might capture more accurately.
Overall, the models achieved an R2 score of around 0.3 which implies that 70% of the variability in HRQoL was left unexplained. While this R2 may seem low in comparison to studies predicting clinical outcomes such as survival [67], similar R2 values of 0.33 and 0.39 were found in previous studies predicting HRQoL outcomes in cancer patients receiving chemotherapy [68] and in young adult survivors [69], respectively. The low R2 in our analysis and previous studies may indicate there is inherent unpredictability in HRQoL outcomes.
An average RMSE of 9 indicates that approximately 95% of the residuals fall between − 18 and + 18. This implies that 95% of the predictions deviate no more than 18 units from the actual HRQoL values. This poor performance may be caused by the relatively few survivors with low HRQoL scores—16% had a score lower than 80. Because of this, the RMSE might be misleading: even if the models predicted high HRQoL values accurately but poorly predicted the few low values, the overall RMSE will still appear relatively small. The large number of near-accurate high-value predictions overshadow the poor low-value predictions. This is especially problematic because the low-value predictions are clinically relevant, as these indicate the vulnerable survivors. Furthermore, survivorship bias, which occurs when only patients who are alive at the study endpoint are included, may have further skewed our results by excluding those with more severe outcomes. While imputation of missing response data could reduce this bias, it could also introduce bias by distorting the true relationships between variables, as it may artificially smooth over the variability in HRQoL outcomes.
The linear regression model, the random forest, XGBoost, and SVM regressors agreed on the topmost important vulnerability factors. Our findings are in line with previous research showing that lower baseline physical functioning, emotional functioning, comorbidities and BMI are related to poor long-term HRQoL outcomes in cancer survivors according to review [71] and prediction [7275] studies. These findings may simply support the ‘better in is better out’ principle or the cancer and treatment may have more impact on their HRQoL. We did not find studies related to cognitive functioning and HRQoL outcomes. In contrast to other studies, we did not identify other factors to be predictive for HRQoL, such as treatment, age, smoking status, and educational level. It is likely that these factors are closely related to other factors in our models and do not appear to be of additional relevance. Additionally, endometrial and ovarian cancer had the strongest negative impact on long-term HRQoL compared to colon cancer. However, cancer type showed minimal influence on HRQoL in our analysis, suggesting that pooling different cancer types together does not compromise the model's generalizability across tumour types. Furthermore, as shown in previous literature, we saw that baseline functioning (being part of HRQoL), is above all other factors the most important predictor of future HRQoL [23]. Including these variables is clinically relevant, as they reflect early vulnerability and are already available in routine care, supporting timely identification of patients at risk. This implies that it may be relevant for future studies to address changes in HRQoL rather than HRQoL at a single follow-up point. Alternatively, future studies could explore HRQoL one year after diagnosis. This could potentially help in identifying critical periods when patients may need additional support. Lastly, it is crucial to mention that the feature importance present here cannot be given a causal interpretation as the models do not estimate marginal probabilities (e.g., of medical interventions across the whole sample) but only conditional ones (e.g., what is the HRQoL given a person is 89 years old) [76].
Given the complexity of predicting HRQoL, several limitations inherent to this study should be considered. First of all, the proportion of patients with low HRQoL was limited. To tackle this, we synthetically increased the number of individuals with low HRQoL value. This did not improve the models’ performance metrics. This could indicate that these synthetic cases did not accurately capture the true underlying patterns in the data. We recommend that future research efforts focus on improving the recruitment of patients with low HRQoL to participate in prospective studies. A second limitation was that HRQoL had a maximum value constraint, causing the models to underestimate the high observed HRQoL values. To address this, we transformed the response variable using a logit transformation, but overall model performance worsened. Further, although cancer type had a small overall contribution to the explained variance, the large proportion of colorectal cancer patients in our dataset may have skewed our results, potentially limiting the generalizability to other cancer types.
Future research may explore extended generalized linear models as they allow response variables with non-normal error distributions [77]. One possibility for improving both linear regression and machine learning models is through the exploration of different loss functions. Given the observed inaccuracies of the models predicting edge cases, a loss function that specifically penalizes errors occurring at the edges of the response variable could potentially improve the models’ accuracy. Such as the quantile loss function, specifically designed to handle predictions at different quantiles of the response variable’s distribution [78].
Predictive models may be promising in aiding healthcare providers identifying patients with poor prognosis of long-term HRQoL among cancer survivors. Increasingly, survivors complete PROMs in daily clinical care. These can, in combination with other vulnerability factors, be used to predict poor outcomes (e.g., treatment choice, HRQoL, survival) and inform personalized supportive care [79, 80].
In conclusion, this study demonstrated that HRQoL two years post-diagnosis can be predicted with an overall explained variance of about 30%. Similar to previous studies predicting HRQoL in cancer, the predictors included did not accurately capture long-term HRQoL. This indicates that predictors commonly used in prediction studies lack crucial information on long-term HRQoL [81]. Moreover, machine learning models were not better than a simple linear regression model. All models performed poorly in predicting low HRQoL and thus predicting who is vulnerable. Patients with low baseline HRQoL (i.e., physical functioning, cognitive functioning, emotional functioning), more comorbidities, endometrial or ovarian cancer and higher BMI may be more vulnerable for lower HRQoL outcomes and may be in need for early supportive care. Future studies may focus on improving the accuracy of predicting lower HRQoL scores by improving the recruitment of patients with poor HRQoL and by including a broader range of parameters.

Acknowledgements

We thank all the cohorts that contributed to this study for the collection of patient-reported outcomes, including Prospective Dutch Colorectal Cancer (PLCRC) cohort, the Patient-Reported Outcomes in Colorectal Cancer (PROCORE) cohort, the Registration system Oncological GYnaecology (ROGY) trial; the Insight into Bladder Cancer Care (BlaZIB) cohort; prostate cancer survivors from the Insight into Prostate Cancer Care (ProZIB) cohort; and the Utrecht cohort for Multiple BReast cancer intErvention studies and Long-term evaLuAtion (UMBRELLA). We also thank the registration team of the Netherlands Comprehensive Cancer Organisation (IKNL) for the collection of data for the Netherlands Cancer Registry. The ROGY Care Trial was supported by grant UVT 2010-4743 from the Dutch Cancer Society. PROCORE was supported by the Center of Research on Psychological disorders and Somatic diseases (CoRPS), Tilburg, University, the Netherlands; the Netherlands Comprehensive Cancer Organisation, Utrecht, the Netherlands; and an Investment Subsidy Large (2016/04981/ZONMW-91101002) of the Netherlands Organization for Scientific Research (The Hague, The Netherlands). PLCRC was supported by the Dutch Cancer Society (UVA 2013-6331); Stand Up to Cancer; ZonMw; Health Holland; Maag Lever Darm Stichting; Merck (unrestricted grant); Bristol-Myers Squibb (unrestricted grant); Bayer (unrestricted grant); and Servier (unrestricted grant). ProZIB was supported by the Dutch Cancer Society (project number 2013-5942). BlaZIB was supported by the Dutch Cancer Society (KWF; IKNL 2015–7914). UMBRELLA was supported by the Department of Radiation Oncology and the Julius Center for Health Sciences and Primary Care of the University Medical Center Utrecht, the Netherlands. The funding organisations had no role in the design, analysis, and interpretation of the results of this study.

Declarations

Conflict of interest

GV obtained research grants to institution: Bayer, BMS, Delfi Diagnostics, Merck, Natera, Nordic pharma, Labcorp (Personal Genome Diagnostics), Pierre Fabre, Servier, Sirtex.

Ethical approval

Ethical approval was previously obtained for each study cohort separately (ROGY: NL33429.008.10 [28]; PROCORE: NL51119.060.14 [32]; PLCRC: NL47888.041.14 [27]; UMBRELLA: NL52062.041.15 [75]). Both the BlaZIB [29] and ProZIB [30] studies were beyond the scope of the Medical Research Involving Human Subjects Act (WMO) and, therefore, an ethics approval was waived.
Informed consent was obtained from all individual participants included in the study.
Not applicable.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Titel
Predicting health-related quality of life two years post-diagnosis across seven cancer types: using machine learning to identify vulnerable patients
Auteurs
Willemijn F. Oudijk
Belle H. de Rooij
Koen J. van Benthem
Rampal S. Etienne
Simone Oerlemans
Helena M. Verkooijen
Katja K. H. Aben
Geraldine R. Vink
Anne M. May
Floortje Mols
Dimitris Katsimpokis
Nicole P. M. Ezendam
Publicatiedatum
01-03-2026
Uitgeverij
Springer International Publishing
Gepubliceerd in
Quality of Life Research / Uitgave 3/2026
Print ISSN: 0962-9343
Elektronisch ISSN: 1573-2649
DOI
https://doi.org/10.1007/s11136-026-04165-4

Supplementary Information

Below is the link to the electronic supplementary material.
1.
go back to reference Praagman, J., Slotman, E., van Disseldorp, L., & Lemmens, V. (2022). Kanker in Nederland trends & prognoses tot en met 2032. https://iknl.nl/getmedia/0ac9c06a-1b98-438e-93f5-db7cc979528f/trendrapport-kanker-in-NL_-14072023.pdf
2.
go back to reference de Rooij, B. H., Oerlemans, S., van Deun, K., Mols, F., de Ligt, K. M., Husson, O., Ezendam, N. P. M., Hoedjes, M., van de Poll-Franse, L. V., & Schoormans, D. (2021). Symptom clusters in 1330 survivors of 7 cancer types from the PROFILES registry: A network analysis. Cancer, 127(24), 4665–4674.CrossRefPubMedPubMedCentral
3.
go back to reference Baden, M., Lu, L., Drummond, F. J., Gavin, A., & Sharp, L. (2020). Pain, fatigue and depression symptom cluster in survivors of prostate cancer. Supportive Care in Cancer, 28(10), 4813–4824. https://doi.org/10.1007/s00520-019-05268-0CrossRefPubMedPubMedCentral
4.
go back to reference Oerlemans, S., Mols, F., Issa, D., Pruijt, J. H. F. M., Peters, W., Lybeert, M., Zijlstra, W., Coebergh, J. W., & van de Poll-Franse, L. (2013). A high level of fatigue among long-term survivors of non-Hodgkin’s lymphoma: Results from the longitudinal population-based PROFILES registry in the south of the Netherlands. Haematologica, 98(3), 479–486. https://doi.org/10.3324/haematol.2012.064907CrossRefPubMed
5.
go back to reference de Graeff, A., de Leeuw, J. R. J., Ros, W. J. G., Hordijk, G.-J., Blijham, G. H., & Winnubst, J. A. M. (2000). Long-term quality of life of patients with head and neck cancer. The Laryngoscope, 110(1), 98–106. https://doi.org/10.1097/00005537-200001000-00018CrossRefPubMed
6.
go back to reference Laghousi, D., Jafari, E., Nikbakht, H., Nasiri, B., Shamshirgaran, M., & Aminisani, N. (2019). Gender differences in health-related quality of life among patients with colorectal cancer. Journal of Gastrointestinal Oncology; Vol 10, No 3 (June 01, 2019): Journal of Gastrointestinal Oncology. https://jgo.amegroups.org/article/view/27616
7.
go back to reference Bellizzi, K. M., Aziz, N. M., Rowland, J. H., Weaver, K., Arora, N. K., Hamilton, A. S., Oakley-Girvan, I., & Keel, G. (2012). Double jeopardy? Age, race, and HRQOL in older adults with cancer. Journal of Cancer Epidemiology, 2012(1), Article 478642. https://doi.org/10.1155/2012/478642CrossRefPubMedPubMedCentral
8.
go back to reference Ramasubbu, S. K., Pasricha, R., Nath, U., Rawat, V., & Das, B. (2020). Quality of life and factors affecting it in adult cancer patients undergoing cancer chemotherapy in a tertiary care hospital. Cancer Reports. https://doi.org/10.1002/cnr2.1312CrossRefPubMedPubMedCentral
9.
go back to reference Hamel, J.-F., Pe, M., Coens, C., Martinelli, F., Eggermont, A. M. M., Brandberg, Y., & Bottomley, A. (2016). A systematic review examining factors influencing health related quality of life among melanoma cancer survivors. European Journal of Cancer, 69, 189–198. https://doi.org/10.1016/j.ejca.2016.10.008CrossRefPubMed
10.
go back to reference Odeo, S., & Degu, A. (2020). Factors affecting health-related quality of life among prostate cancer patients: A systematic review. Journal of Oncology Pharmacy Practice, 26(8), 1997–2010. https://doi.org/10.1177/1078155220959414CrossRefPubMed
11.
go back to reference Llewellyn, C. D., McGurk, M., & Weinman, J. (2005). Are psycho-social and behavioural factors related to health related-quality of life in patients with head and neck cancer? A systematic review. Oral Oncology, 41(5), 440–454. https://doi.org/10.1016/j.oraloncology.2004.12.006CrossRefPubMed
12.
go back to reference George, G. C., Andersen, C., Tang, X., Rodriguez, E., Jafry, M., Swartz, M. C., Ahmed, S., Barcenas, C. H., Livingston, J. A., Roth, M. E., & Hildebrandt, M. A. T. (2024). Patient-reported quality of life at diagnosis in adolescent and young adults with cancer. Journal of the National Comprehensive Cancer Network, 22(6), Article e247012. https://doi.org/10.6004/jnccn.2024.7012CrossRefPubMedPubMedCentral
13.
go back to reference Badaoui, S., Shahnam, A., McKinnon, R. A., Abuhelwa, A. Y., Sorich, M. J., & Hopkins, A. M. (2022). The predictive utility of patient-reported outcomes and performance status for survival in metastatic lung cancer patients treated with chemoimmunotherapy. Translational Lung Cancer Research; Vol 11, No 3 (March 28, 2022): Translational Lung Cancer Research. https://tlcr.amegroups.org/article/view/61746
14.
go back to reference Watson, E. K., Rose, P. W., Neal, R. D., Hulbert-Williams, N., Donnelly, P., Hubbard, G., Elliott, J., Campbell, C., Weller, D., & Wilkinson, C. (2012). Personalised cancer follow-up: Risk stratification, needs assessment or both? British Journal of Cancer, 106(1), 1–5. https://doi.org/10.1038/bjc.2011.535CrossRefPubMedPubMedCentral
15.
go back to reference Naji, M. A., Filali, S. E., Aarika, K., Benlahmar, E. H., Abdelouhahid, R. A., & Debauche, O. (2021). Machine learning algorithms for breast cancer prediction and diagnosis. Procedia Computer Science, 191, 487–492. https://doi.org/10.1016/j.procs.2021.07.062CrossRef
16.
go back to reference Zhang, Z.-M., Tan, J.-X., Wang, F., Dao, F.-Y., Zhang, Z.-Y., & Lin, H. (2020). Early diagnosis of hepatocellular carcinoma using machine learning method. Frontiers in Bioengineering and Biotechnology. https://doi.org/10.3389/fbioe.2020.00254CrossRefPubMedPubMedCentral
17.
go back to reference Tseng, C.-J., Lu, C.-J., Chang, C.-C., & Chen, G.-D. (2014). Application of machine learning to predict the recurrence-proneness for cervical cancer. Neural Computing and Applications, 24(6), 1311–1316. https://doi.org/10.1007/s00521-013-1359-1CrossRef
18.
go back to reference Chen, Y.-C., Ke, W.-C., & Chiu, H.-W. (2014). Risk classification of cancer survival using ANN with gene expression data from multiple laboratories. Computers in Biology and Medicine, 48, 1–7. https://doi.org/10.1016/j.compbiomed.2014.02.006CrossRefPubMed
19.
go back to reference Capobianco, E. (2022). High-dimensional role of AI and machine learning in cancer research. British Journal of Cancer, 126(4), 523–532. https://doi.org/10.1038/s41416-021-01689-zCrossRefPubMedPubMedCentral
20.
go back to reference Kourou, K., Exarchos, K. P., Papaloukas, C., Sakaloglou, P., Exarchos, T., & Fotiadis, D. I. (2021). Applied machine learning in cancer research: A systematic review for patient diagnosis, classification and prognosis. Computational and Structural Biotechnology Journal, 19, 5546–5555. https://doi.org/10.1016/j.csbj.2021.10.006CrossRefPubMedPubMedCentral
21.
go back to reference Zhang, B., Shi, H., & Wang, H. (2023). Machine learning and AI in cancer prognosis, prediction, and treatment selection: A critical approach. Journal of Multidisciplinary Healthcare, 16, 1779–1791. https://doi.org/10.2147/JMDH.S410301CrossRefPubMedPubMedCentral
22.
go back to reference Krepper, D., Cesari, M., Hubel, N. J., Zelger, P., & Sztankay, M. J. (2024). Machine learning models including patient-reported outcome data in oncology: A systematic literature review and analysis of their reporting quality. Journal of Patient-Reported Outcomes, 8(1), Article 126. https://doi.org/10.1186/s41687-024-00808-7CrossRefPubMedPubMedCentral
23.
go back to reference van der Heijden, T. G. W., de Ligt, K. M., Hubel, N. J., van der Mierden, S., Holzner, B., van de Poll-Franse, L. V., de Rooij, B. H., the EORTC Quality of Life Group. (2025). Exploring the role of health-related quality of life measures in predictive modelling for oncology: A systematic review. Quality of Life Research, 34(2), 305–323. https://doi.org/10.1007/s11136-024-03820-yCrossRefPubMed
24.
go back to reference Yasin, B. G. A., Silva, D. F. da, Chaves, J. B., Abreu, J. P. C. de, & Ortiz, L. P. G. (2023). Psycho-oncology practice: Interventions regarding the emotional vulnerability of cancer patients. Seven Editora. http://sevenpublicacoes.com.br/index.php/editora/article/view/2412
25.
go back to reference Rock, C. L., Doyle, C., Demark-Wahnefried, W., Meyerhardt, J., Courneya, K. S., Schwartz, A. L., Bandera, E. V., Hamilton, K. K., Grant, B., McCullough, M., Byers, T., & Gansler, T. (2012). Nutrition and physical activity guidelines for cancer survivors. CA: A Cancer Journal for Clinicians, 62(4), 242–274. https://doi.org/10.3322/caac.21142CrossRef
26.
go back to reference van de Poll-Franse, L. V., Horevoorts, N., van Eenbergen, M., Denollet, J., Roukema, J. A., Aaronson, N. K., Vingerhoets, A., Coebergh, J. W., de Vries, J., & Essink-Bot, M.-L. (2011). The patient reported outcomes following initial treatment and long term evaluation of survivorship registry: Scope, rationale and design of an infrastructure for the study of physical and psychosocial outcomes in cancer survivorship cohorts. European Journal of Cancer, 47(14), 2188–2194.CrossRefPubMed
27.
go back to reference Burbach, J. P. M., Kurk, S. A., van den Coebergh Braak, R. R. J., Dik, V. K., May, A. M., Meijer, G. A., Punt, C. J. A., Vink, G. R., Los, M., Hoogerbrugge, N., Huijgens, P. C., Ijzermans, J. N. M., Kuipers, E. J., de Noo, M. E., Pennings, J. P., van der Velden, A. M. T., Verhoef, C., Siersema, P. D., van Oijen, M. G. H., & Koopman, M. (2016). Prospective Dutch colorectal cancer cohort: An infrastructure for long-term observational, prognostic, predictive and (randomized) intervention research. Acta Oncologica, 55(11), 1273–1280. https://doi.org/10.1080/0284186X.2016.1189094CrossRefPubMed
28.
go back to reference van de Poll-Franse, L. V., Nicolaije, K. A., Vos, M. C., Pijnenborg, J. M., Boll, D., Husson, O., Ezendam, N. P., Boss, E. A., Hermans, R. H., Engelhart, K. C., Haartsen, J. E., Pijlman, B. M., Feijen, H. W., Mertens, H. J., Nolting, W. E., van Beek, J. J., Roukema, J. A., & Kruitwagen, R. F. (2011). The impact of a cancer Survivorship Care Plan on gynecological cancer patient and health care provider reported outcomes (ROGY Care): Study protocol for a pragmatic cluster randomized controlled trial. Trials, 12(1), Article 256. https://doi.org/10.1186/1745-6215-12-256CrossRefPubMedPubMedCentral
29.
go back to reference Ripping, T. M., Kiemeney, L. A., van Hoogstraten, L. M. C., Witjes, J. A., Aben, K. K. H., Boormans, J., de Reijke, T., Goossens, C. A., Helder, S., Hulshof, M. C. C. M., van Leenders, G. J. L. H., van Leliveld, A. M., Meijer, R. P., Mulder, S., Nooter, R. I., Noteboom, J. L., Oddens, J. R., Smilde, T. J., & Vanderbosch, G. W. J. (2020). Insight into bladder cancer care: Study protocol of a large nationwide prospective cohort study (BlaZIB). BMC Cancer, 20(1), Article 455. https://doi.org/10.1186/s12885-020-06954-7CrossRefPubMedPubMedCentral
30.
go back to reference Vernooij, R. W. M., Cremers, R. G. H. M., Jansen, H., Somford, D. M., Kiemeney, L. A., van Andel, G., Wijsman, B. P., Busstra, M. B., van Moorselaar, R. J. A., Wijnen, E. M., Pos, F. J., Hulshof, M. C. C. M., Hamberg, P., van den Berkmortel, F., van de Hulsbergen- Kaa, C. A., van Leenders, G. J. L. H., Fütterer, J. J., van Oort, I. M., & Aben, K. K. H. (2020). Urinary incontinence and erectile dysfunction in patients with localized or locally advanced prostate cancer: A nationwide observational study. Urologic Oncology: Seminars and Original Investigations, 38(9), 735.e17-735.e25. https://doi.org/10.1016/j.urolonc.2020.05.022CrossRefPubMed
31.
go back to reference Young-Afat, D. A., van Gils, C. H., van den Bongard, H. J. G. D., Verkooijen, H. M., Gernaat, S. A., Gregorowitsch, M. L., May, A. M., Peeters, P. H., van der Pol, C. C., Witkamp, A. J., Pijnappel, R. M., Bijlsma, R. M., Maarse, W., Ausems, M. G., van Diest, P. J., van der Wall, E., van Dalen, T., Burgmans, I. P., & de Roos, M. A. (2017). The Utrecht cohort for Multiple BREast cancer intervention studies and Long-term evaLuAtion (UMBRELLA): Objectives, design, and baseline results. Breast Cancer Research and Treatment, 164(2), 445–450. https://doi.org/10.1007/s10549-017-4242-4CrossRefPubMedPubMedCentral
32.
go back to reference Bonhof, C. S., van de Graaf, D. L., Wasowicz, D. K., Vreugdenhil, G., & Mols, F. (2022). Symptoms of pre-treatment anxiety are associated with the development of chronic peripheral neuropathy among colorectal cancer patients. Supportive Care in Cancer, 30(6), 5421–5429. https://doi.org/10.1007/s00520-022-06971-1CrossRefPubMedPubMedCentral
33.
go back to reference National Cancer Institute. (n.d.). Definition of cancer survivor. https://www.cancer.gov/publications/dictionaries/cancer-terms/def/survivor
34.
go back to reference O’Sullivan, B., Brierley, J., Byrd, D., Bosman, F., Kehoe, S., Kossary, C., Piñeros, M., Van Eycken, E., Weir, H. K., & Gospodarowicz, M. (2017). The TNM classification of malignant tumours-towards common understanding and reasonable expectations. The Lancet Oncology, 18(7), 849–851. https://doi.org/10.1016/S1470-2045(17)30438-2CrossRefPubMedPubMedCentral
35.
go back to reference Charlson, M., Szatrowski, T., Peterson, J., & Gold, J. (1994). Validation of a combined comorbidity index. Journal of Clinical Epidemiology, 47(11), 1245–1251. https://doi.org/10.1016/0895-4356(94)90129-5CrossRefPubMed
36.
go back to reference Aaronson, N., Ahmedzai, S., Bergman, B., Bullinger, M., Cull, A., Duez, N., Filiberti, A., Flechtner, H., & Fleishman, S. (1993). The European-organization-for-research-and-treatment-of-cancer QLQ-C30—a quality-of-life instrument for use in international clinical-trials in oncology. Journal of the National Cancer Institute, 85, 365–376. https://doi.org/10.1093/jnci/85.5.365CrossRefPubMed
37.
go back to reference Kassianos, A. P., & Tsounta, S. (2022). Defining Quality of Life. In A. P. Kassianos (Ed.), Handbook of Quality of Life in Cancer (pp. 3–5). Springer International Publishing. https://doi.org/10.1007/978-3-030-84702-9_1
38.
go back to reference Efficace, F., Cottone, F., Sommer, K., Kieffer, J., Aaronson, N., Fayers, P., Groenvold, M., Caocci, G., Lo Coco, F., Gaidano, G., Niscola, P., Baccarani, M., Rosti, G., Venditti, A., Angelucci, E., Fazi, P., Vignetti, M., & Giesinger, J. (2019). Validation of the European organisation for research and treatment of cancer quality of life questionnaire core 30 summary score in patients with hematologic malignancies. Value in Health, 22(11), 1303–1310. https://doi.org/10.1016/j.jval.2019.06.004CrossRefPubMed
39.
go back to reference Giesinger, J. M., Kieffer, J. M., Fayers, P. M., Groenvold, M., Petersen, M. A., Scott, N. W., Sprangers, M. A., Velikova, G., Aaronson, N. K., EORTC Quality of Life Group. (2016). Replication and validation of higher order models demonstrated that a summary score for the EORTC QLQ-C30 is robust. Journal of Clinical Epidemiology, 69, 79–88. https://doi.org/10.1016/j.jclinepi.2015.08.007CrossRefPubMed
40.
go back to reference Husson, O., de Rooij, B. H., Kieffer, J., Oerlemans, S., Mols, F., Aaronson, N. K., van der Graaf, W. T. A., & van de Poll‐Franse, L. V. (2020). The EORTC QLQ‐C30 summary score as prognostic factor for survival of patients with cancer in the “real‐world”: Results from the population‐based PROFILES registry. The Oncologist, 25(4), e722–e732. https://doi.org/10.1634/theoncologist.2019-0348CrossRefPubMed
41.
go back to reference Gundy, C. M., Fayers, P. M., Groenvold, M., Petersen, M. A., Scott, N. W., Sprangers, M. A. G., Velikova, G., & Aaronson, N. K. (2012). Comparing higher order models for the EORTC QLQ-C30. Quality of Life Research, 21(9), 1607–1617. https://doi.org/10.1007/s11136-011-0082-6CrossRefPubMed
42.
go back to reference Hinz, A., Einenkel, J., Briest, S., Stolzenburg, J.-U., Papsdorf, K., & Singer, S. (2012). Is it useful to calculate sum scores of the quality of life questionnaire EORTC QLQ-C30? European Journal of Cancer Care, 21(5), 677–683. https://doi.org/10.1111/j.1365-2354.2012.01367.xCrossRefPubMed
43.
go back to reference Wright, M. N., & Ziegler, A. (2017). Ranger: A fast implementation of Random Forests for high dimensional data in C++ and R. Journal of Statistical Software, 77(1), 1–17. https://doi.org/10.18637/jss.v077.i01CrossRef
44.
go back to reference R Core Team. (2022). R: A Language and Environment for Statistical Computing [Computer software]. R Foundation for Statistical Computing. https://www.R-project.org/
45.
go back to reference Ho, T. K. (1995). Random Decision Forests. Proceedings of the Third International Conference on Document Analysis and Recognition (Volume 1) - Volume 1, 278–282.
46.
go back to reference McCulloch, W. S., & Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 5(4), 115–133. https://doi.org/10.1007/BF02478259CrossRef
47.
go back to reference Chawla, N., Bowyer, K., Hall, L., & Kegelmeyer, W. (2002). SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research, 16, 321–357. https://doi.org/10.1613/jair.953CrossRef
48.
go back to reference Torgo, L., Ribeiro, R. P., Pfahringer, B., & Branco, P. (2013). SMOTE for Regression. In L. Correia, L. P. Reis, & J. Cascalho (Eds.), Progress in Artificial Intelligence (pp. 378–389). Springer.CrossRef
49.
go back to reference TRIPOD+AI statement: Updated guidance for reporting clinical prediction models that use regression or machine learning methods. (2024). BMJ, 385, q902. https://doi.org/10.1136/bmj.q902
50.
go back to reference Van Rossum, G., & Drake Jr, F. L. (1995). Python reference manual. Centrum voor Wiskunde en Informatica Amsterdam.
51.
go back to reference Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E., & Louppe, G. (2012). Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research, 12.
52.
go back to reference Chen, T., & Guestrin, C. (2016). XGBoost: A Scalable Tree Boosting System (p. 794). https://doi.org/10.1145/2939672.2939785
53.
go back to reference Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D. G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., … Zheng, X. (2016). TensorFlow: A system for large-scale machine learning. Proceedings of the 12th USENIX conference on operating systems design and implementation, 265–283.
54.
go back to reference Berggren, M., Kaati, L., Pelzer, B., Stiff, H., Lundmark, L., & Akrami, N. (2024). The generalizability of machine learning models of personality across two text domains. Personality and Individual Differences, 217, Article 112465. https://doi.org/10.1016/j.paid.2023.112465CrossRef
55.
go back to reference Ben-Hur, A., & Weston, J. (2010). A user’s guide to support vector machines. Methods in Molecular Biology (Clifton, N.J.), 609, 223–239. https://doi.org/10.1007/978-1-60327-241-4_13CrossRefPubMed
56.
go back to reference Greener, J. G., Kandathil, S. M., Moffat, L., & Jones, D. T. (2022). A guide to machine learning for biologists. Nature reviews. Molecular Cell Biology, 23(1), 40–55. https://doi.org/10.1038/s41580-021-00407-0CrossRefPubMed
57.
go back to reference Owen, L. (2022). Hyperparameter Tuning with Python: Boost your machine learning model’s performance via hyperparameter tuning. Packt Publishing Ltd.
58.
go back to reference Breiman, L. (2001). Random Forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324CrossRef
59.
go back to reference Alekseeva, D., Stepanov, N., Veprev, A., Sharapova, A., Lohan, E. S., & Ometov, A. (2021). Comparison of machine learning techniques applied to traffic prediction of real wireless network. IEEE Access. https://doi.org/10.1109/ACCESS.2021.3129850CrossRef
60.
go back to reference Alaka, S., Menon, B., Brobbey, A., Williamson, T., Goyal, M., Demchuk, A., Hill, M., & Sajobi, T. (2020). Functional outcome prediction in ischemic stroke: A comparison of machine learning algorithms and regression models. Frontiers in Neurology. https://doi.org/10.3389/fneur.2020.00889CrossRefPubMedPubMedCentral
61.
go back to reference Chen, J., de Hoogh, K., Gulliver, J., Hoffmann, B., Hertel, O., Ketzel, M., Bauwelinck, M., Van Donkelaar, A., Hvidtfeldt, U. A., & Katsouyanni, K. (2019). A comparison of linear regression, regularization, and machine learning algorithms to develop Europe-wide spatial models of fine particles and nitrogen dioxide. Environment International, 130, Article 104934.CrossRefPubMed
62.
go back to reference Chowdhury, M. Z. I., Leung, A. A., Walker, R. L., Sikdar, K. C., O’Beirne, M., Quan, H., & Turin, T. C. (2023). A comparison of machine learning algorithms and traditional regression-based statistical modeling for predicting hypertension incidence in a Canadian population. Scientific Reports, 13(1), Article 13. https://doi.org/10.1038/s41598-022-27264-xCrossRefPubMedPubMedCentral
63.
go back to reference Churpek, M. M., Yuen, T. C., Winslow, C., Meltzer, D. O., Kattan, M. W., & Edelson, D. P. (2016). Multicenter comparison of machine learning methods and conventional regression for predicting clinical deterioration on the wards. Critical Care Medicine. https://doi.org/10.1097/CCM.0000000000001571CrossRefPubMedPubMedCentral
64.
go back to reference Singal, A. G., Mukherjee, A., Elmunzer, J. B., Higgins, P. D. R., Lok, A. S., Zhu, J., Marrero, J. A., & Waljee, A. K. (2013). Machine Learning Algorithms Outperform Conventional Regression Models in Predicting Development of Hepatocellular Carcinoma. Official Journal of the American College of Gastroenterology | ACG, 108(11). https://journals.lww.com/ajg/fulltext/2013/11000/machine_learning_algorithms_outperform.10.aspx
65.
go back to reference Ye, J., Hua, M., & Zhu, F. (2021). Machine learning algorithms are superior to conventional regression models in predicting risk stratification of COVID-19 patients. Risk Management and Healthcare Policy, 14, 3159–3166. https://doi.org/10.2147/RMHP.S318265CrossRefPubMedPubMedCentral
66.
go back to reference Chirico, A., Lucidi, F., Merluzzi, T., Alivernini, F., Laurentiis, M. D., Botti, G., & Giordano, A. (2017). A meta-analytic review of the relationship of cancer coping self-efficacy with distress and quality of life. Oncotarget. https://doi.org/10.18632/oncotarget.15758CrossRefPubMedPubMedCentral
67.
go back to reference Babaei Rikan, S., Sorayaie Azar, A., Naemi, A., Bagherzadeh Mohasefi, J., Pirnejad, H., & Wiil, U. K. (2024). Survival prediction of glioblastoma patients using modern deep learning and machine learning techniques. Scientific Reports, 14(1), Article 2371. https://doi.org/10.1038/s41598-024-53006-2CrossRefPubMedPubMedCentral
68.
go back to reference Khandelwal, S., & B., L. K. (2016). Development of a predictor model for quality of life in cancer patients with adverse drug reactions due to cancer chemotherapy. Journal of Applied Pharmaceutical Science, 6(5), 022–028.CrossRef
69.
go back to reference Maurice-Stam, H., Oort, F. J., Last, B. F., & Grootenhuis, M. A. (2009). A predictive model of health-related quality of life in young adult survivors of childhood cancer. European Journal of Cancer Care, 18(4), 339–349. https://doi.org/10.1111/j.1365-2354.2007.00916.xCrossRefPubMed
70.
go back to reference de Rooij, B. H., Ezendam, N. P. M., Mols, F., Vissers, P. A. J., Thong, M. S. Y., Vlooswijk, C. C. P., Oerlemans, S., Husson, O., Horevoorts, N. J. E., & van de Poll-Franse, L. V. (2018). Cancer survivors not participating in observational patient-reported outcome studies have a lower survival compared to participants: The population-based PROFILES registry. Quality of Life Research, 27(12), 3313–3324. https://doi.org/10.1007/s11136-018-1979-0CrossRefPubMedPubMedCentral
71.
go back to reference El Haidari, R., Abou-Abbas, L., Nerich, V., & Anota, A. (2020). Factors Associated with Health-Related Quality of Life in Women with Breast Cancer in the Middle East: A Systematic Review. Cancers. https://doi.org/10.3390/cancers12030696CrossRefPubMedPubMedCentral
72.
go back to reference Tiselius, C., Rosenblad, A., Strand, E., & Smedh, K. (2021). Risk factors for poor health-related quality of life in patients with colon cancer include stoma and smoking habits. Health and Quality of Life Outcomes, 19(1), Article 216. https://doi.org/10.1186/s12955-021-01850-5CrossRefPubMedPubMedCentral
73.
go back to reference Østhus, A. A., Aarstad, A. K. H., Olofsson, J., & Aarstad, H. J. (2013). Comorbidity is an independent predictor of health-related quality of life in a longitudinal cohort of head and neck cancer patients. European Archives of Oto-Rhino-Laryngology, 270(5), 1721–1728. https://doi.org/10.1007/s00405-012-2207-0CrossRefPubMed
74.
go back to reference Engel, J., Kerr, J., Schlesinger-raab, A., Eckel, R., Sauer, H., & Hölzel, D. (2003). Predictors of quality of life of breast cancer patients. Acta Oncologica, 42(7), 710–718. https://doi.org/10.1080/02841860310017658CrossRefPubMed
75.
go back to reference Zandbergen, N., de Rooij, B. H., Vos, M. C., Pijnenborg, J. M. A., Boll, D., Kruitwagen, R. F. P. M., van de Poll-Franse, L. V., & Ezendam, N. P. M. (2019). Changes in health-related quality of life among gynecologic cancer survivors during the two years after initial treatment: A longitudinal analysis. Acta Oncologica, 58(5), 790–800. https://doi.org/10.1080/0284186X.2018.1560498CrossRefPubMed
76.
go back to reference Hernán, M. A. (2004). A definition of causal effect for epidemiological research. Journal of Epidemiology and Community Health, 58(4), 265. https://doi.org/10.1136/jech.2002.006361CrossRefPubMedPubMedCentral
77.
go back to reference Arnold, K. F., Davies, V., de Kamps, M., Tennant, P. W. G., Mbotwa, J., & Gilthorpe, M. S. (2020). Reflection on modern methods: Generalized linear models for prognosis and intervention—Theory, practice and implications for machine learning. International Journal of Epidemiology, 49(6), 2074–2082. https://doi.org/10.1093/ije/dyaa049CrossRef
78.
go back to reference Wang, Q., Ma, Y., Zhao, K., & Tian, Y. (2022). A comprehensive survey of loss functions in machine learning. Annals of Data Science, 9(2), 187–212. https://doi.org/10.1007/s40745-020-00253-5CrossRef
79.
go back to reference Langballe, R., Erik, J., Maria, I., Randi Valbjørn, K., Jeanette Haar, E., Mads Nordahl, S., Uffe, B., Ole, H., Susanne Oksbjerg, D., & Pernille Envold, B. (2023). Who are the vulnerable lung cancer patients at risk for not receiving first-line curative or palliative treatment? Acta Oncologica, 62(10), 1301–1308. https://doi.org/10.1080/0284186X.2023.2252581CrossRefPubMed
80.
go back to reference Langballe, R., Svendsen, L., Jakobsen, E., Dalton, S. O., Karlsen, R. V., Iachina, M., Freund, K. M., Leclair, A., Jørgensen, L. B., Skou, S. T., Ehlers, J. H., Torenholt, R., Svendsen, M. N., & Envold Bidstrup, P. (2023). Nurse navigation, symptom monitoring and exercise in vulnerable patients with lung cancer: Feasibility of the NAVIGATE intervention. Scientific Reports, 13(1), Article 22744. https://doi.org/10.1038/s41598-023-50161-wCrossRefPubMedPubMedCentral
81.
go back to reference Espressivo, A., Pan, Z. S., Usher-Smith, J. A., & Harrison, H. (2024). Risk prediction models for oral cancer: A systematic review. Cancers. https://doi.org/10.3390/cancers16030617CrossRefPubMedPubMedCentral