Genetic programming outperformed multivariable logistic regression in diagnosing pulmonary embolism

doi:10.1016/j.jclinepi.2003.10.011

Journal of Clinical Epidemiology

Volume 57, Issue 6, June 2004, Pages 551-560

https://doi.org/10.1016/j.jclinepi.2003.10.011

Abstract

Objective

Genetic programming is a search method that can be used to solve complex associations between large numbers of variables. It has been used, for example, for myoelectrical signal recognition, but its value for medical prediction, as in diagnostic and prognostic settings, has not been documented.

Study design and setting

We compared genetic programming and the commonly used logistic regression technique in the development of a prediction model using empirical data from a study on diagnosis of pulmonary embolism. Using part (67%) of the data, we developed and internally validated (using bootstrapping techniques) a diagnostic prediction model by genetic programming and by logistic regression, and compared both on their predictive ability in the remaining data (validation set).
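The design described above (a 67% derivation set, bootstrap-based internal validation, and a held-out validation set) can be sketched roughly as follows. This is a minimal illustration and not the authors' analysis: the data are synthetic, the two binary predictors, the gradient-ascent fit with a fixed number of steps, the optimism estimator, the 20 bootstrap samples, and all function names are assumptions made for the example.

```python
import math
import random

def predict(w, xi):
    """Predicted probability from a logistic model with weights [b0, b1, ...]."""
    z = w[0] + sum(b * v for b, v in zip(w[1:], xi))
    return 1.0 / (1.0 + math.exp(-z))

def fit_logistic(X, y, steps=100, lr=0.5):
    """Fit logistic regression by plain gradient ascent on the log-likelihood."""
    w = [0.0] * (len(X[0]) + 1)
    for _ in range(steps):
        grad = [0.0] * len(w)
        for xi, yi in zip(X, y):
            err = yi - predict(w, xi)
            grad[0] += err
            for j, v in enumerate(xi):
                grad[j + 1] += err * v
        w = [wj + lr * g / len(X) for wj, g in zip(w, grad)]
    return w

def auc(scores, labels):
    """Area under the ROC curve by pairwise comparison (ties count 0.5)."""
    pos = [s for s, t in zip(scores, labels) if t == 1]
    neg = [s for s, t in zip(scores, labels) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def bootstrap_optimism(X, y, rng, n_boot=20):
    """Mean of (AUC on bootstrap sample) - (AUC of the same model on original data)."""
    n, total = len(X), 0.0
    for _ in range(n_boot):
        rows = [rng.randrange(n) for _ in range(n)]   # sample with replacement
        Xb, yb = [X[i] for i in rows], [y[i] for i in rows]
        wb = fit_logistic(Xb, yb)
        total += auc([predict(wb, x) for x in Xb], yb) - auc([predict(wb, x) for x in X], y)
    return total / n_boot

def simulate(n, rng):
    """Synthetic patients with two binary predictors (NOT the study data)."""
    X, y = [], []
    for _ in range(n):
        xi = [float(rng.random() < 0.3), float(rng.random() < 0.2)]
        X.append(xi)
        y.append(int(rng.random() < predict([-0.9, 1.2, 1.0], xi)))
    return X, y

rng = random.Random(0)
X, y = simulate(398, rng)
idx = list(range(len(X)))
rng.shuffle(idx)
cut = round(0.67 * len(X))                            # derivation part of the data
Xd, yd = [X[i] for i in idx[:cut]], [y[i] for i in idx[:cut]]
Xv, yv = [X[i] for i in idx[cut:]], [y[i] for i in idx[cut:]]

w = fit_logistic(Xd, yd)
apparent = auc([predict(w, x) for x in Xd], yd)       # performance on derivation data
optimism = bootstrap_optimism(Xd, yd, rng)            # internal validation
corrected = apparent - optimism                       # optimism-corrected AUC
external = auc([predict(w, x) for x in Xv], yv)       # performance in validation set
print(round(apparent, 3), round(corrected, 3), round(external, 3))
```

The same split and the same bootstrap loop would be applied to any competing modelling method, so that both models are judged on identical validation patients.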

Results

In the validation set, the area under the ROC curve of the genetic programming model was significantly larger (0.73; 95% CI: 0.64–0.82) than that of the logistic regression model (0.68; 95% CI: 0.59–0.77). The calibration of both models was similar, indicating a similar amount of overoptimism.
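To illustrate how a confidence interval around an ROC area can be obtained, the sketch below uses the Hanley and McNeil standard-error approximation. The source does not say which method was used for the reported intervals, so this is a swapped-in technique for illustration. The group sizes (56 patients with PE, 75 without) are assumptions derived from the figures given elsewhere on this page (398 patients, about one third in the validation set, 42.9% with PE), and the function names are hypothetical.

```python
import math

def hanley_mcneil_se(auc, n_pos, n_neg):
    """Approximate standard error of an ROC area (Hanley and McNeil)."""
    q1 = auc / (2.0 - auc)
    q2 = 2.0 * auc * auc / (1.0 + auc)
    var = (auc * (1.0 - auc)
           + (n_pos - 1) * (q1 - auc * auc)
           + (n_neg - 1) * (q2 - auc * auc)) / (n_pos * n_neg)
    return math.sqrt(var)

def auc_ci(auc, n_pos, n_neg, z=1.96):
    """Normal-approximation confidence interval (95% by default)."""
    se = hanley_mcneil_se(auc, n_pos, n_neg)
    return auc - z * se, auc + z * se

# Assumed validation-set composition: 56 with PE, 75 without (see lead-in).
low, high = auc_ci(0.73, n_pos=56, n_neg=75)
print(round(low, 2), round(high, 2))
```

Under these assumed group sizes the interval comes out close to the one reported for the genetic programming model. Because both models were evaluated on the same patients, a formal comparison of the two areas would also have to account for their correlation, which this sketch does not do.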

Conclusion

Although the interpretation of a genetic programming model is less intuitive and this is the first empirical study quantifying its value for medical prediction, genetic programming seems a promising technique to develop prediction rules for diagnostic and prognostic purposes.

Introduction

In the past decade there has been an increased interest in medical prediction research to answer prognostic and diagnostic questions. Generally, such research aims to develop a so-called prediction rule that predicts a particular outcome as accurately as possible, preferably with a minimum of information or predictors. In diagnostic prediction research the outcome is the presence of a disease, and in prognostic prediction research it is the future occurrence of a certain event. With the increasing availability of electronic patient records, interest in medical prediction research will increase further, because electronic records facilitate the application of prediction rules in medical practice.

The most widely used method to develop prediction rules or models in clinical epidemiology is multivariable logistic regression [1], [2], [3], [4], [5], [6], [7]. In the past decade, new methods such as classification and regression trees (CART) and neural networks have been introduced for this purpose. However, it has repeatedly been shown that neither method produces prediction rules that achieve higher predictive accuracy than rules developed by multivariable logistic regression [8], [9], [10], [11], [12], [13]. Recently, the technique of genetic programming has emerged. Genetic programming is a search method inspired by the process of natural evolution, and may be used to solve complex associations between large numbers of variables [14], [15], [16]. This feature also makes genetic programming suitable for prediction research, in which the mutual correlations between various predictors and the outcome must be estimated.
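As a rough illustration of what an evolution-inspired search over prediction models looks like, the toy sketch below evolves small expression trees that map predictors to a risk score. It is not the authors' implementation: the function set, the use of the ROC area as fitness, the population size, the size cap, the selection scheme, and the synthetic data are all assumptions chosen to keep the example short.

```python
import operator
import random

# Function set is an illustrative assumption, not taken from the paper.
OPS = {"add": operator.add, "sub": operator.sub, "mul": operator.mul, "max": max}

def random_tree(rng, n_vars, depth=3):
    """Grow a random expression tree over predictors x[0..n_vars-1]."""
    if depth == 0 or rng.random() < 0.3:
        if rng.random() < 0.7:
            return ("x", rng.randrange(n_vars))
        return ("const", rng.uniform(-1.0, 1.0))
    op = rng.choice(sorted(OPS))
    return (op, random_tree(rng, n_vars, depth - 1), random_tree(rng, n_vars, depth - 1))

def evaluate(tree, x):
    """Score that the tree assigns to one patient's predictor values."""
    if tree[0] == "x":
        return x[tree[1]]
    if tree[0] == "const":
        return tree[1]
    return OPS[tree[0]](evaluate(tree[1], x), evaluate(tree[2], x))

def size(tree):
    """Number of nodes, used to cap model complexity."""
    return 1 if tree[0] not in OPS else 1 + size(tree[1]) + size(tree[2])

def mutate(tree, rng, n_vars):
    """Replace one randomly chosen subtree with a freshly grown one."""
    if tree[0] not in OPS or rng.random() < 0.3:
        return random_tree(rng, n_vars, depth=2)
    if rng.random() < 0.5:
        return (tree[0], mutate(tree[1], rng, n_vars), tree[2])
    return (tree[0], tree[1], mutate(tree[2], rng, n_vars))

def crossover(a, b, rng):
    """Graft a random subtree of b into a random position of a."""
    donor = b
    while donor[0] in OPS and rng.random() < 0.5:
        donor = donor[rng.choice((1, 2))]
    if a[0] not in OPS or rng.random() < 0.3:
        return donor
    if rng.random() < 0.5:
        return (a[0], crossover(a[1], b, rng), a[2])
    return (a[0], a[1], crossover(a[2], b, rng))

def auc(scores, labels):
    """Area under the ROC curve by pairwise comparison (ties count 0.5)."""
    pos = [s for s, t in zip(scores, labels) if t == 1]
    neg = [s for s, t in zip(scores, labels) if t == 0]
    return sum((p > n) + 0.5 * (p == n) for p in pos for n in neg) / (len(pos) * len(neg))

def fitness(tree, X, y):
    return auc([evaluate(tree, x) for x in X], y)

def evolve(X, y, rng, pop_size=30, generations=10, max_size=31):
    """Keep the best quarter each generation and refill with varied offspring."""
    n_vars = len(X[0])
    pop = [random_tree(rng, n_vars) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        ranked = sorted(pop, key=lambda t: fitness(t, X, y), reverse=True)
        history.append(fitness(ranked[0], X, y))
        parents = ranked[: pop_size // 4]
        children = []
        while len(parents) + len(children) < pop_size:
            child = crossover(rng.choice(parents), rng.choice(parents), rng)
            if rng.random() < 0.3:
                child = mutate(child, rng, n_vars)
            if size(child) <= max_size:        # reject overly complex offspring
                children.append(child)
        pop = parents + children
    return max(pop, key=lambda t: fitness(t, X, y)), history

def simulate(n, rng):
    """Synthetic data with an interaction between two predictors (NOT study data)."""
    X, y = [], []
    for _ in range(n):
        x = [float(rng.random() < 0.4) for _ in range(3)]
        X.append(x)
        y.append(int(rng.random() < 0.2 + 0.5 * x[0] * x[1] + 0.2 * x[2]))
    return X, y

rng = random.Random(1)
X, y = simulate(120, rng)
best, history = evolve(X, y, rng)
print(best, round(fitness(best, X, y), 3))
```

Because the best candidates are carried over unchanged, the best fitness in the population cannot decrease from one generation to the next. The fitness here is measured on the same data used for the search, so it is an apparent (optimistic) estimate.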

Genetic programming is not restricted to any fixed model structure. Therefore, it may theoretically result in a model that achieves higher predictive accuracy than a model obtained by ordinary logistic regression analyses. The flexibility of a logistic model can, however, also be increased by including cubic splines for continuous variables (rather than only the linear terms) and interaction terms, potentially enhancing the model's predictive accuracy [4], [6], [17]. This is not commonly done, as it often decreases the interpretability of such a model.
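The contrast between a logistic model with only linear terms and a more flexible one can be written out as follows. The notation is an assumption introduced for this illustration and does not come from the paper: x_1, ..., x_p are the predictors, the beta terms are regression coefficients, f_j stands for a spline function of a continuous predictor, and the gamma terms are interaction coefficients.

```latex
% Logistic model with linear terms only
\operatorname{logit} P(\mathrm{PE}=1 \mid x) = \beta_0 + \sum_{j=1}^{p} \beta_j x_j

% More flexible variant: spline functions f_j for continuous predictors
% and pairwise interaction (product) terms
\operatorname{logit} P(\mathrm{PE}=1 \mid x) = \beta_0 + \sum_{j=1}^{p} f_j(x_j) + \sum_{j<k} \gamma_{jk}\, x_j x_k
```

In the second form the structure is still fixed in advance by the analyst, whereas a genetic programming search is free to combine predictors in forms that were not specified beforehand.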

Like neural networks, genetic programming originates from the field of artificial intelligence and machine learning. But contrary to neural networks, genetic programming requires fewer prior restrictions on the structure of the model. Nevertheless, an often-cited disadvantage of both genetic programming and neural networks is the complexity of the developed prediction model (“black-box character”). Genetic programming has been used in medical research for myoelectrical signal recognition, echocardiography, and medical imaging, but its value for medical prediction has not been documented yet.

Our aim was to compare genetic programming and multivariable logistic regression in the development of a diagnostic prediction model using empirical data from a study on diagnosis of pulmonary embolism (PE). We developed one prediction model using genetic programming and one using multivariable logistic regression, and compared both methods on their predictive ability in an independent data set. The feasibility of applying both prediction models in clinical practice is discussed, as well as the differences between genetic programming and neural networks.

Section snippets

Patients: description of the empirical data set

For the present analysis, data were used from a prospective diagnostic study among 398 secondary-care patients aged 18 years or older who were suspected of having PE. Because the data are used for illustration purposes only, we refer to the literature for details on the design and main results of the study [18], [19], [20]. Briefly, all patients underwent a systematic patient history and physical examination, followed by blood gas analysis, chest radiography, leg ultrasound, ventilation-perfusion lung scanning

Descriptives

There were no major differences in patient characteristics between the derivation and validation set (Table 1). PE was diagnosed in 42.6% of the patients in the derivation set and in 42.9% of those in the validation set. Table 2 shows the univariable associations and distribution of the 10 predictors across patients with and without PE in the derivation set. “History of collapse” and “previous deep venous thrombosis” were the strongest predictors.

Logistic regression

The overall logistic model yielded a ROC area of 0.77

Discussion

To our knowledge, this is the first study to address the value of genetic programming for medical prediction purposes compared to the well-known and widely applied logistic regression technique. Given that the amount of overoptimism in discriminative value was similar for both models as estimated from the bootstrap, the discriminative value of the genetic programming model in the validation set was significantly larger than that of the logistic regression model. Before any form of recalibration

Acknowledgements

We gratefully acknowledge the support by The Netherlands Organization for Scientific Research (ZON-MW904-66-112).

References (28)

J.V Tu
Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes
J Clin Epidemiol
(1996)
K.J Ottenbacher et al.
Comparison of logistic regression and neural networks to predict rehospitalization in patients with stroke
J Clin Epidemiol
(2001)
F.S Resnic et al.
Simplified risk score models accurately predict the risk of major in-hospital complications following percutaneous coronary intervention
Am J Cardiol
(2001)
E.J.R van Beek et al.
A normal perfusion lung scan in patients with clinically suspected pulmonary embolism: frequency and clinical validity
Chest
(1995)
C Stollberger et al.
Multivariate analyses-based prediction rule for pulmonary embolism
Thromb Res
(2000)
E.W Steyerberg et al.
Internal validation of predictive models: efficiency of some procedures for logistic regression analysis
J Clin Epidemiol
(2001)
D.J Spiegelhalter
Probabilistic prediction in patient management and clinical trials
Stat Med
(1986)
D Hosmer et al.
Applied logistic regression
(1989)
R Simon et al.
Statistical aspects of prognostic factor studies in oncology
Br J Cancer
(1994)
F.E Harrell et al.
Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors
Stat Med
(1996)
A Laupacis et al.
Clinical prediction rules. A review and suggested modifications of methodological standards
JAMA
(1997)
F.E Harrell
Regression modeling strategies
(2001)
K.G Moons et al.
Diagnostic studies as multivariable, prediction research
J Epidemiol Community Health
(2002)
H.P Selker et al.
A comparison of performance of mathematical predictive methods for medical diagnosis: identifying acute cardiac ischemia among emergency department patients
J Investig Med
(1995)
