Ensemble-based Classification Models for Predicting Post-Operative Mortality Risk in Coronary Artery Disease

Main Article Content

Anita Brobbey
Peter Faris
Alberto Nettel-Aguirre
Tyler Williamson
Samuel Wiebe
Mingkai Peng
Hude Quan
Tolulope T Sajobi

Abstract

Introduction
There has been an increased demand for more accurate prediction tools to aid clinical decision-making regarding disease diagnosis prognosis for coronary artery disease(CAD) patients. Patients undergoing CABG surgery are older and a larger number have had previous heart surgery. Consequently, mortality after CABG is expected to increase despite procedural advances.


Objectives and Approach
This study aims to compare the predictive performance of random forest(RF) and logistic regression(LR) classifiers for predicting 30-day and 1-year post-operative mortality risk in CAD patients who underwent CABG. Data was obtained by linking the Alberta Provincial Project for Outcome Assessment in Coronary Heart Disease(APPROACH) registry, a prospective longitudinal data of patients undergoing cardiac catheterization in Alberta, Canada, to vital statistics database. All patients who underwent first-time isolated CABG between January 1, 2007 and December 31, 2012 were included in the analysis. Area under the receiver operating curve(AUC) was used to compare the predictive performance of LR and RF regression.


Results
Of the 4,908 eligible subjects who underwent isolated CABG during the study period, mortality estimates of 30-day and 1-year post CABG surgery were 1.59% and 3.85%, respectively. Descriptive analysis revealed that age, sex, hypertension, dialysis, cerebrovascular disease, chronic obstructive pulmonary disease, and chronic heart failure were associated with 30-day and 1-year mortality. The accuracy of the LR and RF regression classifiers in predicting 30-day mortality were 74.1, and 99.7%, respectively. While the accuracy of the former and latter classifiers in predicting 1-year post CABG mortality were 74% and 97.4%, respectively.


Conclusion/Implications
This study shows that RF classifier results in better predictive accuracy than LR in predicting post-operating mortality risk in CAD patients. Machine learning models are potentially usefully for developing clinical prediction models that can be used to aid the monitoring of post-discharge outcomes in the management of cardiovascular diseases.

Introduction

There has been an increased demand for more accurate prediction tools to aid clinical decision-making regarding disease diagnosis prognosis for coronary artery disease(CAD) patients. Patients undergoing CABG surgery are older and a larger number have had previous heart surgery. Consequently, mortality after CABG is expected to increase despite procedural advances.

Objectives and Approach

This study aims to compare the predictive performance of random forest(RF) and logistic regression(LR) classifiers for predicting 30-day and 1-year post-operative mortality risk in CAD patients who underwent CABG. Data was obtained by linking the Alberta Provincial Project for Outcome Assessment in Coronary Heart Disease(APPROACH) registry, a prospective longitudinal data of patients undergoing cardiac catheterization in Alberta, Canada, to vital statistics database. All patients who underwent first-time isolated CABG between January 1, 2007 and December 31, 2012 were included in the analysis. Area under the receiver operating curve(AUC) was used to compare the predictive performance of LR and RF regression.

Results

Of the 4,908 eligible subjects who underwent isolated CABG during the study period, mortality estimates of 30-day and 1-year post CABG surgery were 1.59% and 3.85%, respectively. Descriptive analysis revealed that age, sex, hypertension, dialysis, cerebrovascular disease, chronic obstructive pulmonary disease, and chronic heart failure were associated with 30-day and 1-year mortality. The accuracy of the LR and RF regression classifiers in predicting 30-day mortality were 74.1, and 99.7%, respectively. While the accuracy of the former and latter classifiers in predicting 1-year post CABG mortality were 74% and 97.4%, respectively.

Conclusion/Implications

This study shows that RF classifier results in better predictive accuracy than LR in predicting post-operating mortality risk in CAD patients. Machine learning models are potentially usefully for developing clinical prediction models that can be used to aid the monitoring of post-discharge outcomes in the management of cardiovascular diseases.

Article Details

How to Cite
Brobbey, A., Faris, P., Nettel-Aguirre, A., Williamson, T., Wiebe, S., Peng, M., Quan, H. and Sajobi, T. T. (2018) “Ensemble-based Classification Models for Predicting Post-Operative Mortality Risk in Coronary Artery Disease”, International Journal of Population Data Science, 3(4). doi: 10.23889/ijpds.v3i4.877.

Most read articles by the same author(s)

1 2 3 4 5 6 7 8 9 > >>