- Research
- Open access
- Published:
Machine learning-based prediction of new onset of atrial fibrillation after mitral valve surgery
International Journal of Arrhythmia volume 25, Article number: 18 (2024)
Abstract
Background
New-onset postoperative atrial fibrillation (nPOAF) is a common complication after cardiac surgery (30–50%), being associated with unfavorable long-term outcomes. Using the Society of Thoracic Surgeons National Adult Cardiac Database, we used machine learning (ML) to predict nPOAF and related 30-day outcomes following mitral valve (MV) surgery. A total of 27,856 MV operations were performed at 910 centers between 7/1/2017 and 6/30/2020 on patients without AF or a prior permanent pacemaker. The primary endpoint was nPOAF postoperatively. ML techniques utilized included penalized logistic regression, gradient boosting, decision trees, and random forests.
Results
The overall incidence of nPOAF was 35.4% and that of new pacemaker insertion was 5.6%. Patients who developed nPOAF were older (67 ± 10 vs 60 ± 13 years), had more mitral valve stenosis (14.1% vs 11.7%), and hypertension (72.1% vs 63.3%). They underwent more mitral valve replacement (39.1% vs 32.7%) and coronary artery bypass grafting (23.9% vs 16%). For predicting nPOAF, ML methods offer sensitivity, specificity and precision superior to logistic regression. The accuracy rate was identical with penalized and non-penalized logistic regression (0.672).
Conclusions
Predicting nPOAF and its short-term sequelae following MV surgery remains highly challenging. Machine learning methods offer a moderate degree of improvement in predicting nPOAF even in large national-level studies, in the absence of multi-modal data, such as real-time wearables data, electrocardiograms, heart rhythm monitoring, or cardiac imaging.
Background
Atrial fibrillation (AF) is the most common heart rhythm dysfunction in the USA, affecting more than 2 million individuals [1]. Its prevalence is expected to increase in the coming decades [1].
New-onset postoperative AF (POAF) is a common complication (30–50%) after cardiac surgery [1, 2]. It is associated with unfavorable near-term and long-term outcomes, including a higher risk of stroke, prolonged hospital length of stay, and strained hospital resources [3,4,5]. It is therefore important to develop methods to predict its occurrence. In our study, we adopted the Society of Thoracic Surgeons (sts.org) definition for new-onset POAF: the occurrence of any postoperative, in-hospital, atrial fibrillation/flutter episode longer than 1 h and/or requiring treatment [6, 7]. This definition applies only to patients who were not in AF at the start of surgery. Preoperative AF status was the first rhythm documented on the anesthesia record upon entry to the operating room. POAF diagnosis tools included ECG recording (1 or more leads), continuous ECG monitoring for 48–72 h postoperatively, loop memory monitors, symptom event monitors, patch recorders, or implantable loop recorders.
The 2017 HRS/EHRA/ECAS/APHRS/SOLACE expert consensus statement on catheter and surgical ablation of atrial fibrillation identifies modifiable (e.g., hypertension, obesity, alcohol consumption) and non-modifiable (e.g., age, sex, race, family history) risk factors for developing atrial fibrillation during the life course [1]. Enhancing the ability to predict the risk of new-onset AF following cardiac surgery may lead to better individualized treatment strategies [1]. State-of-the art machine learning methods have produced excellent results in surgical outcome prediction [8,9,10]. Using data from the Society of Thoracic Surgeons (STS) Adult Cardiac Surgery Database (ACSD) retrospectively for patients undergoing MV surgery, we sought to determine whether machine learning methods produce superior new-onset POAF predictions compared to standard methods.
Study population
The STS National ACSD version 2.9 was queried from July 1, 2017, to June 30, 2020. Deidentified data were obtained via provider user files, and included demographics, comorbid conditions, pre-/intra-/postoperative characteristics, and 30-day outcomes. We included patients age ≥ 18 years who underwent elective surgical MV repair or replacement (with or without tricuspid valve surgery and PFO closure) via full conventional sternotomy, partial sternotomy, right thoracotomy, or robotic initial operative approach. We excluded patients who had: (1) a history of AF or a prior permanent pacemaker, (2) a concomitant procedure for AF (cardioversion, catheter ablation), (3) a transcatheter MV repair or replacement procedure, (4) a cardiac congenital or concomitant surgical procedure (aortic or pulmonic valve surgery, aortic aneurysm), (5) a heart/heart–lung transplant or the implantation of a VAD/temporary/permanent assist device, (6) a percutaneous or port access operative approach, or (7) other non-cardiac procedures. In Figure 1, Consolidated Standards of Reporting Trials (CONSORT) diagram [11, 12] delineates the study cohort. The primary endpoint was the occurrence of de novo POAF postoperatively.
Missing data
For binary variables (‘Yes/No’), missingness was equated with a negative response (‘No’). Missing height and weight (n = 8 each), perfusion, and cross-clamp times (n = 59 and 89, respectively) were mean-imputed. Mode imputation was used when gender information was missing (n = 2).
Statistical analyses
Data were summarized using means/standard deviations, medians/interquartile ranges, and frequency counts/percentages, as appropriate. Group comparisons, including by POAF status, were based on the two-sample t test with unequal variances, Wilcoxon’s rank sum, and the Chi-square test.
To predict de novo POAF, we compared standard methodology (logistic regression) to several machine learning (ML) techniques: penalized logistic regression, random forest, and extreme gradient boosting (XGBoost) [13]. Preoperative MV lesion set information is presented in Supplemental Table 1. The list of variables used to predict POAF is available in Supplemental Table 2 and includes clinically important pre- and intraoperative variables available. Training and testing datasets have been created by randomly splitting the study data in 70:30 ratio. Classification error calculated on the test dataset was defined as the number of incorrect predictions (false negatives or false positives) divided by the size of the validation dataset. Method performance was assessed and compared across the different classification approaches based on the following optimal threshold measures: the concordance index or the area under the receiver operating characteristic (AUROC) curve, the F1 score, precision (or positive predicted value (PPV)), recall (sensitivity), accuracy, specificity, and negative predicted value (NPV). Test data permutation feature importance has been calculated based on the algorithm developed by Fisher et al. (2019) [14] and uses classification error as loss function. Statistical significance was declared at two-sided 5% level, and there were no adjustments for multiplicity. All analyses have been completed in SAS v 9.4 (SAS Institute, Cary, NC) and R v.4.1 (www.r-project.org) using package mlr3 [15]. Test data hyperparameter tuning was performed using the default options in mlr3, using a grid search. For in-depth detail, please refer to mlr3tuning.mlr-org.com.
Results
During the study period, a total of 28,856 MV operations were performed at 910 centers on patients. Of them, 9,863 (35.4%) experienced new-onset POAF. Risk factors for POAF included older age, diabetes, dyslipidemia, hypertension, cerebrovascular disease, prior myocardial infarction, sleep apnea, and chronic lung disease. Patients with POAF were more often on preoperative beta-blocker medication and had a higher STS PROM score (Table 1). Patients who developed POAF had longer postoperative hospital length of stay (Table 3). They experienced more complications postoperatively: higher stroke rates, more bleeding, more prolonged ventilation, and more renal failure (Table 2). Thirty-day mortality, hospital readmission, and arrhythmia-related hospital readmission rates were higher among patients who experienced POAF (Table 3).
Table 4 presents a comparison of de novo POAF prediction via ML and standard methods. The highest concordance index (AUROC) was achieved by penalized and non-penalized logistic regression models. Accuracy was relatively similar across methodologies, and numerically largest for logistic regression (0.667). XGBoost had the highest recall (sensitivity) at 0.311, and the highest F1 score. Random forest had the highest specificity (0.926) and precision (0.585). Feature importance (Supplemental Table 3 includes top 10%) reveals that all methods employed identify older age at surgery as top risk factor for postoperative nPOAF. Other risk factors identified were valvular disease (MV stenosis, AV/TV insufficiency, PV disease), as well as intraoperative characteristics (initial operative approach, coronary artery bypass grafting, perfusion time).
Discussion
There is vast literature identifying older age, surgery for valvular heart disease, and prior history of major cardiovascular disease as independent risk factors for postoperative AF following adult cardiac surgery. These findings were confirmed independently by Auer et al. (2005) [16], Omae and Kanmura (2012) [17], Greenberg et al. (2017) [18], Rezaei et al. (2020) [19], and Lopes and Agrawal (2022) [20], among others. Based on feature importance quantification, our study further confirms that patient older age, heart valve disease (MV stenosis, AV/TV insufficiency, PV disease), and intraoperative elements (initial operative approach, coronary artery bypass grafting, perfusion time) are associated with a greater risk of de novo postoperative POAF.
In our study, machine learning methods for predicting POAF performed better than standard statistical methodology in terms of precision, recall, F1 score, and specificity; their performance was comparable in terms of the other metrics, including the AUROC. Although artificial intelligence-enabled electrocardiogram interpretation may further benefit POAF prediction accuracy, scaling up such capabilities remains highly challenging [21,22,23,24].
In the current study, there were gains associated with the use of ML methods, but also limitations. The informational content available to predict POAF appears to be insufficient, or at least, insufficiently nonlinear in nature, to capitalize on the advantages of machine learning methods. These findings suggest that other multi-model features may need to be collected routinely to predict POAF better. They may include proteomic and multiomic molecular data [25,26,27,28], and real-time electrocardiograms and photoplethysmograms [29,30,31]. Growing capabilities of 24/7 wearable devices offer novel insights into AF pathogenesis by providing real-time physiological data, such as body temperature, pH, physical activity phenotype, and sleep patterns. Novel sweat monitors inform in real time about electrolyte, metabolic, and stress biomarkers, which could be critical to predicting POAF and other life-threatening arrhythmias [31, 32]. Integration of wearable devices into the clinical flow in the near future is progressing slowly due to regulatory challenges and lack of data standards [33,34,35,36,37]. Presently, fitness and consumer electronics industries lead the way in developing wearable physiological monitors, which offer ECG/PPG sensors and ML-enabled algorithms for detection of AF in general population. Close partnership among clinical, engineering, regulatory, and industrial communities, is needed to accelerate this potentially groundbreaking technology, which will likely revolutionize real-time detection, prediction, and prevention of life-threatening heart rhythm disorders. Leadership of professional clinical societies, such as the American Heart Association and the American College of Cardiology is needed to foster such a partnership.
Study limitations are those inherent to retrospective observational registries and include incomplete data reported by participants and the possibility of residual confounding. Hospital-to-hospital variation is a source of variability that undoubtedly plays an important role in limiting the ability to predict POAF.
Conclusions
Predicting the occurrence of POAF using machine learning methods remains challenging for multiple reasons. This study of STS ACS national registry data suggests that a further expansion of multi-modal real-time data sources may improve POAF prediction. In practice, this will be challenging due to resource allocation, regulatory aspects, and operational considerations.
Availability of data and materials
Deidentified data used in this research project were provided by The Society of Thoracic Surgeons’ National Database Participant User File Research Program. Data analysis was performed at the investigators’ institution.
Abbreviations
- ACSD:
-
Adult cardiac surgery database
- AF:
-
Atrial fibrillation
- AUROC:
-
Area under the receiver operating characteristic curve;
- AV:
-
Aortic valve
- CONSORT:
-
Consolidated standards of reporting trials
- nPOAF:
-
De novo (new) postoperative atrial fibrillation
- NPV:
-
Negative predicted value
- ML:
-
Machine learning
- MV:
-
Mitral valve
- PPV:
-
Positive predicted value
- PV:
-
Pulmonic valve
- STS:
-
Society of Thoracic Surgeons
- TV:
-
Tricuspid valve
References
Alonso A, Aparicio FH, Benjamin EJ, et al. Heart disease and stroke statistics—2021 update. Circulation. 2021;143:e254–743.
Calkins H, Hindricks G, Cappato R, et al. 2017 HRS/EHRA/ECAS/APHRS/SOLAECE expert consensus statement on catheter and surgical ablation of atrial fibrillation. Ep Europace. 2018;20(1):e1–160.
Brent Mitchell L. Incidence, timing and outcome of atrial tachyarrhythmias after cardiac surgery. In: Steinberg JS, editor. Atrial fibrillation after cardiac surgery. Boston: Springer; 2000. p. 37–50. https://doi.org/10.1007/978-0-585-28007-3_3.
Aranki SF, Shaw DP, Adams DH, Rizzo RJ, et al. Predictors of atrial fibrillation after coronary artery surgery: current trends and impact on hospital resources. Circ. 1996;94(3):390–7.
Mathew JP, Parks R, Savino JS, Friedman AS, et al. Atrial fibrillation following coronary artery bypass graft surgery: predictors, outcomes, and resource utilization. JAMA. 1996;276(4):300–6.
Adult Cardiac Surgery Database Collection. 2019. Accessed August 21, 2024, at https://www.sts.org/registries-research-center/sts-national-database/adult-cardiac-surgery-database/data-collection.
Matos JD, McIlvaine S, Grau-Sepulveda M, Jawitz OK, Brennan JM, Khabbaz KR, Sellke FW, Yeh R, Zimetbaum P. Anticoagulation and amiodarone for new atrial fibrillation after coronary artery bypass grafting: prescription patterns and 30-day outcomes in the United States and Canada. J Thor Cardiovasc Surg. 2021;162(2):616–24.
Auer J, Weber T, Berent R, Ng CK. Postoperative atrial fibrillation independently predicts prolongation of hospital stay after cardiac surgery. J Cardiovasc Surg. 2005;46(6):583.
Jalali A, Lonsdale H, Do N, et al. Deep learning for improved risk prediction in surgical outcomes. Sci rep. 2020;10(1):1–13.
Benedetto U, Dimagli A, Sinha S, et al. Machine learning improves mortality risk prediction after cardiac surgery: systematic review and meta-analysis. J Thor Cardiovasc Surg. 2022;163(6):2075–87.
Moher D, Hopewell S, Schulz KF, et al. CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials. Int J Surg. 2012;10(1):28–55.
Schulz KF, Altman DG, Moher D, CONSORT Group. CONSORT 2010 statement: updated guidelines for reporting parallel group randomised trials. Int J Pharm Pharmacoth. 2010;1(2):100–7.
Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001;29:1189–232.
Fisher A, Rudin C, Dominici F. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. J Mach Learn Res. 2019;20(177):1–81.
Becker M, Schratz P. mlr3spatial: support for spatial objects within the 'mlr3' ecosystem. https://mlr3spatial.mlr-org.com, https://github.com/mlr-org/mlr3spatial; 2023.
Auer J, Weber T, Berent R, Ng CK, Lamm G, Eber B. Risk factors of postoperative atrial fibrillation after cardiac surgery. J Card Surg. 2005;20(5):425–31.
Omae T, Kanmura Y. Management of postoperative atrial fibrillation. J Anesth. 2012;26:429–37.
Greenberg JW, Lancaster TS, Schuessler RB, Melby SJ. Postoperative atrial fibrillation following cardiac surgery: a persistent complication. Eur J Cardio-Thor Surg. 2017;52(4):665–72.
Rezaei Y, Peighambari MM, Naghshbandi S, Samiei N, Ghavidel AA, Dehghani MR, Haghjoo M, Hosseini S. Postoperative atrial fibrillation following cardiac surgery: from pathogenesis to potential therapies. Am J Cardiovasc Drugs. 2020;20:19–49.
Lopes LA, Agrawal DK. Post-operative atrial fibrillation: current treatments and etiologies for a persistent surgical complication. J Surg Res. 2022;5(1):159.
Sehrawat O, Kashou AH, Noseworthy PA. Artificial intelligence and atrial fibrillation. J Cardiovasc Electrophys. 2022;33(8):1932–43.
Heckman AJ, Carter RE, Adedinsewo D, et al. Utilizing artificial intelligence to predict post-operative atrial fibrillation in the non-cardiac transplant population. J Am Coll Card. 2023;7(81):2235.
Noseworthy PA, Attia ZI, Behnken EM, et al. Artificial intelligence-guided screening for atrial fibrillation using the electrocardiogram in sinus rhythm. Circ. 2022;8(146):A10856.
Noseworthy PA, Attia ZI, Behnken EM, et al. Artificial intelligence-guided screening for atrial fibrillation using electrocardiogram during sinus rhythm: a prospective non-randomised interventional trial. Lancet. 2022;8(400):1206–12.
Kornej J, Hanger VA, Trinquart L, Ko D, Preis SR, Benjamin EJ, Lin H. New biomarkers from multiomics approaches: improving risk prediction of atrial fibrillation. Cardiovasc Res. 2021;117(7):1632–44.
Zhang H, Wang L, Yin D, Zhou Q, Lv L, Dong Z, Shi Y. Integration of proteomic and metabolomic characterization in atrial fibrillation-induced heart failure. BMC Gen. 2022;23(1):789.
Ko D, Benson MD, Ngo D, Yang Q, Larson MG, Wang TJ, Trinquart L, McManus DD, Lubitz SA, Ellinor PT, Vasan RS. Proteomics profiling and risk of new-onset atrial fibrillation: Framingham Heart Study. J Am Heart Assoc. 2019;8(6):e010976.
Guo Y, Wang H, Zhang H, Liu T, Liang Z, Xia Y, Yan L, Xing Y, Shi H, Li S, Liu Y. Mobile photoplethysmographic technology to detect atrial fibrillation. J Am Coll Cardiol. 2019;74(19):2365–75.
Chan PH, Wong CK, Poh YC, Pun L, Leung WW, Wong YF, Wong MM, Poh MZ, Chu DW, Siu CW. Diagnostic performance of a smartphone-based photoplethysmographic application for atrial fibrillation screening in a primary care setting. J Am Heart Assoc. 2016;5(7):e003428.
Kwon S, Hong J, Choi EK, Lee E, Hostallero DE, Kang WJ, Lee B, Jeong ER, Koo BK, Oh S, Yi Y. Deep learning approaches to detect atrial fibrillation using photoplethysmographic signals: algorithms development study. JMIR mHealth uHealth. 2019;7(6):e12770.
Fukuma N, Hasumi E, Fujiu K, Waki K, Toyooka T, Komuro I, Ohe K. Feasibility of a T-shirt-type wearable electrocardiography monitor for detection of covert atrial fibrillation in young healthy adults. Sci Rep. 2019;9(1):11768.
Vairo D, Bruzzese L, Marlinge M, Fuster L, Adjriou N, Kipson N, Brunet P, Cautela J, Jammes Y, Mottola G, Burtey S. Towards addressing the body electrolyte environment via sweat analysis: pilocarpine iontophoresis supports assessment of plasma potassium concentration. Sci Rep. 2017;7(1):11801.
Bayoumy K, Gaber M, Elshafeey A, Mhaimeed O, Dineen EH, Marvel FA, Martin SS, Muse ED, Turakhia MP, Tarakji KG, Elshazly MB. Smart wearable devices in cardiovascular care: where we are and how to move forward. Nature Rev Cardiol. 2021;18(8):581–99.
Izmailova ES, Wagner JA, Perakslis ED. Wearable devices in clinical trials: hype and hypothesis. Clin Pharm Therap. 2018;104(1):42–52.
Arandia N, Garate JI, Mabe J. Embedded sensor systems in medical devices: requisites and challenges ahead. Sensors. 2022;22(24):9917.
Kasoju N, Remya NS, Sasi R, Sujesh S, Soman B, Kesavadas C, Muraleedharan CV, Varma PH, Behari S. Digital health: trends, opportunities and challenges in medical devices, pharma and bio-technology. CSI Trans on ICT. 2023;1(1):11–30.
Cohen IG, Kramer DB, Adler-Milstein J, Shachar C. Digital health care outside of traditional clinical settings: ethical, legal, and regulatory challenges and opportunities. Cambridge University Press; 2024.
Funding
Funding for this study was provided from the Frederick Henry Prince Memorial Fund on behalf of the Wood-Prince Family.
Author information
Authors and Affiliations
Contributions
ACA involved in study design and conceptualization, data preparation and analysis, and manuscript writing. SS, JK, and IRE involved in study design and conceptualization and manuscript writing. JLC, CM, AC, SCM involved in study design and conceptualization.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
ACA, SS, CM, AC, JK: none. JLC: Adagio Medical, Inc.—Laguna Hills, California (PRIVATE) Board of Directors, Stockholder, Consultant; Atricure, Inc.—Mason, Ohio (PUBLIC) Stockholder, Consultant; PAVmed, Inc.—New York City, NY (PUBLIC) Board of Directors, Stockholder; Lucid Diagnostics, Inc.—New York City, NY (PUBLIC) Board of Directors, Stockholder; PotentiaMetrics, Inc.—St. Louis, Missouri (PRIVATE) Clinical Advisory Board, Stockholder. IRE: Consultant fees from AliveCor, Samsung, Zoll. NuSera Biosystems: co-founder and stockholder. ER: NuSera Biosystems chief medical officer and stockholder. SCM: Consultant fees and research funding from Edwards, Artivion, Terumo. PMM: Edwards Lifesciences: speaking fees and royalties; Medtronic and Atricure: speaking fees; Abbott: Surgical primary investigator REPAIR-MR Trial (unpaid); advisory board.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Andrei, AC., Cox, J.L., Shah, S. et al. Machine learning-based prediction of new onset of atrial fibrillation after mitral valve surgery. Int J Arrhythm 25, 18 (2024). https://doi.org/10.1186/s42444-024-00127-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s42444-024-00127-4