A deep learning model to predict recurrence of atrial fibrillation after pulmonary vein isolation
International Journal of Arrhythmia volume 21, Article number: 19 (2020)
Background and Objectives
The efficacy of radiofrequency catheter ablation (RFCA) in atrial fibrillation (AF) is well established. The standard approach to RFCA in AF is pulmonary vein isolation (PVI). However, a large proportion of patients experiences recurrence of atrial tachyarrhythmia. The purpose of this study is to find out whether the AI model can assess AF recurrence in patients who underwent PVI.
Materials and methods
This study was a retrospective cohort study that enrolled consecutive patients who underwent catheter ablation for symptomatic, drug-refractory AF and PVI. We developed an AI algorithm to predict recurrence of AF after PVI using patient demographics and three-dimensional (3D) reconstructed left atrium (LA) images.
We included 527 consecutive patients in the study. The overall mean LA diameter was 42.0 ± 6.8 mm, and the mean LA volume calculated using 3D reconstructed images was 151.1 ± 46.7 ml. During the follow-up period, atrial tachyarrhythmia recurred in 158 patients. The area under the curve (AUC) of the AI model based on a convolutional neural network (including 3D reconstruction images) was 0.61 (95% confidence interval [CI] 0.53–0.74) using the test dataset. The total test accuracy was 66.3% (57.0–75.6), and the sensitivity was 53.3% (34.8–71.9). The specificity was 73.2% (51.8–75.0), and the F1 score was 52.5% (34.5–66.7).
In this study, we developed an AI algorithm to predict recurrence of AF after PVI by catheter ablation using individual reconstructed LA images. The predictive performance of this AI model was modest; therefore, further large-scale studies are needed.
The efficacy of radiofrequency catheter ablation (RFCA) in atrial fibrillation (AF) is well established. Maintaining a normal sinus rhythm decreases the risk of stroke and heart failure [2, 3]. The standard approach for RFCA of AF is pulmonary vein isolation (PVI) for both paroxysmal and persistent AF because most triggers arise in the pulmonary veins [4, 5]. Nevertheless, many patients experience recurrence of atrial tachyarrhythmia and require repeat ablation. Various strategies for adjuvant substrate modification have been proposed to improve ablation outcomes [6, 7]. Still, the substrate modification group in a previous study did not demonstrate results superior to the PVI-only group in persistent AF. Therefore, patient selection is important in determining whether to perform PVI alone or to add substrate modification.
This study used a deep learning model to predict recurrence of AF in patients who have undergone PVI alone. The purpose of this study is to determine whether an AI model can predict AF recurrence in patients who underwent PVI.
Materials and Methods
This study was a retrospective cohort study that enrolled consecutive patients who underwent catheter ablation with PVI only for symptomatic, drug-refractory AF from August 2013 to December 2016. We developed an AI algorithm to predict recurrence of AF after PVI with or without cavotricuspid isthmus (CTI) ablation. Therefore, we excluded patients who underwent other ablation procedures, including linear ablation or complex fractionated electrogram ablation. Patients with any type of AF, either paroxysmal or persistent, were included. This study was approved by the Institutional Review Board of the Catholic Medical Center, South Korea.
The study cohort was classified into two groups. The first is a recurrence group that demonstrated atrial tachyarrhythmia that lasted for more than 30 s and was detected by a 12-lead electrocardiogram (ECG) or 24-h Holter monitor after a three-month blanking period following a catheter ablation procedure. The second is a non-recurrence group with no history of atrial tachyarrhythmia on ECG or 24-h Holter monitoring after catheter ablation during follow-up.
We used patient demographics and three-dimensional (3D) reconstructed images of the left atrium (LA) as predictive variables to develop the algorithm. The demographic information comprised age, sex, body mass index, other underlying diseases including congestive heart failure, hypertension, diabetes, history of stroke, vascular disease, chronic obstructive pulmonary disease and thyroid disease, and LA size. The LA size was assessed in two ways: echocardiogram and 3D computed tomography (CT) imaging. We measured LA volume during diastolic and systolic phases, including any LA appendages, and calculated the LA ejection fraction. In addition, we obtained 3D reconstructed LA images from a 3D mapping system (Ensite NavX, Abbott) from an anterior–posterior view. Data without 3D reconstructed cardiac images or any of the variables were excluded. All patients were randomly assigned at an 8:2 ratio to either a training group or a test group. The training dataset was used to develop the algorithm, and the testing dataset, which was not used to train the network, was employed to assess the accuracy of the algorithm.
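As an illustration of the random 8:2 assignment described above, the following is a minimal sketch using only the Python standard library. The function name `split_train_test`, the seed, and the cohort of 100 hypothetical patient IDs are assumptions made for this example; the study does not report how the randomization was implemented.

```python
import random


def split_train_test(patient_ids, test_fraction=0.2, seed=0):
    """Randomly partition patient IDs into disjoint training and test lists."""
    ids = list(patient_ids)
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    rng.shuffle(ids)
    n_test = round(len(ids) * test_fraction)
    test_ids = ids[:n_test]    # held out, never used for training
    train_ids = ids[n_test:]   # used to fit the model
    return train_ids, test_ids


# Hypothetical cohort of 100 patient IDs (not the study data)
train_ids, test_ids = split_train_test(range(100))
print(len(train_ids), len(test_ids))  # 80 20
```

Splitting at the patient level, rather than at the image level, keeps both LA views of one patient in the same partition, which is one way to avoid leakage between training and test data.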
Electrophysiological study and ablation procedure
All patients underwent a cardiac CT scan before the procedure. Intracardiac electrograms were filtered at 30–500 Hz with an amplifier in the Prucka Cardio Lab System (GE Healthcare, Milwaukee, WI, USA). Detailed electroanatomical data were obtained from the Ensite NavX (Abbott, St. Paul, MN, USA) 3D mapping system. The circular mapping catheter (Optima, Abbott) and the ablation catheter were advanced through a double trans-septal puncture. All ablation procedures were performed using RF energy with a 4-mm, open, irrigated catheter (Coolflex, Abbott). All four PVs (including the carina lines) were circumferentially ablated for PVI with RF power of up to 25–35 W.
The primary outcome was prediction of atrial tachyarrhythmia recurrence by an artificial intelligence (AI) model in participants who underwent PVI alone. A receiver operating characteristic (ROC) curve was constructed to obtain the area under the curve (AUC), and the accuracy, sensitivity, specificity and F1 score were also assessed.
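For readers unfamiliar with the AUC, it can be read as the probability that a randomly chosen patient with recurrence receives a higher predicted score than a randomly chosen patient without recurrence. The sketch below computes it by direct pairwise comparison. The function name `roc_auc` and the labels and scores are hypothetical and chosen only for illustration; this is not the software used in the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical data: 1 = recurrence, 0 = no recurrence
labels = [1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.6, 0.3, 0.7, 0.4, 0.2, 0.1]
auc = roc_auc(labels, scores)
print(auc)  # 0.75
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation of the two groups.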
Overview of the AI model
We developed two learning and inference models to determine the effect of 3D reconstructed images on prediction of recurrence. The first model was a multimodal deep learning model in which demographic data were used along with 3D reconstructed images (Fig. 1). The other used only demographic data. Both models share a deep neural network (DNN) module with four fully connected hidden layers for processing demographic data. The hidden layers collectively consist of 1024 nodes, and the input layer of demographic data was directly connected to these layers. For the model that uses images, a convolutional neural network (CNN) module based on the VGG16 model was used in the form of transfer learning, and the weights of the VGG16 model were adopted into our CNN module for faster learning with higher prediction accuracy. The 3D images of the LA reconstructed using the Ensite NavX mapping system were input into separate VGG16 models for the anterior and posterior aspects. Another fully connected hidden layer with 1024 nodes was used in our CNN module for ensemble learning of the flattened outputs of the VGG16 models. Finally, a module of two hidden layers with batch normalization was included in both models. In the model that uses both types of data, the outputs of the DNN and CNN modules were merged in this module. Our models were implemented using the Keras framework with a TensorFlow backend.
Statistical analysis was performed using Statistical Package for the Social Sciences (SPSS), version 18.0 (SPSS, Inc., Chicago, IL, USA). Continuous variables were compared using unpaired t test or Wilcoxon rank-sum test, while categorical variables were compared using Chi-squared test or Fisher’s exact test, as appropriate. We assessed the AUC using the ROC curve. A p-value < 0.05 was considered statistically significant.
In total, 527 consecutive patients were included in the study and the mean follow-up duration was 21.5 ± 10.2 months. Among these, 41 patients with missing data were excluded, leaving 486 patients for model development and testing. The overall mean LA diameter was 42.0 ± 6.8 mm, and the mean LA volume calculated using the 3D reconstructed image was 151.1 ± 46.7 ml. During the follow-up period, atrial tachyarrhythmia recurred in 158 patients. As shown in Table 1, the baseline demographic data showed a significant difference between the recurrence and non-recurrence groups. Patients with recurrence had a significantly larger LA, a finding that was consistent across measurement methods, including LA dimension obtained by echocardiography and LA volume determined using the 3D system and CT images. The remaining baseline characteristics are summarized in Table 1.
A deep learning predictive model was developed with 400 cases, and the performance test was conducted on 86 randomly selected patients. The AUC of the AI model based on CNN learning including 3D reconstruction images was 0.61 (95% CI 0.53–0.74) using the test dataset (Fig. 2), and the total test accuracy was 66.3% (range 57.0–75.6). The sensitivity was 53.3% (range 34.8–71.9), the specificity was 73.2% (range 51.8–75.0), and the F1 score was 52.5% (range 34.5–66.7). When the model was tested using only demographic data (without 3D reconstructed images), the AUC was 0.46 (95% CI 0.41–0.53), accuracy was 69.4% (range 62.7–77.6), sensitivity was 7.14% (range 0–18.2), specificity was 85.9% (range 81.1–93.4), and F1 score was 8.9% (range 0–21.3) (Fig. 3). The results indicate that the CNN-based model including LA images outperformed the DNN using only demographic data in AUC, sensitivity and F1 score, although its accuracy and specificity were lower.
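The reported accuracy, sensitivity, specificity and F1 score of the image-based model can be related to one another through a confusion matrix. The article does not report the confusion matrix itself; the counts below (16 true positives, 14 false negatives, 41 true negatives, 15 false positives) are inferred here as one set of whole numbers that sums to the 86 test patients and reproduces the four reported values, and should be read as an assumption rather than as study data.

```python
def classification_metrics(tp, fn, tn, fp):
    """Standard binary classification metrics from confusion-matrix counts."""
    total = tp + fn + tn + fp
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),       # recall for the recurrence class
        "specificity": tn / (tn + fp),
        "f1": 2 * tp / (2 * tp + fp + fn),   # harmonic mean of precision and recall
    }


# Inferred counts (assumption): 30 recurrences and 56 non-recurrences in 86 test patients
metrics = classification_metrics(tp=16, fn=14, tn=41, fp=15)
for name, value in metrics.items():
    print(f"{name}: {100 * value:.1f}%")
# accuracy: 66.3%
# sensitivity: 53.3%
# specificity: 73.2%
# f1: 52.5%
```

Under these assumed counts, about half of the patients with recurrence in the test set would have been missed, which helps explain why the overall accuracy can look acceptable while the sensitivity remains low.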
In this study, we developed an AI algorithm to predict recurrence of AF after PVI-only catheter ablation. This study demonstrated that the performance of the AI model using convolution layers with reconstructed LA images was superior to that of the AI model that used only demographic data including LA diameter and volume.
Catheter ablation is the most effective therapy for rhythm control of AF. However, this approach remains challenging because some patients experience recurrence, and many attempts are being made to improve the outcome. It is well known that the pulmonary veins are an important trigger of paroxysmal AF. Therefore, current guidelines recommend electrical isolation of the PVs as a routine procedure for catheter ablation [6, 7, 9]. To maintain a normal sinus rhythm after the procedure, PVI durability is critical. Several methods have been shown to help achieve a durable PVI, such as confirmation of bidirectional block and a dormant conduction test [10, 11]. Currently, cryoballoon ablation is an alternative method to achieve PVI. In particular, the efficacy of cryoablation with PVI alone is greatest in selected patients, such as those with paroxysmal AF or younger patients with no structural heart disease. Other patients require substrate modification in addition to PVI. Therefore, patient selection is vital in deciding whether to perform PVI alone or in conjunction with an additional procedure, in order to reduce total procedure time and improve the outcome. Sanhoury et al. suggested the CAAP-AF risk scoring system to predict AF recurrence after balloon cryoablation. The AUC of a CAAP-AF score ≥ 5 was 0.71, and it had a sensitivity of 64% and a specificity of 68%. In this study, the AUC of the AI model was 0.61, and the total test accuracy was 66.3%. The sensitivity was 53.3%, and the specificity was 73.2%. Thus, the AUC and sensitivity of the AI model were lower than those reported for the CAAP-AF score, whereas its specificity was slightly higher. Many variables were involved due to the individual characteristics of the study participants, limiting the power of the result. In addition, the purpose of the study was not to make predictions that were limited to objective findings, such as diameter and volume, but instead to study the morphology of the LA itself and the location of the PVs.
An AI model that can learn from reconstructed images may better predict whether the trigger should be targeted or further substrate modification is needed. Therefore, we hypothesized that machine learning would perform better than a conventional statistical analysis. The result of the AI model using 3D images was better than that using only demographic data. This hypothesis may also be supported by variability in atrial fiber architecture. A study of the myofiber architecture of the human atria using high-resolution, 3D diffusion tensor magnetic resonance techniques revealed heterogeneity of transmural fibers and variability in the pattern of atrial architecture. This structural variability may also contribute to atrial rhythm and pump function.
Several limitations were present in our study. First, deep learning relies on large datasets. However, we included only a small, single-center population. Therefore, we could not conduct external validation, and overfitting cannot be excluded. In addition, a small study population is not appropriate for model development. Second, this study population was not randomly selected, and the decision whether to perform PVI alone or in conjunction with another ablation procedure was made at the physician’s discretion. Therefore, there were several opportunities for selection bias, such as toward patients with a smaller LA size or younger age. Despite these limitations, the AI model using LA images showed better predictive performance than the model using demographic data alone, and further large-scale study is needed to confirm our results.
An AI algorithm was developed from AF catheter ablation data, including reconstructed individual LA images, and it may be useful for predicting the need for additional procedures after PVI. However, this AI model did not outperform other methods in predicting recurrence of AF, so further large-scale studies are needed.
Availability of data and materials
RFCA: radiofrequency catheter ablation
PVI: pulmonary vein isolation
AUC: area under the curve
ROC: receiver operating characteristic
DNN: deep neural network
CNN: convolutional neural network
Wilber DJ, Pappone C, Neuzil P, De Paola A, Marchlinski F, Natale A, Macle L, Daoud EG, Calkins H, Hall B, Reddy V, Augello G, Reynolds MR, Vinekar C, Liu CY, Berry SM, Berry DA. Comparison of antiarrhythmic drug therapy and radiofrequency catheter ablation in patients with paroxysmal atrial fibrillation: a randomized controlled trial. JAMA. 2010;303:333–40.
Marrouche NF, Brachmann J, Andresen D, Siebels J, Boersma L, Jordaens L, Merkely B, Pokushalov E, Sanders P, Proff J, Schunkert H, Christ H, Vogt J, Bänsch D. Catheter ablation for atrial fibrillation with heart failure. N Engl J Med. 2018;378:417–27.
Packer DL, Mark DB, Robb RA, Monahan KH, Bahnson TD, Poole JE, Noseworthy PA, Rosenberg YD, Jeffries N, Mitchell LB, Flaker GC, Pokushalov E, Romanov A, Bunch TJ, Noelker G, Ardashev A, Revishvili A, Wilber DJ, Cappato R, Kuck KH, Hindricks G, Davies DW, Kowey PR, Naccarelli GV, Reiffel JA, Piccini JP, Silverstein AP, Al-Khalidi HR, Lee KL. Effect of catheter ablation vs antiarrhythmic drug therapy on mortality, stroke, bleeding, and cardiac arrest among patients with atrial fibrillation: the CABANA randomized clinical trial. JAMA. 2019;321:1261–74.
Haïssaguerre M, Jaïs P, Shah DC, Takahashi A, Hocini M, Quiniou G, Garrigue S, Le Mouroux A, Le Métayer P, Clémenty J. Spontaneous initiation of atrial fibrillation by ectopic beats originating in the pulmonary veins. N Engl J Med. 1998;339:659–66.
Verma A, Jiang CY, Betts TR, Chen J, Deisenhofer I, Mantovan R, Macle L, Morillo CA, Haverkamp W, Weerasooriya R, Albenque JP, Nardi S, Menardi E, Novak P, Sanders P. Approaches to catheter ablation for persistent atrial fibrillation. N Engl J Med. 2015;372:1812–22.
Calkins H, Hindricks G, Cappato R, Kim Y-H, Saad EB, Aguinaga L, Akar JG, Badhwar V, Brugada J, Camm J, Chen P-S, Chen S-A, Chung MK, Nielsen JC, Curtis AB, Davies DW, Day JD, d’Avila A, de Groot NMS, Di Biase L, Duytschaever M, Edgerton JR, Ellenbogen KA, Ellinor PT, Ernst S, Fenelon G, Gerstenfeld EP, Haines DE, Haissaguerre M, Helm RH, Hylek E, Jackman WM, Jalife J, Kalman JM, Kautzner J, Kottkamp H, Kuck KH, Kumagai K, Lee R, Lewalter T, Lindsay BD, Macle L, Mansour M, Marchlinski FE, Michaud GF, Nakagawa H, Natale A, Nattel S, Okumura K, Packer D, Pokushalov E, Reynolds MR, Sanders P, Scanavacca M, Schilling R, Tondo C, Tsao H-M, Verma A, Wilber DJ, Yamane T. 2017 HRS/EHRA/ECAS/APHRS/SOLAECE expert consensus statement on catheter and surgical ablation of atrial fibrillation. Heart Rhythm. 2017;14:e275–444.
Yu HT, Jeong DS, Pak H-N, Park H-S, Kim JY, Kim J, Lee JM, Kim K-H, Yoon NS, Roh S-Y, Oh Y-S, Cho YJ, Shim J. 2018 Korean guidelines for catheter ablation of atrial fibrillation: part II. Int J Arrhythm. 2018;19:235–84.
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. CoRR. 2015;abs/1409.1556.
Kirchhof P, Benussi S, Kotecha D, Ahlsson A, Atar D, Casadei B, Castella M, Diener HC, Heidbuchel H, Hendriks J, Hindricks G, Manolis AS, Oldgren J, Popescu BA, Schotten U, Van Putte B, Vardas P. 2016 ESC Guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. 2016;37:2893–962.
Kim JY, Kim SH, Song IG, Kim YR, Kim TS, Kim JH, Jang SW, Lee MY, Rho TH, Oh YS. Achievement of successful pulmonary vein isolation: methods of adenosine testing and incremental benefit of exit block. J Interv Card Electrophysiol. 2016;46:315–24.
Macle L, Khairy P, Weerasooriya R, Novak P, Verma A, Willems S, Arentz T, Deisenhofer I, Veenhuyzen G, Scavée C, Jaïs P, Puererfellner H, Levesque S, Andrade JG, Rivard L, Guerra PG, Dubuc M, Thibault B, Talajic M, Roy D, Nattel S. Adenosine-guided pulmonary vein isolation for the treatment of paroxysmal atrial fibrillation: an international, multicentre, randomised superiority trial. Lancet. 2015;386:672–9.
Kuck KH, Brugada J, Fürnkranz A, Metzner A, Ouyang F, Chun KR, Elvan A, Arentz T, Bestehorn K, Pocock SJ, Albenque JP, Tondo C. Cryoballoon or radiofrequency ablation for paroxysmal atrial fibrillation. N Engl J Med. 2016;374:2235–45.
Sanhoury M, Moltrasio M, Tundo F, Riva S, Dello Russo A, Casella M, Tondo C, Fassini G. Predictors of arrhythmia recurrence after balloon cryoablation of atrial fibrillation: the value of CAAP-AF risk scoring system. J Interv Card Electrophysiol. 2017;49:129–35.
Pashakhanloo F, Herzka DA, Ashikaga H, Mori S, Gai N, Bluemke DA, Trayanova NA, McVeigh ER. Myofiber architecture of the human atria as revealed by submillimeter diffusion tensor imaging. Circ Arrhythm Electrophysiol. 2016;9:e004133.
This study was supported by a grant from the Korean Heart Rhythm Society 2019.
Ethical approval and consent to participate
The study protocol was approved by the Institutional Review Board of the Catholic Medical Center.
Consent for publication
All authors agree with publication of the manuscript.
The authors declare that they have no competing interests.
Cite this article
Kim, J.Y., Kim, Y., Oh, GH. et al. A deep learning model to predict recurrence of atrial fibrillation after pulmonary vein isolation. Int J Arrhythm 21, 19 (2020). https://doi.org/10.1186/s42444-020-00027-3