Date: Monday, November 8, 2021
Session Type: Poster Session C
Session Time: 8:30AM-10:30AM
Background/Purpose: Nearly 20% of pregnancies in patients with systemic lupus erythematosus (SLE) result in an adverse pregnancy outcome (APO); early identification of women with SLE who are at high risk of APO is vital. We previously derived a risk model for APO using logistic regression and data from the PROMISSE Study, a large multi-center, multi-ethnic/racial study of APO in women with mild/moderate SLE and/or antiphospholipid antibodies (aPL). While this highly interpretable regression model showed promising predictive performance, we sought to determine whether novel and increasingly popular machine learning (ML) approaches would enhance APO risk prediction by using all available predictors and capturing potentially complex relationships such as interactions or higher-order terms. We compared logistic regression modeling to LASSO, a regression approach that handles high dimensionality and correlated predictors through shrinkage of estimated coefficients, as well as several “black box” ML algorithms. ML techniques are well suited to high-dimensional data, require no variable selection, and, unlike regression-based approaches, are able to explore complex relationships without explicit input by the user.
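For readers unfamiliar with the shrinkage mentioned above, the standard L1-penalized (LASSO) logistic regression objective can be written as follows. This is the textbook formulation, not necessarily the exact parameterization used in this study; the symbols n (subjects), p (predictors), x_i, y_i, and the tuning parameter λ are notation introduced here for illustration.

```latex
\[
(\hat{\beta}_0, \hat{\beta}) \;=\; \arg\min_{\beta_0,\,\beta}
\left\{
  -\frac{1}{n}\sum_{i=1}^{n}\Bigl[\, y_i\,\eta_i - \log\bigl(1 + e^{\eta_i}\bigr) \Bigr]
  \;+\; \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert
\right\},
\qquad
\eta_i = \beta_0 + x_i^{\top}\beta .
\]
```

Larger values of λ shrink more coefficients exactly to zero, which is how LASSO performs variable selection and estimation in a single step; λ is typically chosen by cross-validation.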
Methods: We used the original PROMISSE data (41 predictor variables from 385 subjects) with APO (71/385, 18.4%) defined as preterm delivery due to placental insufficiency or preeclampsia, fetal or neonatal death, or fetal growth restriction. Logistic regression with stepwise selection (LR-S) was compared to LASSO, random forest (RF), a neural network (NN) with 2 hidden neurons, a support vector machine with a radial basis function (RBF) kernel (SVMRBF), and gradient boosting (GB). To summarize discrimination, we present the cross-validated area under the receiver operating characteristic curve (AUC), along with sensitivity (Sn) and specificity (Sp) at an optimal cut-point.
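The discrimination summaries named above can be illustrated with a minimal Python sketch. It assumes the inputs are out-of-fold (cross-validated) predicted risks and observed APO labels, and it assumes the "optimal" cut-point is the one maximizing Youden's J (Sn + Sp - 1); the abstract does not state which criterion was used. The function names and the toy data are hypothetical and are not taken from the PROMISSE analysis.

```python
from typing import List, Tuple


def auc(scores: List[float], labels: List[int]) -> float:
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def youden_cut_point(scores: List[float], labels: List[int]) -> Tuple[float, float, float]:
    """Return (threshold, sensitivity, specificity) for the threshold that
    maximizes Youden's J. A case is classified positive when its score is
    >= threshold. Assumption: Youden's J defines 'optimal' here."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one positive and one negative case")
    best = (0.0, 0.0, 0.0)
    best_j = float("-inf")
    for t in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= t)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < t)
        sn, sp = tp / n_pos, tn / n_neg
        j = sn + sp - 1.0
        if j > best_j:  # strict '>' keeps the lowest threshold among ties
            best_j, best = j, (t, sn, sp)
    return best


# Toy, made-up predicted risks and outcomes (1 = APO, 0 = no APO).
toy_scores = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
toy_labels = [0, 0, 0, 1, 0, 1, 1, 1]

print("AUC:", auc(toy_scores, toy_labels))
print("cut-point, Sn, Sp:", youden_cut_point(toy_scores, toy_labels))
```

In a real comparison, the same two functions would be applied to each classifier's out-of-fold predictions so that all models are judged on data they were not trained on.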
Results: Regression-based classifiers confirmed the predictors of APO identified in our previously reported model: non-white race, use of anti-hypertensive medication, low platelets, SLE disease activity, lupus anticoagulant positivity (LAC+), and high diastolic blood pressure (DBP). RF additionally revealed two novel interaction variables that increased APO risk: LAC+ with anti-β2GPI IgM, and high DBP with low C3. LR-S and LASSO had similar overall discrimination (AUC=0.75 vs. 0.77), but LASSO had higher sensitivity (Sn=0.71 vs. 0.65). The ML classifiers RF and SVMRBF performed similarly well (AUC=0.77-0.78), while NN and GB were inferior.
Conclusion: Several popular ML algorithms did not provide meaningful improvements to the previously identified model for APO prediction. The strong relative performance of regression-based models with this large and well-characterized clinical data set is notable as these models are highly interpretable, well-understood, and generally require fewer variables to generate a risk prediction. New clinical and laboratory markers may improve predictions in the future.
To cite this abstract in AMA style: Fazzari M, Guerra M, Salmon J, Kim M. Predicting Adverse Pregnancy Outcomes in Women with Systemic Lupus Erythematosus: A Comparison of Machine Learning Methods [abstract]. Arthritis Rheumatol. 2021; 73 (suppl 10). https://acrabstracts.org/abstract/predicting-adverse-pregnancy-outcomes-in-women-with-systemic-lupus-erythematosus-a-comparison-of-machine-learning-methods/. Accessed December 6, 2021.
ACR Meeting Abstracts - https://acrabstracts.org/abstract/predicting-adverse-pregnancy-outcomes-in-women-with-systemic-lupus-erythematosus-a-comparison-of-machine-learning-methods/