ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2025
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • 2020-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 2572

Multi-Modal Machine Learning Prediction and Phenotyping of Systemic Lupus Erythematosus Using Longitudinal EHR and Genomic Data from the All of Us Program

Hunter Sporn1, Roshni Parulekar-Martins1, Haopeng Wang1, Xinran Yu1, Jeong Yee2, Youngmin Kim3, Jing Cui4 and Karen H. Costenbader5, 1Massachusetts Institute of Technology, Boston, 2Sungkyunkwan University, Suwon, MA, South Korea, 3Brigham and Women's Hospital, Boston, MA, 4Brigham and Women's Hospital/ Harvard Medical School, Boston, 5Harvard Medical School and Brigham and Women's Hospital, Boston, MA

Meeting: ACR Convergence 2025

Keywords: Biostatistics, genomics, longitudinal studies, lupus-like disease, Systemic lupus erythematosus (SLE)

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Tuesday, October 28, 2025

Title: Abstracts: Epidemiology & Public Health II (2567–2572)

Session Type: Abstract Session

Session Time: 2:15PM-2:30PM

Background/Purpose: Timely diagnosis and clinically meaningful stratification of systemic lupus erythematosus (SLE) remain major unmet needs. Existing risk models rely on limited genetic or lifestyle predictors and are often built on non-representative cohorts. Phenotyping studies, meanwhile, typically use curated registries that fail to capture the real-world complexity of SLE presentations. The NIH All of Us program, a US-wide cohort, offers an unprecedented opportunity to integrate genetic risk scores (GRS), structured electronic health record (EHR) data, using unsupervised learning at scale to address both challenges.

Methods: Within All of Us (version 8), we identified SLE cases (≥2 ICD-10 codes on separate days) and matched controls by age, sex, and genetically inferred ancestry. Supervised models (logistic regression, random forest, XGBoost) were trained using 5-year pre-diagnosis features: GRS, medications, symptoms, reproductive, and self-reported lifestyle factors. Performance was assessed across time windows on a held-out test set.For phenotyping, we implemented two unsupervised clustering pipelines. Approach A used normalized immune, renal, and metabolic lab data during flare and non-flare periods. Approach B integrated post-diagnosis lab values, medication classes, and grouped conditions (6-18 months post-index). Dimensionality reduction (PCA, UMAP) preceded clustering (k-means, HDBSCAN). Clusters were interpreted based on lab profiles, treatment exposure, and organ-specific manifestations.

Results: 1526 SLE cases were matched to 3942 non-SLE controls. (Table 1) Prediction models yielded AUCs 0.61–0.64. GRS, joint pain, depression, and obesity were consistent predictors. Accuracy peaked near diagnosis (AUC 0.71), though at the cost of recall—highlighting the challenge of early signal detection.Clustering revealed distinct, interpretable phenotypes (Figures 1-2). A flare-associated cluster had concurrent dsDNA elevation, proteinuria, and transaminitis (renal/hepatic flare). A non-flare cluster featured persistent hypocomplementemia and hypertriglyceridemia, consistent with systemic inflammation and metabolic dysregulation. A 3rd group had high psychiatric and musculoskeletal burden with minimal serologic activity or immunosuppressant exposure—suggesting a care-intensive, serologically quiescent phenotype. Clusters demonstrated high clinical interpretability and mapped to established organ-domain patterns, with approach-specific silhouette scores from 0.27–0.49.

Conclusion: This is among the 1st studies to apply integrated machine learning across genomic and EHR and self-reported data to model both SLE risk and phenotypic heterogeneity. While predictive performance remains limited, phenotyping revealed actionable, domain-aligned subtypes with potential to inform prognosis, treatment, and clinical trial design. These findings highlight the promise of real-world, multimodal data to advance precision medicine in rheumatology.

Supporting image 1Table 1. Selected Demographic and Clinical Characteristics of the SLE Cases and their Matched Controls for 5 year – 1 year Pre-Diagnosis (SLE Diagnosis Index Date) in the All of Us Cohort, version 8

Supporting image 2Figure 1. SLE Patient Phenotypes for New Onset SLE Cases in the All of Us Program (version 8) using Genomic, EHR and Self-reported data (Flare-State Patients, Phenotyping Approach A)

Supporting image 3Figure 2. Top 10 Differentiating Features by Patient Type/Cluster (Phenotyping Approach B)


Disclosures: H. Sporn: None; R. Parulekar-Martins: None; H. Wang: None; X. Yu: None; J. Yee: None; Y. Kim: None; J. Cui: None; K. Costenbader: AbbVie, 2, 5, Bain, 2, 5, Biogen, 2, 5, Brigham & Women’s Hospital, 3, GSK, 2, 5.

To cite this abstract in AMA style:

Sporn H, Parulekar-Martins R, Wang H, Yu X, Yee J, Kim Y, Cui J, Costenbader K. Multi-Modal Machine Learning Prediction and Phenotyping of Systemic Lupus Erythematosus Using Longitudinal EHR and Genomic Data from the All of Us Program [abstract]. Arthritis Rheumatol. 2025; 77 (suppl 9). https://acrabstracts.org/abstract/multi-modal-machine-learning-prediction-and-phenotyping-of-systemic-lupus-erythematosus-using-longitudinal-ehr-and-genomic-data-from-the-all-of-us-program/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2025

ACR Meeting Abstracts - https://acrabstracts.org/abstract/multi-modal-machine-learning-prediction-and-phenotyping-of-systemic-lupus-erythematosus-using-longitudinal-ehr-and-genomic-data-from-the-all-of-us-program/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

Embargo Policy

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM CT on October 25. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology