ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0033

A Novel Transcriptome-Based Machine Learning Pipeline Predicts Phenotypes of Lupus Patients

Emily Leventhal1, Andrea Daamen1, Amrie Grammer2 and Peter Lipsky2, 1AMPEL BioSolutions, LLC, and The RILITE Research Institute, Charlottesville, VA, 2AMPEL BioSolutions, Charlottesville, VA

Meeting: ACR Convergence 2023

Keywords: autoimmune diseases, Bioinformatics, Nephritis, Systemic lupus erythematosus (SLE)

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Sunday, November 12, 2023

Title: (0013–0039.5) Genetics, Genomics & Proteomics Poster

Session Type: Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: Gene expression analysis coupled with machine learning (ML) holds the promise of identifying subsets of patients with heterogeneous diseases, such as systemic lupus erythematosus (SLE), a multi-organ autoimmune disease with known diversity in both clinical presentation and gene expression profiles. However, phenotype prediction based on gene expression has often relied on the performance of individual genes without a systems biology context. To address this, we created an interpretable ML approach to predict clinical phenotype of SLE patients in a non-invasive manner using blood transcriptomic profiles.

Methods: We developed a sequential grouped feature importance algorithm to assess the performance of gene sets, including those identifying immune and metabolic pathways and cell types known to be abnormal in SLE, in predicting the presence of lupus as well as disease activity and organ involvement. We normalized, merged, and batch-corrected publicly available datasets and created six ML models to predict SLE from healthy controls (CTL), inactive SLE from CTL, active from inactive SLE, lupus nephritis (LN) from CTL, LN from non-renal lupus, and rheumatoid arthritis (RA) from SLE.

Results: The SGFI algorithm first selects the best gene set to predict phenotype for the model, and then sequentially adds in additional gene sets if they improve the performance of the model (Figure 1A). SGFI provides a way to reduce the high dimensionality of transcriptomic datasets meaningfully, as it incorporates prior knowledge of biology into the data while also selecting pathways in an unbiased manner, leading to biologically informative conclusions. After feature selection, the best gene set combination was found via 10-fold cross validation on the train set and then evaluated on the test set (Figure 1B-D). We then performed gene set variation analysis to examine how these pathways differ across clinical phenotypes (Figure 1E). Gene sets related to interferon, tumor necrosis factor, the mitoribosome, and anergic/activated T cell were the best predictors of phenotype in all classifications (Table 1). The ML models created with those genes as features had excellent performance with AUCs ranging from 0.842 to 0.989 and accuracies of 0.824 to 0.942 (Table 1).

Conclusion: A novel feature selection approach combined with interpretable ML performs extremely well in predicting SLE phenotypes and in separating patients with lupus from both normal and those with RA. Moreover, since interpretable ML can be used to suggest potential causal relationships, these results point to associations between the molecular pathways identified in each model and manifestations of SLE pathogenesis. This innovative ML approach can help improve recognition of SLE phenotypic subsets and additionally be applied to other diseases and tissues.

Supporting image 1

Figure 1. Results from the SLE vs. CTL ML pipeline. A) Sankey plot of the results from feature selection. Feature groups chosen in five or more subsamples are shown in the plot. Feature groups are separated by the symbol, ‘+’. The average mean misclassification error (MMCE) of each combination across the subsamples and the number of subsamples (n) that module was shown is next to the gene sets. B) Results from evaluation of the final model on the test set. C) Receiver operating characteristic and D) precision-recall curves of final model on the test set. GSVA results of E) interferon (IFN), F) large subunit of mitoribosome (Mito Large Ribo), and G) tumor necrosis factor induced genes (TNF).

Supporting image 2

Table 1. Summary of the accuracy metrics in the six machine learning models constructed for different patient classification problems.


Disclosures: E. Leventhal: None; A. Daamen: None; A. Grammer: None; P. Lipsky: None.

To cite this abstract in AMA style:

Leventhal E, Daamen A, Grammer A, Lipsky P. A Novel Transcriptome-Based Machine Learning Pipeline Predicts Phenotypes of Lupus Patients [abstract]. Arthritis Rheumatol. 2023; 75 (suppl 9). https://acrabstracts.org/abstract/a-novel-transcriptome-based-machine-learning-pipeline-predicts-phenotypes-of-lupus-patients/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2023

ACR Meeting Abstracts - https://acrabstracts.org/abstract/a-novel-transcriptome-based-machine-learning-pipeline-predicts-phenotypes-of-lupus-patients/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology