ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 756

Novel Machine Learning Classifier Accurately Predicts Intrinsic Molecular Subsets for Patients with Systemic Sclerosis

Jennifer Franks1, Viktor Martyanov1, Guoshuai Cai1 and Michael L. Whitfield2, 1Department of Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH, 2Molecular and Systems Biology, Geisel School of Medicine at Dartmouth, Hanover, NH

Meeting: 2017 ACR/ARHP Annual Meeting

Date of first publication: September 18, 2017

Keywords: Gene Expression, Heterogeneous, skin and systemic sclerosis

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Sunday, November 5, 2017

Title: Systemic Sclerosis, Fibrosing Syndromes and Raynaud's – Pathogenesis, Animal Models and Genetics Poster I

Session Type: ACR Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: High-throughput gene expression profiling of skin biopsies from patients with systemic sclerosis (SSc) has identified four “intrinsic” gene expression subsets conserved across multiple cohorts and tissues. These are the inflammatory, fibroproliferative, normal-like, and limited subsets. In order to classify patients in clinical trials or for diagnostic purposes, supervised methods that can assign a single sample to a molecular subset are required. Here, we introduce a novel machine learning classifier which is a robust predictor of intrinsic subset and test it on multiple independent patient cohorts.

Methods: Three independent SSc cohorts (Milano et al. 2008, Pendergrass et al. 2012, Hinchcliff et al. 2013) with gene expression data and intrinsic subset assignments were carefully curated and merged to create a training dataset covering a broad set of 297 skin biopsies representing 97 unique patients. Supervised machine learning algorithms were rigorously trained and evaluated using repeated three-fold cross-validation. We performed external validation using two independent SSc datasets: Chakravarty et al. 2015, which contains 16 samples/8 patients and Gordon et al. 2015, which contains 12 samples/6 patients. Additionally, we validated the classifier on a cohort of SSc patients with gene expression data independently generated by Assassi et al. 2015 (102 samples/97 patients). We used weighted gene co-expression network analysis and g:Profiler to identify and functionally characterize gene modules associated with the intrinsic subsets.

Results: Repeated cross-fold validation identified gene expression features using multinomial elastic net and incorporated them into the final model which achieved an average classification accuracy of 88%. All molecular subsets were classified with high average sensitivity and specificity, particularly inflammatory (83.3% sensitivity, 95.8% specificity) and fibroproliferative (89.7% sensitivity, 94.1% specificity). Through multiple rounds of external validation, the classifier maintained an accuracy ranging from 70% to 85%. In a re-analysis of gene expression data from Assassi et al. study, we identified subsets of patients that represent the canonical inflammatory, fibroproliferative, and normal-like subsets. The inflammatory subset showed upregulated gene modules enriched in biological processes such as inflammatory response, lymphocyte activation, and stress response. Similarly, gene modules enriched for cell cycle processes were increased in the fibroproliferative subset.

Conclusion: We have developed a highly accurate and reliable classifier for SSc molecular subsets for single samples trained and tested on diverse cohorts comprised of 427 skin biopsies from 208 independent patients. These analyses show that the intrinsic gene expression subsets are a common feature of SSc found across multiple internal and external validation cohorts. Machine learning methods provide a robust and accurate mechanism for stratifying intrinsic gene expression subsets and can be used to aid clinical decision-making and interpretation for SSc patients and in clinical trials.


Disclosure: J. Franks, None; V. Martyanov, None; G. Cai, None; M. L. Whitfield, Corbus, UCB, glaxosmithkline, 5,Celdara medical llc, 9.

To cite this abstract in AMA style:

Franks J, Martyanov V, Cai G, Whitfield ML. Novel Machine Learning Classifier Accurately Predicts Intrinsic Molecular Subsets for Patients with Systemic Sclerosis [abstract]. Arthritis Rheumatol. 2017; 69 (suppl 10). https://acrabstracts.org/abstract/novel-machine-learning-classifier-accurately-predicts-intrinsic-molecular-subsets-for-patients-with-systemic-sclerosis/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to 2017 ACR/ARHP Annual Meeting

ACR Meeting Abstracts - https://acrabstracts.org/abstract/novel-machine-learning-classifier-accurately-predicts-intrinsic-molecular-subsets-for-patients-with-systemic-sclerosis/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology