ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2025
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • 2020-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 1888

Developing Machine Learning Algorithm in Electronic Health Record to Accurately Identify Psoriasis Patients

Dilli Poudel1, Leslie Crofford2, Avery Fortier3, Lee Wheless4, Georgina Sellyn4, Gyaljin Sherpa5, Alexis Ogdie6 and Paras Karmacharya7, 1Vanderbilt University Medical Center, Nashville, TN, 2Vanderbilt University Medical Center, Melbourne, AR, 3Vanderbilt University, Lexington, MA, 4Vanderbilt University, Nashville, TN, 5Mercy Health - St. Elizabeth Youngstown Hospital, Newyork, NY, 6Department of Medicine, Perelman School of Medicine at the University of Pennsylvania, Wilmington, DE, 7Vanderbilt University, Nashville, TN, Hermitage, TN

Meeting: ACR Convergence 2025

Keywords: autoimmune diseases, Statistical methods

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Tuesday, October 28, 2025

Title: (1877–1913) Epidemiology & Public Health Poster III

Session Type: Poster Session C

Session Time: 10:30AM-12:30PM

Background/Purpose: Accurate identification of patients with psoriasis from large electronic health record (EHR) databases is crucial for conducting robust real-world research. While EHRs offer an efficient resource, traditional identification methods are often laborious. Machine learning (ML) can present a promising high throughput approach with accuracy. The objective of this study was to develop and validate machine learning algorithms using EHR data to identify patients with psoriasis.

Methods: We used data from Vanderbilt University Medical Center’s de-identified EHR database, the Synthetic Derivative (SD), which contains longitudinal clinical data from approximately 3.5 million subjects. Adult patients (≥18 years) were initially screened based on potential psoriasis indicators like International Classification of Diseases (ICD) codes or keywords. Non-overlapping random samples formed the training (N&#3f200) and validation (N&#3f300) sets. A true psoriasis case was defined through manual chart review as documented confirmatory or probable diagnosis by a dermatologist or rheumatologist, or a positive skin biopsy. Potential machine learning features included ICD-9/10 codes for psoriasis and related conditions, provider specialty associated with codes (dermatology, rheumatology), problem list keywords, and use of relevant medications (e.g., topical treatments, systemic therapies, biologics). Three ML models (logistic regression, random forest, XGBoost) were developed on the training set, with the random forest model undergoing hyperparameter tuning via repeated 10-fold cross-validation. Models were tested on the independent validation set, assessing performance using F1-score, sensitivity, positive predictive value (PPV), area under the receiver operating characteristic curve (AUC), and accuracy.

Results: All three ML models demonstrated strong performance identifying psoriasis patients in the validation set. Tuned random forest model achieved the highest F1-score (0.757), with an AUC of 0.871, sensitivity of 0.791, and PPV of 0.725 (Table 1, Figure 1). Feature importance analysis confirmed that ICD codes related to psoriasis diagnoses documented by dermatologists and rheumatologists, overall psoriasis visit codes, and specific medication usage patterns were among the most significant predictors of a true psoriasis diagnosis (Figure 2).

Conclusion: Machine learning algorithms, particularly tuned random forest models, leveraging readily available EHR data could provide an accurate and efficient method for identifying psoriasis patients comparable to true diagnosis made by dermatologist or rheumatologist or with skin biopsy. Such validated algorithms represent valuable tools for researchers seeking to assemble large, real-world cohorts of patients with psoriasis enabling studies on disease epidemiology, treatment effectiveness, and patient outcomes.

Supporting image 1Comparison of Performances Among Three Different Machine Learning Models

Supporting image 2Receiver Operating Curves with Area Under Curve for Random Forest Model in Training and Validation Sets

Supporting image 3Top 15 Predictors of a True Psoriasis Diagnosis


Disclosures: D. Poudel: None; L. Crofford: Argenyx, 5, Evergreen, 5, Kezar, 1, Otsaka, 1, UCB, 1; A. Fortier: None; L. Wheless: None; G. Sellyn: None; G. Sherpa: None; A. Ogdie: AbbVie, 5, Amgen, 5, 11, Bristol Myers Squibb, 5, Celgene, 5, CorEvitas, 2, Eli Lilly, 5, Novartis, 5, 11, Pfizer, 5, 11; P. Karmacharya: None.

To cite this abstract in AMA style:

Poudel D, Crofford L, Fortier A, Wheless L, Sellyn G, Sherpa G, Ogdie A, Karmacharya P. Developing Machine Learning Algorithm in Electronic Health Record to Accurately Identify Psoriasis Patients [abstract]. Arthritis Rheumatol. 2025; 77 (suppl 9). https://acrabstracts.org/abstract/developing-machine-learning-algorithm-in-electronic-health-record-to-accurately-identify-psoriasis-patients/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2025

ACR Meeting Abstracts - https://acrabstracts.org/abstract/developing-machine-learning-algorithm-in-electronic-health-record-to-accurately-identify-psoriasis-patients/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

Embargo Policy

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM CT on October 25. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology