ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2025
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • 2020-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 2260

Generative AI model identifies patients with Rheumatoid Arthritis (RA) months prior the diagnosis date: results from a large real-world RA cohort

Shant Ayanian1, Siavash Rezaei2, Daniel Darveaux3, Marc Blasi3 and Elena myasoedova1, 1Mayo Clinic, Rochester, MN, 2Cerebras, Santa Clara, CA, 3Mayo Clinic, Rochester

Meeting: ACR Convergence 2025

Keywords: Decision analysis, rheumatoid arthritis

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Tuesday, October 28, 2025

Title: (2227–2264) Rheumatoid Arthritis – Diagnosis, Manifestations, and Outcomes Poster III

Session Type: Poster Session C

Session Time: 10:30AM-12:30PM

Background/Purpose: Clinical notes typically contain a valuable trove of information which is rarely used in predictive modeling given the complexity of working with unstructured data. As artificial intelligence (AI) advances and multimodal models become necessary to discern complex clinical relationships, the data encoded in clinical notes become extremely important to use. We aimed to apply a generative AI model, i.e. Jina-embeddings –V3 (Jina) model to medical records of patients with rheumatoid arthritis (RA) and show its ability to extract relevant phenotypic features of RA, testing the hypothesis that the model will be able to predict RA diagnosis before the clinical diagnosis.

Methods: The study included 4100 patients with RA diagnosis between 2000 and 2024 mean age was 61.1 (13.4) , 2870 (70%) females, 2450 (59.7%) RF and/or CCP antibody positive). RA was defined as 2 ICD 9/10 codes at least 30 days apart, and each case was confirmed by manual record review, as well as around 80000 persons without RA with notes from the same time period. The notes were arranged in chronological order. Jina was chosen as the embedding model to fine tune given its excellent performance as compared to its relatively small size of 570 million parameters. The model was hosted on a Google cloud container which also hosted a separate storage bucket for all clinical notes extracted for the manually curated dataset of patients with and without RA. The Jina model was fine-tuned on 70% of these notes while the rest of the notes were saved for testing and validation. The fine tuning was performed on the equivalent of 8 H100 GPU (graphic processing units) over the course of 12 hours.

Results: The total number of notes used during the fine-tuning process was 7.8 million. This included 1.2 million notes from patients with RA and 6.6 million notes from persons without RA. These notes included all physician provider notes from any specialty, as well as allied health notes such as nursing notes and clinical communications, as summarized in Table 1. After the model was trained, it was tested on 3.2 million notes (2.6 million non RA and 650K RA patients). Mean duration of available follow up prior to RA diagnsis was: 8.7 yrs (SD 11 yrs). The model was able to discern between patients with RA and without RA with an average precision of 0.8 up to 12 months in advance of their RA diagnosis date.

Conclusion: We showed the ability of the generative AI model (i.e., the fine-tuned Jina embedding model) to predict RA onset months prior to the clinical diagnosis, based on clinical notes. Such predictions can be used to alert a non-rheumatological provider about a concern for RA in advance, enabling earlier referral to a rheumatologist. The work is ongoing on benchmarking this model against a manually curated set of phenotypic characteristics specifically as it relates to the model’s ability to encode clinically relevant information which can be used in different clinical predictive models.

Supporting image 1


Disclosures: S. Ayanian: None; S. Rezaei: None; D. Darveaux: None; M. Blasi: None; E. myasoedova: None.

To cite this abstract in AMA style:

Ayanian S, Rezaei S, Darveaux D, Blasi M, myasoedova E. Generative AI model identifies patients with Rheumatoid Arthritis (RA) months prior the diagnosis date: results from a large real-world RA cohort [abstract]. Arthritis Rheumatol. 2025; 77 (suppl 9). https://acrabstracts.org/abstract/generative-ai-model-identifies-patients-with-rheumatoid-arthritis-ra-months-prior-the-diagnosis-date-results-from-a-large-real-world-ra-cohort/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2025

ACR Meeting Abstracts - https://acrabstracts.org/abstract/generative-ai-model-identifies-patients-with-rheumatoid-arthritis-ra-months-prior-the-diagnosis-date-results-from-a-large-real-world-ra-cohort/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

Embargo Policy

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM CT on October 25. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology