ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2025
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • 2020-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0183

Optimizing the Clinical Application of Rheumatology Guidelines Using Large Language Models: A Retrieval-Augmented Generation Framework Integrating EULAR and ACR Recommendations

Alfredo Madrid1, Diego Benavent2, Chamaida Plasencia-Rodríguez3, Zulema Rosales-Rosado4, Beatriz Merino-Barbancho5 and DALIFER FREITES6, 1Roche, Global Data Science and Analytics, Madrid, Spain., Madrid, Madrid, Spain, 2Hospital Universitari de Bellvitge, Madrid, Spain, 3Hospital Universitario La Paz, MADRID, Spain, 4Grupo de Patología Musculoesquelética. Hospital Clínico San Carlos. Instituto de Investigación Sanitaria San Carlos (IdISSC), Madrid, Spain, Madrid, Madrid, Spain, 5Escuela Técnica Superior de Ingenieros de Telecomunicación. Universidad Politécnica de Madrid. Avenida Complutense, Madrid, Spain, Madrid, Madrid, Spain, 6Rheumatology Service, San Carlos Clinical Hospital, Madrid, Madrid, Spain

Meeting: ACR Convergence 2025

Keywords: informatics, Managed Care

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Sunday, October 26, 2025

Title: (0175–0198) Health Services Research Poster I

Session Type: Poster Session A

Session Time: 10:30AM-12:30PM

Background/Purpose: Timely access to current rheumatology guidelines at the point of care presents a significant challenge. Large Language Models (LLMs) offer potential solutions, but their propensity for “hallucinations” raises safety concerns. The primary objective of this study was to develop and evaluate a novel Retrieval-Augmented Generation (RAG) system, the first of its kind specifically for adult rheumatology, integrating European Alliance of Associations for Rheumatology (EULAR) and American College of Rheumatology (ACR) guidelines to provide clinicians with timely, evidence-based recommendations.

Methods: Seventy-four clinically relevant EULAR and ACR management guidelines for adult rheumatology were selected and processed. A RAG system was implemented using the LangChain framework, voyage-3 embedding model, and a Qdrant vector database, see Figure 1. For evaluation, 740 guideline-specific questions were generated. Answers were produced by an LLM (ChatGPT-o3-mini) with context retrieval (RAG) and without (baseline). Performance was assessed by an LLM-as-a-judge (Gemini 2.0 Flash) using a 5-point Likert scale across five dimensions (relevance, factual accuracy, safety, completeness, conciseness) and by determining preference. Wilcoxon signed-rank and Binomial tests were used for statistical analysis. Two blinded rheumatologists independently validated a random 15% sample of questions.

Results: The LLM-as-a-judge evaluation revealed that the RAG system significantly outperformed the baseline system across all criteria (p< 0.001). The RAG system was significantly preferred by the LLM-as-a-judge in 92.8% of comparisons (p< 0.001), Table 1 Manual evaluation by rheumatologists confirmed these findings, with significant improvements in accuracy, safety, and completeness for the RAG system (p< 0.001), which was preferred in 71.2%-74.8% of comparisons (p< 0.001), Table 2.

Conclusion: This study successfully developed and validated a RAG system integrating EULAR and ACR guidelines for adult rheumatology. The system significantly enhances the quality and reliability of LLM-generated answers, providing a robust foundation for AI-driven clinical decision support tools. Such tools have the potential to improve guideline adherence and evidence-based practice in rheumatology by offering clinicians rapid, context-aware access to recommendations.

Supporting image 1Figure 1: Walkthrough of the entire process—from initial creation to final evaluation—of the RAG architecture proposed

Supporting image 2Table 1: LLM-as-a-judge evaluation results.

Supporting image 3Table 2: Manual evaluation results.


Disclosures: A. Madrid: Roche, 3; D. Benavent: AbbVie/Abbott, 2, 6, Eli Lilly, 6, Janssen, 6, Novartis, 5, 6, Pfizer, 6, Savana, 7, UCB, 2, 6; C. Plasencia-Rodríguez: AbbVie/Abbott, 5, 6, Eli Lilly, 6, Novartis, 6, Pfizer, 5, 6, UCB, 6; Z. Rosales-Rosado: None; B. Merino-Barbancho: None; D. FREITES: None.

To cite this abstract in AMA style:

Madrid A, Benavent D, Plasencia-Rodríguez C, Rosales-Rosado Z, Merino-Barbancho B, FREITES D. Optimizing the Clinical Application of Rheumatology Guidelines Using Large Language Models: A Retrieval-Augmented Generation Framework Integrating EULAR and ACR Recommendations [abstract]. Arthritis Rheumatol. 2025; 77 (suppl 9). https://acrabstracts.org/abstract/optimizing-the-clinical-application-of-rheumatology-guidelines-using-large-language-models-a-retrieval-augmented-generation-framework-integrating-eular-and-acr-recommendations/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2025

ACR Meeting Abstracts - https://acrabstracts.org/abstract/optimizing-the-clinical-application-of-rheumatology-guidelines-using-large-language-models-a-retrieval-augmented-generation-framework-integrating-eular-and-acr-recommendations/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

Embargo Policy

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM CT on October 25. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology