ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 2183

Accuracy of AI Tools for Diagnosis of Connective Tissue Disease

Indira Sriram1, Gabriel Tarshish2 and Megan Curran2, 1University of Colorado, Aurora, CO, 2Children's Hospital Colorado, Aurora, CO

Meeting: ACR Convergence 2024

Keywords: dermatomyositis, informatics, Systemic lupus erythematosus (SLE), Systemic sclerosis

  • Tweet
  • Email
  • Print
Session Information

Date: Monday, November 18, 2024

Title: Pediatric Rheumatology – Clinical Poster III

Session Type: Poster Session C

Session Time: 10:30AM-12:30PM

Background/Purpose: In this work, we study the ability of generative artificial intelligence (AI) based tools to diagnose pediatric rheumatological diseases. Specifically, we seek to answer two questions: 1. What is the impact of symptomatology on the robustness of output provided by AI tools? 2) What is the impact of non-technical terminology on the diagnostic accuracy of AI tools, and could it be safe for self-diagnosis? We examined the ability of AI to diagnose four connective tissue diseases: juvenile dermatomyositis (JDM), lupus, systemic sclerosis (SSc), and mixed connective tissue disease (MCTD).

Methods: We compared the two large language models, ChatGPT 3.5 and Claude Sonnet. Both systems were provided with a standardized prompt with variable symptom list. We obtained lists of common symptoms for these diseases from a pediatric rheumatology textbook (Cassidy and Petty). These symptom lists came with an associated probability. We then generated a symptom list based on these probabilities and input them into the program using the standardized prompt. We generated 250 symptom lists using textbook descriptions of the symptoms, and 1000 symptom lists with non-medical language for each of the four disorders we studied. We then parsed the output to determine if AI was able to generate the correct diagnosis.

Results: The tools were better at diagnosing conditions when presented with prompts containing standard medical terminology. This is likely related to the presence of this information in the sources used to train these models.  AI tools were also able to diagnose diseases with pathognomonic findings more readily than those with more subtle findings. Specifically, any prompt containing “Gottron’s papules” consistently leads to a diagnosis of juvenile dermatomyositis.

The diagnostic accuracy of the tools dropped significantly when they were provided with colloquial language. This suggests that they are not able to provide accurate diagnoses when they are not given strictly clinical language.

The tools consistently included rheumatological diagnoses within their top five diagnoses for most prompts, even if provided with non-medical terminology. This indicates that they may assist with triggering the correct referral. Claude Sonnet appeared to outperform ChatGPT, so it may draw from a different collection of medical information.

Conclusion: We examined the ability of AI to diagnose four pediatric rheumatological diseases accurately (JDM, lupus, MCTD, and SSc). We noted that AI is more likely to diagnose diseases accurately when there are clear pathognomonic findings (i.e. Gottron’s papules). We determined that the diagnostic accuracy of AI drops when it is presented with non-medical terminology, suggesting that it is drawing information primarily from medical sources. As a result, AI may not be effective nor safe for individuals who are attempting to self-diagnose. Further improvement is required in these AI systems before they can be safely used by clinicians for diagnostic support.

Supporting image 1

Standardized prompt provided to AI tools

Supporting image 2

Comparison of diagnostic accuracy of ChatGPT 3.5 and Claude Sonnet for JDM, SLE, MCTD, and SSc.


Disclosures: I. Sriram: None; G. Tarshish: None; M. Curran: None.

To cite this abstract in AMA style:

Sriram I, Tarshish G, Curran M. Accuracy of AI Tools for Diagnosis of Connective Tissue Disease [abstract]. Arthritis Rheumatol. 2024; 76 (suppl 9). https://acrabstracts.org/abstract/accuracy-of-ai-tools-for-diagnosis-of-connective-tissue-disease/. Accessed .
  • Tweet
  • Email
  • Print

« Back to ACR Convergence 2024

ACR Meeting Abstracts - https://acrabstracts.org/abstract/accuracy-of-ai-tools-for-diagnosis-of-connective-tissue-disease/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology