ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: L04

Performance of an Artificial Intelligence Model Compared to Multiple Human Experts in Scoring Synovitis Severity and Osteophyte Severity on Joint Ultrasound Images

Anders Weber1, Mads Ammitzbøll Danielsen2, Bill Aplin Frederiksen3, Hilde Berner Hammer4, Benjamin Schultz Overgaard3, Lene Terslev2, Thiusius Rajeeth Savarimuthu5 and Soren Andreas Just3, 1ROPCA, Odense, Denmark, 2Center for Rheumatology and Spine Disease, Rigshospitalet, Glostrup, Denmark, 3Section of Rheumatology, Department of Medicine, Svendborg Sygehus OUH, Svendborg, Denmark, 4Center for Treatment of Rheumatic and Musculoskeletal Diseases (REMEDY), Diakonhjemmet Hospital, Oslo, Norway, Faculty of Medicine, University of Oslo, Oslo, Norway, 5Mærsk Mc-Kinney Møller Institute, University of Southern Denmark, Odense, Denmark

Meeting: ACR Convergence 2024

Date of first publication: October 24, 2024

Keywords: Imaging, Late-Breaking 2024, Osteoarthritis, Synovitis

  • Tweet
  • Email
  • Print
Session Information

Date: Monday, November 18, 2024

Title: (L01–L14) Late-Breaking Posters

Session Type: Poster Session C

Session Time: 10:30AM-12:30PM

Background/Purpose: To evaluate the agreement of an artificial intelligence (AI) model designed to assess greyscale and Doppler synovitis severity and osteophyte severity in hand joints compared to human expert raters, using a consensus score as the gold standard.

Methods: Ultrasound images of metacarpophalangeal (MCP), proximal interphalangeal (PIP), distal interphalangeal (DIP), and interphalangeal (IP) joints were collected from patients with hand pain. Rheumatologists, all EULAR-certified ultrasound instructors, scored the images for synovial hypertrophy (SH) (5 raters), Doppler activity (3 raters), and osteophyte severity (4 raters) on a scale from 0 to 3 using the Global OMERACT-EULAR Synovitis Score (GLOESS) and the corresponding osteophyte scoring system. The AI model was trained, validated, and tested on 7314 images. The disease classifications of the AI model were tested against the raters on 1280 ultrasound images to assess SH, 840 ultrasound videos to assess Doppler activity and 351 ultrasound images to assess osteophytes. The agreement with the consensus was calculated as the AI’s average agreement with all raters. Performance metrics, including Cohen’s Kappa, Percent Exact Agreement (PEA), Percent Close Agreement (PCA), sensitivity, specificity, Positive Predictive Value (PPV), and Negative Predictive Value (NPV), were calculated with 95% confidence intervals (CI).

Results: As illustrated in Figure 1, the AI and human raters achieved comparable results across all metrics.

SH: The AI vs. consensus showed a Kappa of 0.39 (95% CI: 0.35–0.44), PEA of 51.77% (95% CI: 48.83–54.70%), PCA of 91.03% (95% CI: 89.21–92.63%), sensitivity of 46.19% (95% CI: 39.13–53.32%), and specificity of 90.43% (95% CI: 88.35–92.25%).

Doppler Activity: The AI vs. consensus had a Kappa of 0.61 (95% CI: 0.54–0.67), PEA of 80.49% (95% CI: 77.51–83.22%), PCA of 97.13% (95% CI: 95.69–98.18%), sensitivity of 67.31% (95% CI: 51.86–80.24%), and specificity of 96.29% (95% CI: 94.65–97.52%).

Osteophyte Grading: The AI vs. consensus showed a Kappa of 0.55 (95% CI: 0.46–0.63), PEA of 70.69% (95% CI: 65.57–75.45%), PCA of 96.28% (95% CI: 93.70–98.01%), sensitivity of 56.43% (95% CI: 31.56–73.36%), and specificity of 95.36% (95% CI: 92.44–97.36%).

These metrics, along with overlapping 95% confidence intervals depicted in Figure 1, indicate that the AI’s performance is comparable to that of experienced human raters across all metrics.

Conclusion: The AI model performed at the level of expert human raters in assessing synovial hypertrophy, Doppler activity, and osteophyte severity in hand joints. This suggests that AI can be a reliable tool for evaluating joint ultrasound images, potentially aiding clinical decision-making by providing consistent and standardized assessments.

Supporting image 1

Performance metrics for synovial hypertrophy, Doppler activity, and osteophyte scoring for the AI and human raters, with 95% confidence intervals.


Disclosures: A. Weber: ROPCA ApS, 3; M. Ammitzbøll Danielsen: None; B. Aplin Frederiksen: None; H. Berner Hammer: None; B. Schultz Overgaard: None; L. Terslev: None; T. Rajeeth Savarimuthu: Ropca ApS, 8; S. Just: ROPCA, 2, 8.

To cite this abstract in AMA style:

Weber A, Ammitzbøll Danielsen M, Aplin Frederiksen B, Berner Hammer H, Schultz Overgaard B, Terslev L, Rajeeth Savarimuthu T, Just S. Performance of an Artificial Intelligence Model Compared to Multiple Human Experts in Scoring Synovitis Severity and Osteophyte Severity on Joint Ultrasound Images [abstract]. Arthritis Rheumatol. 2024; 76 (suppl 9). https://acrabstracts.org/abstract/performance-of-an-artificial-intelligence-model-compared-to-multiple-human-experts-in-scoring-synovitis-severity-and-osteophyte-severity-on-joint-ultrasound-images/. Accessed .
  • Tweet
  • Email
  • Print

« Back to ACR Convergence 2024

ACR Meeting Abstracts - https://acrabstracts.org/abstract/performance-of-an-artificial-intelligence-model-compared-to-multiple-human-experts-in-scoring-synovitis-severity-and-osteophyte-severity-on-joint-ultrasound-images/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology