ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0353

Validation of Claims-based Algorithms for Newly Diagnosed Juvenile Idiopathic Arthritis Using Machine Learning Methods

Patricia Hoffman1, Lauren Parlett2, Daniel Beachler2, Daniel Reiff3, Sarah McGuire1, Sonia Pothraj1, Lakshmi Moorthy1, Cynthia Salvant4, Dawn Koffman5, Sanika Rege5, Cecilia Huang5, Matthew Iozzio5, Kevin Schott2, Kevin Haynes6, Amy Davidow7, Stephen Crystal8, Tobias Gerhard5, Brian Strom9, Carlos Rose10 and Daniel Horton4, 1Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, 2Carelon, Wilmington, DE, 3University of Alabama at Birmingham, Birmingham, AL, 4Department of Pediatrics, Rutgers Robert Wood Johnson Medical School, New Brunswick, NJ, 5Rutgers Center for Pharmacoepidemiology and Treatment Science, New Brunswick, NJ, 6Janssen Pharmaceuticals, Raritan, NJ, 7New York University, New York, NY, 8Rutgers University, New Brunswick, NJ, 9Rutgers, State University of New Jersey, New Brunswick, NJ, 10Thomas Jefferson University, Philadelphia, PA

Meeting: ACR Convergence 2023

Keywords: Administrative Data, Epidemiology, informatics, Juvenile idiopathic arthritis

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Sunday, November 12, 2023

Title: (0345–0379) Pediatric Rheumatology – Clinical Poster I: JIA

Session Type: Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: Administrative claims databases represent important settings for studying large populations with juvenile idiopathic arthritis (JIA), but prior efforts to validate diagnostic algorithms for JIA using administrative data have been limited to potentially non-generalizable settings (e.g., specific clinics or health care systems). We aimed to develop and validate algorithms for new diagnoses of JIA in a large claims database using rule-based and machine learning-based approaches.

Methods: We performed a cross-sectional validation study using US commercial health plan data (2013-2020). We identified children diagnosed with JIA (ICD-9-CM: 696.0, 714, 720; ICD-10-CM: L40.5, M05, M06, M08, M45) before age 18 following ≥12 months of baseline continuous enrollment without JIA diagnosis or immunosuppression. JIA diagnoses were based on 3 previously validated definitions: 1) rheumatologist’s diagnosis plus orders for ≥2 specific laboratory tests; 2) ≥2 outpatient diagnoses 8-52 weeks apart; or 3) 1 inpatient diagnosis. Charts from a random subset of subjects meeting each definition were abstracted and independently adjudicated by clinical experts; discrepancies were resolved by a third expert or, where necessary, consensus. Incident JIA was defined as definite or probable JIA diagnosed in the prior 4 months. Using data from 1 year before through 1 year after first JIA diagnosis, we then created candidate predictor variables from demographics, diagnoses, medications, procedures, and specialty of clinicians diagnosing JIA. After applying a simulation-based balancing method (Synthetic Minority Oversampling Technique, SMOTE), we selected optimal logistic regression regularization hyperparameters using 10-fold cross-validation. Model variables were used to score observations, and sensitivity, specificity, and positive predictive value (PPV) [95% confidence interval (CI)] were assessed at different thresholds of predicted JIA probability.

Results: Of 182 eligible charts reviewed (92 ICD-9-based, 90 ICD-10-based), 133 had definite/probable JIA (ICD-9 64%, ICD-10 82%). Of JIA diagnoses, 90 were incident (ICD-9 90%, ICD-10 50%). Rule-based algorithms had limited PPV for incident JIA (ICD-9 58%, ICD-10 41%) (Table). Use of machine-learning based algorithms enabled excellent discrimination between incident and prevalent JIA (ICD-9 AUC 0.97, ICD-10 AUC 0.88) and between incident JIA and unlikely JIA (ICD-9 AUC 0.99, ICD-10 AUC 0.94) (Figure 1). Specific predicted probability thresholds yielded excellent test characteristics for differentiating incident JIA from unlikely JIA (ICD-9: sensitivity 95%, specificity 96%, PPV 96% [95% CI 96-100%]; ICD-10: sensitivity 81%, specificity 92%, PPV 91% [95% CI 84-97%]) (Figure 2).

Conclusion: Machine learning-based diagnostic algorithms for incident JIA enhanced traditional rule-based algorithms in identifying new diagnoses of JIA using ICD-9 and ICD-10 codes within a large US claims database. External validation of these models is warranted, but these algorithms will facilitate use of administrative data to study JIA diagnosis, management, and outcomes in large populations.

Supporting image 1

Table. Accuracy of rule-based algorithms for incident JIA. ICD, International Classification of Diseases; Inc., incident; JIA, juvenile idiopathic arthritis; PPV, positive predictive value; Prev., prevalent. Numbers of charts are shown by claims-based definition and adjudicated diagnosis. PPV represents the percentage with definite or probable incident JIA among those meeting the claims-based definition. Specific labs eligible for Definition 1 were antinuclear antibody, rheumatoid factor, anti-cyclic citrullinated protein antibody, and HLA-B27.

Supporting image 2

Figure 1. ROC curves for machine learning-based algorithms for incident JIA. AUC, area under the curve; ROC, receiver operating characteristic; ICD, International Classification of Diseases; JIA, juvenile idiopathic arthritis. ROC curves show balance of sensitivity and specificity of machine learning-based algorithms for incident JIA based on comparisons with prevalent JIA (ICD-9 in A, ICD_10 in B) and with unlikely JIA (ICD-9 in C, ICD_10 in D). The Youden Index (dot on ROC curve) represents the maximum value of (Sensitivity + Specificity – 1).

Supporting image 3

Figure 2. Impact of predicted probability thresholds from machine learning-based models on accuracy of diagnoses of incident JIA compared with unlikely JIA. CI, confidence interval; ICD, International Classification of Diseases; PPV, positive predictive value. Curves show numbers of true positive (dark grey) and false positive (light grey) cases of incident JIA at different thresholds of predicted probability from machine learning-based models comparing children with incident JIA and children unlikely to have JIA based on ICD-9 codes (A) and ICD_10 codes (B). Numbers reflect the total study population, including simulated cases generated by application of a balancing method (Synthetic Minority Oversampling Technique, SMOTE). Threshold values yielding excellent test characteristics (dotted lines) were 0.97 for ICD-9 (sensitivity 95%, specificity 96%, PPV 96% [95% CI 96_100%]) and 0.22 for ICD_10 (sensitivity 81%, specificity 92%, PPV 91% [95% CI 84-97%]).


Disclosures: P. Hoffman: None; L. Parlett: Elevance Health, 3, Sanofi, 5; D. Beachler: Elevance Health, 3; D. Reiff: None; S. McGuire: None; S. Pothraj: None; L. Moorthy: Bristol-Myers Squibb(BMS), 12, SitePI for Abatacept registry BMS.; C. Salvant: None; D. Koffman: None; S. Rege: None; C. Huang: None; M. Iozzio: None; K. Schott: None; K. Haynes: Janssen, 3; A. Davidow: None; S. Crystal: None; T. Gerhard: None; B. Strom: AbbVie/Abbott, 2, Consumer Healthcare Products Association, 2; C. Rose: None; D. Horton: Danisco USA, Inc., 5.

To cite this abstract in AMA style:

Hoffman P, Parlett L, Beachler D, Reiff D, McGuire S, Pothraj S, Moorthy L, Salvant C, Koffman D, Rege S, Huang C, Iozzio M, Schott K, Haynes K, Davidow A, Crystal S, Gerhard T, Strom B, Rose C, Horton D. Validation of Claims-based Algorithms for Newly Diagnosed Juvenile Idiopathic Arthritis Using Machine Learning Methods [abstract]. Arthritis Rheumatol. 2023; 75 (suppl 9). https://acrabstracts.org/abstract/validation-of-claims-based-algorithms-for-newly-diagnosed-juvenile-idiopathic-arthritis-using-machine-learning-methods/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2023

ACR Meeting Abstracts - https://acrabstracts.org/abstract/validation-of-claims-based-algorithms-for-newly-diagnosed-juvenile-idiopathic-arthritis-using-machine-learning-methods/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology