ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 128

Use of Machine Learning and Traditional Statistical Methods to Classify RA-Related Disability Using Administrative Claims Data

Jeffrey R. Curtis1, Huifeng Yun2, Carol J. Etzel3, Shuo Yang4 and Lang Chen4, 1Rheumatology & Immunology, University of Alabama at Birmingham, Birmingham, AL, 2University of Alabama at Birmingham, Birmingham, AL, 3Departments of Epidemiology and Biostatistics, University of Texas School of Public Health, Houston, TX, 4Division of Clinical Immunology & Rheumatology, University of Alabama at Birmingham, Birmingham, AL

Meeting: 2017 ACR/ARHP Annual Meeting

Date of first publication: September 18, 2017

Keywords: Electronic Health Record, outcomes and rheumatoid arthritis (RA)

  • Tweet
  • Email
  • Print
Session Information

Date: Sunday, November 5, 2017

Title: Epidemiology and Public Health Poster I: Rheumatoid Arthritis

Session Type: ACR Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: Administrative claims and electronic health record (EHR) data are commonly used to assess outcomes in rheumatoid arthritis (RA). However, direct measures of functional status are typically not available in these data sources to control for confounding in comparative effectiveness research.

Methods: Corrona registry data linked to Medicare claims (2006-2014) were used to build a claims-based disability classifier, measured by HAQ. Eligible patients had RA per Corrona rheumatologist, and >=1yr prior coverage. Demographics, socioeconomic factors, comorbidities, healthcare utilization, and medications from claims data were included as predictors.

In separate analyses, HAQ was classified dichotomously (<1, ≥1), as 3 categories (0–<0.5, ≥0.5–<1.5, and ≥1.5–3), and as a continuous variable, converted to corresponding HAQ category. Generalized logistic regression (GenLogit) with LASSO for variable selection and results were compared with machine learning methods including RandomForests, using a forest of 2000 trees. Separate models were run classifying each of the 8 HAQ subdomains separately, and then summing to form the composite HAQ score. Misclassification rates were compared and the area under the receiver operator curves (AUROC) was described.

Results:   A total of 2,788 RA patients were eligible, classifying 52% of patients with low (n=1448) and 48% with high (n=1340) HAQ; and as 3 categories, low (n=887), moderate (n=1109), and high (n=792). Univariable analysis showed higher HAQ was associated with older age, being disabled (per Medicare), rural residence, and greater comorbidity burden, and higher healthcare utilization.

Variables selected by various methods were similar (Table). In the 2 category HAQ models, overall misclassification was 29% (RandomForests), 28% , and 38% (LASSO), with an AUROC of 0.84. In the 3 category HAQ models, RandomForests yielded misclassification of ~48% that did not meaningfully differ across the 3 HAQ categories. Misclassification of the GenLogit model varied widely by HAQ category. When misclassification did occur in the 3 category analysis, patients were usually 1 category off; more extreme misclassification (categorizing low HAQ patients as high, or vice-versa) was uncommon (<8%). The median (IQR) difference in the (observed – predicted) HAQ was 0.00 (-0.45, 0.41) units. Ongoing work is refining these models, reducing the misclassification rate, and validating the approach.

Conclusion: Results from this preliminary analysis suggest that administrative claims and EHR data might be useful to classify RA-related disability as measured by the HAQ with reasonable accuracy. Larger datasets and richer information in EHR data likely will improve the accuracy of these methods.

 

Table: Key variables from administrative claims data selected by
machine learning methods to classify HAQ category*

 

RandomForests

Generalized logistic regression with LASSO

Age

Number of rheumatology visits

Number of AHRQ CCS comorbidities

Number of unique medications (any type)

Number of outpatient physician visits

Baseline steroid use

Median household income

Elixhauser comorbidity index

Wheelchair

Disable

Sex

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

X

*results shown for 3 category HAQ models

 


Disclosure: J. R. Curtis, AbbVie, Roche/Genentech, BMS, UCB, Myraid, Lilly, Amgen, Janssen, Pfizer, Corrona, 5,Amgen, Pfizer, Crescendo Bio, Corrona, 9; H. Yun, Bristol-Myers Squibb, 2; C. J. Etzel, Corrona, LLC, 3,Merck Human Health, 9; S. Yang, None; L. Chen, None.

To cite this abstract in AMA style:

Curtis JR, Yun H, Etzel CJ, Yang S, Chen L. Use of Machine Learning and Traditional Statistical Methods to Classify RA-Related Disability Using Administrative Claims Data [abstract]. Arthritis Rheumatol. 2017; 69 (suppl 10). https://acrabstracts.org/abstract/use-of-machine-learning-and-traditional-statistical-methods-to-classify-ra-related-disability-using-administrative-claims-data/. Accessed .
  • Tweet
  • Email
  • Print

« Back to 2017 ACR/ARHP Annual Meeting

ACR Meeting Abstracts - https://acrabstracts.org/abstract/use-of-machine-learning-and-traditional-statistical-methods-to-classify-ra-related-disability-using-administrative-claims-data/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology