ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0549

Developing an Algorithm for Identifying Mortality in MarketScan Claims Data Using Machine Learning

Fenglong Xie1, Hong Zhao1, Huifeng Yun2, Sasha Bernatsky3 and Jeffrey R Curtis2, 1University of Alabama at Birmingham, Birmingham, AL, 2Division of Clinical Immunology and Rheumatology, University of Alabama at Birmingham, Birmingham, AL, 3The Research Institute of the McGill University Health Centre, Montreal, ON, Canada

Meeting: ACR Convergence 2020

Keywords: Mortality

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Saturday, November 7, 2020

Title: Epidemiology & Public Health Poster II: OA, Osteoporosis, & Other Rheumatic Disease

Session Type: Poster Session B

Session Time: 9:00AM-11:00AM

Background/Purpose: In 2016, MarketScan data no longer included information about inpatient mortality, compromising the ability to study fatal hospitalization events. Using data through 2015 when mortality remained available, we developed an algorithm to accurately identify in-hospital mortality using coverage patterns, proximate healthcare claims diagnoses, and corresponding information for family members.

Methods: We selected the latest hospital claim in 2011-2015 MarketScan data for each individual. Hospitalizations with discharge status of (20, 40, 41) were defined as death, and (21, 87, missing) as alive. Predictors included age, sex, coverage disenrollment post hospitalization, diagnosis codes and timing of last submitted claim, and corresponding information from family members linked by family ID (to confirm family members remained enrolled). Individual predictors were optimized using the c (concordance) index. Datasets were split into Training (80% random sample of hospitalizations, 2011-2013); Test1 (80% in 2014-2015) and Test2 (remaining 20%, 2011-2015). Machine learning (ML) methods included decision tree (DT), random forest (RF), elastic-net regularization (ER) and XGBoost methods. Hyper-parameters were tuned using a random search method with 10-fold cross validation across a preset range. was used to assess model performance. Each model was validated in Test1 and Test2 datasets separately. Sensitivity, specificity, positive predicted value (PPV) and accuracy were calculated in all datasets.

Results: 1,307,532 hospitalizations were selected among patients; mean age was 47.4 (standard deviation 25.9) years and 43.6% were male. Training data included 727,887 hospitalizations, 23.1% ending in death. Disenrollment ending within 30 days (Figure 1), and last claim within 90 days were optimized as single predictors (Figure 2). Models using different ML methods performed well in all datasets, albeit with a trend for less optimal parameters with decision tree methods. In test datasets, PPV and accuracy was as high as 0.91 and 0.95 respectively.

Conclusion: We derived and validated an algorithm for identifying in-hospital mortality in Marketscan claims data which performed well with >90% accuracy. This represents filling of a key gap for health outcomes research with Marketscan data.

Figure 1: Optimization of cut-point for disenrollment

Figure 2: Optimization of interval from hospital discharge date to last claim

Table: Model performance to classify in hospital mortality in Training and Test datasets


Disclosure: F. Xie, None; H. Zhao, None; H. Yun, Pfizer, 2; S. Bernatsky, None; J. Curtis, AbbVie, 2, 5, Amgen, 2, 5, Bristol-Myers Squibb, 2, 5, Corrona, 2, 5, Janssen, 2, 5, Lilly, 2, 5, Myriad, 2, 5, Pfizer, 2, 5, Regeneron, 2, 5, Roche, 2, 5, UCB, 2, 5, Gilead Sciences, Inc., 5, Sanofi, 5.

To cite this abstract in AMA style:

Xie F, Zhao H, Yun H, Bernatsky S, Curtis J. Developing an Algorithm for Identifying Mortality in MarketScan Claims Data Using Machine Learning [abstract]. Arthritis Rheumatol. 2020; 72 (suppl 10). https://acrabstracts.org/abstract/developing-an-algorithm-for-identifying-mortality-in-marketscan-claims-data-using-machine-learning/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2020

ACR Meeting Abstracts - https://acrabstracts.org/abstract/developing-an-algorithm-for-identifying-mortality-in-marketscan-claims-data-using-machine-learning/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology