ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0383

Identification of Contributing Factors to Difficult-to-Treat Rheumatoid Arthritis (D2T RA) in a Cohort of 972 Patients Using a Natural Language Processing (NLP) Approach

Hugo BERGIER1, Thibaut Fabacher2, Nathanaël Sedmak2, Erik Sauleau2 and Jacques-Eric Gottenberg3, 1Hautepierre Hospital, Strasbourg, France, 2Hôpitaux Universitaires de Strasbourg, Strasbourg, France, 3Rheumatology Department, Strasbourg University Hospital, Strasbourg, France

Meeting: ACR Convergence 2023

Keywords: Bioinformatics, Biologicals, Data Management, Disease-Modifying Antirheumatic Drugs (Dmards), rheumatoid arthritis

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Sunday, November 12, 2023

Title: (0380–0422) RA – Diagnosis, Manifestations, and Outcomes Poster I

Session Type: Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: Natural Language Processing (NLP), an interdisciplinary field combining artificial intelligence and language science, has gained significant interest in the medical domain for automated collection and structuring of medical data. In this study, we employed an NLP approach to identify and extract the characteristics of patients with D2T RA from their computerized medical records.

Methods: We conducted a monocentric observational retrospective cohort study in our French hospital. Patients were longitudinally recruited over a 5-year period (2015-2020), and their characteristics were retrieved from their electronic health records spanning 2010-2022. Firstly, we performed an NLP phenotyping tool to identify patients who had at least one hospital stay related to RA between 2015 and 2020. The tool relied on CIM-10 coding and the recognition of specific keywords (e.g., “rheumatoid arthritis” or “ACPA”) in the medical records to calculate the patient’s probability of meeting the RA phenotype. We subsequently employed a named-entity recognition algorithm to identify all drugs associated with RA from the patients’ medical reports. The drugs were then arranged in chronological order based on their appearance, and the date of introduction was approximated as the date of first extraction. D2T RA was defined as the failure of at least two targeted therapies (identified by the algorithm retrieving three or more biologics), thereby allowing us to separate our RA cohort into D2T and non-D2T RA groups. Finally, we used another NLP program employing named-entity recognition to retrieve the characteristics of these patients. Specific keywords related to general variables (sex, age at first stay), disease characteristics (presence of joint erosion, DAS28 activity score), and comorbidities were sought in the medical reports using cumulative extraction. The occurrence of these specific tokens was linked to the presence of the respective variable. Finally, a multivariate logistic regression analysis was performed to assess the relationship between variables and the D2T group.

Results: The phenotyping tool identified 972 RA patients, of which 313 were classified as D2T RA and 659 as non-D2T RA. The presence of joint erosions (OR 2.97 CI95% [2.17; 4.08], p< 0.001) and higher disease activity (median DAS28 3.16 vs 2.89) were associated with the D2T group. Female gender, obesity, smoking, anxio-depression, and fibromyalgia comorbidities showed similar proportions in our two groups. Chronic kidney disease (OR=0.66 CI95% [0.44;1.01], p=0.068), cardiac insufficiency (OR 0.52 CI95% [0.38;0.7], p=0.001), liver insufficiency (OR=0.7 CI95% [0.35;1.4], p=0.39), and a history of malignancy (OR=0.62 CI95% [0.47;0.82], p=0.001) were found in higher proportions in the non-D2T group.

Conclusion: NLP facilitates the automated collection of diverse data directly from medical reports using named-entity recognition algorithms. In this study, we demonstrated its application in identifying potential contributing factors to D2T RA. Enhanced understanding of the mechanisms underlying D2T RA and early detection of contributing factors hold promise for improved outcomes and management in this heterogeneous population.


Disclosures: H. BERGIER: None; T. Fabacher: None; N. Sedmak: None; E. Sauleau: None; J. Gottenberg: AbbVie, 2, BMS, 2, 5, Galapagos, 2, Gilead, 2, Lilly, 2, MSD, 2, Novartis, 2, Pfizer, 2, 5.

To cite this abstract in AMA style:

BERGIER H, Fabacher T, Sedmak N, Sauleau E, Gottenberg J. Identification of Contributing Factors to Difficult-to-Treat Rheumatoid Arthritis (D2T RA) in a Cohort of 972 Patients Using a Natural Language Processing (NLP) Approach [abstract]. Arthritis Rheumatol. 2023; 75 (suppl 9). https://acrabstracts.org/abstract/identification-of-contributing-factors-to-difficult-to-treat-rheumatoid-arthritis-d2t-ra-in-a-cohort-of-972-patients-using-a-natural-language-processing-nlp-approach/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2023

ACR Meeting Abstracts - https://acrabstracts.org/abstract/identification-of-contributing-factors-to-difficult-to-treat-rheumatoid-arthritis-d2t-ra-in-a-cohort-of-972-patients-using-a-natural-language-processing-nlp-approach/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology