ACR Meeting Abstracts

ACR Meeting Abstracts

  • Meetings
    • ACR Convergence 2024
    • ACR Convergence 2023
    • 2023 ACR/ARP PRSYM
    • ACR Convergence 2022
    • ACR Convergence 2021
    • ACR Convergence 2020
    • 2020 ACR/ARP PRSYM
    • 2019 ACR/ARP Annual Meeting
    • 2018-2009 Meetings
    • Download Abstracts
  • Keyword Index
  • Advanced Search
  • Your Favorites
    • Favorites
    • Login
    • View and print all favorites
    • Clear all your favorites
  • ACR Meetings

Abstract Number: 0257

Application of Text Mining Methods to Identify Lupus Nephritis from Electronic Health Records

Milena Gianfrancesco1, Suzanne Tamang2, Gabriela Schmajuk3 and Jinoos Yazdany4, 1University of California, San Francisco, San Francisco, CA, 2Stanford Center for Population Health Sciences, Redwood City, CA, 3University of California, San Francisco, Atherton, CA, 4UCSF, San Francisco, CA

Meeting: ACR Convergence 2020

Keywords: informatics, Nephritis, Systemic lupus erythematosus (SLE)

  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print
Session Information

Date: Friday, November 6, 2020

Title: SLE – Diagnosis, Manifestations, & Outcomes Poster I: Clinical Manifestations

Session Type: Poster Session A

Session Time: 9:00AM-11:00AM

Background/Purpose: Lupus nephritis (LN) is a frequent complication of SLE and associated with higher morbidity and mortality. Accurate estimates of the prevalence of LN in the population remain limited due to the inability to capture this information through structured data fields such as ICD codes in electronic health records (EHR). We developed a text mining pipeline to extract information on LN and class from clinical notes in the EHR of a large, racially/ethnically diverse university health system.

Methods: Individuals with a single diagnosis code for SLE in the EHR between June 2012 – February 2019 were included. All available clinical notes (including physician ambulatory and hospital progress reports, and biopsy reports) were extracted and annotated using an open source clinical text-mining tool, the Clinical Event Recognizer (CLEVER). CLEVER is a hybrid information extraction approach that combines rule-based (semantic) and statistical components to annotate and extract information from millions of clinical notes efficiently. A custom-built dictionary that included “lupus nephritis,” “class,” and various associated terms referring to nephritis was built. Performance of the text-mining tool in identifying LN was assessed by calculating the sensitivity and specificity against a gold-standard subset of SLE patients for whom ACR criteria were assessed through chart review. We also compared our findings to a published algorithm in which LN is identified using structured data only.

Results: We included 2,782 SLE patients; these patients had a total of 614,683 clinical notes. Most patients were female (87%), with a mean age of 46.9 years (+ 17.9), and the sample was racially/ethnically diverse (Table 1). A total of 18,354 positive and 9,293 negative mentions of LN were detected using CLEVER. Positive mentions were captured for 848 unique individuals with SLE, indicating that 30% of our SLE population had LN, similar to previously published estimates. When compared to a gold-standard set of chart-reviewed cases (n=152), our text mining tool detected LN with 96% sensitivity and 94% specificity. Compared to a previous algorithm of LN detection based on structured data fields (such as ICD codes) alone, our text-mining pipeline identified 631 additional cases of LN that would have otherwise not been captured (Table 2). Chart review of notes for a random sample of 50 cases from these 631 additional cases indicated that 86% were true positives. Thirty-seven cases that were not tagged by CLEVER, but positive according to the structured data algorithm, were found by chart review to be true negatives, with phrases such as “unclear,” “possible,” “to be evaluated,” or “no evidence.” Additionally, of those with LN, CLEVER was able to detect the specific class of LN (I-VI) in 415 (50%) cases.

Conclusion: We developed the first text-mining strategy to extract SLE LN status and class from clinical notes. Additional evaluation on clinical notes from the EHRs of additional institutions is ongoing to examine the generalizability of this algorithm. Further refinement of the pipeline will allow us to determine factors associated with this important disease outcome.


Disclosure: M. Gianfrancesco, None; S. Tamang, None; G. Schmajuk, None; J. Yazdany, Eli Lilly, 5, Astra Zeneca, 5.

To cite this abstract in AMA style:

Gianfrancesco M, Tamang S, Schmajuk G, Yazdany J. Application of Text Mining Methods to Identify Lupus Nephritis from Electronic Health Records [abstract]. Arthritis Rheumatol. 2020; 72 (suppl 10). https://acrabstracts.org/abstract/application-of-text-mining-methods-to-identify-lupus-nephritis-from-electronic-health-records/. Accessed .
  • Tweet
  • Click to email a link to a friend (Opens in new window) Email
  • Click to print (Opens in new window) Print

« Back to ACR Convergence 2020

ACR Meeting Abstracts - https://acrabstracts.org/abstract/application-of-text-mining-methods-to-identify-lupus-nephritis-from-electronic-health-records/

Advanced Search

Your Favorites

You can save and print a list of your favorite abstracts during your browser session by clicking the “Favorite” button at the bottom of any abstract. View your favorites »

All abstracts accepted to ACR Convergence are under media embargo once the ACR has notified presenters of their abstract’s acceptance. They may be presented at other meetings or published as manuscripts after this time but should not be discussed in non-scholarly venues or outlets. The following embargo policies are strictly enforced by the ACR.

Accepted abstracts are made available to the public online in advance of the meeting and are published in a special online supplement of our scientific journal, Arthritis & Rheumatology. Information contained in those abstracts may not be released until the abstracts appear online. In an exception to the media embargo, academic institutions, private organizations, and companies with products whose value may be influenced by information contained in an abstract may issue a press release to coincide with the availability of an ACR abstract on the ACR website. However, the ACR continues to require that information that goes beyond that contained in the abstract (e.g., discussion of the abstract done as part of editorial news coverage) is under media embargo until 10:00 AM ET on November 14, 2024. Journalists with access to embargoed information cannot release articles or editorial news coverage before this time. Editorial news coverage is considered original articles/videos developed by employed journalists to report facts, commentary, and subject matter expert quotes in a narrative form using a variety of sources (e.g., research, announcements, press releases, events, etc.).

Violation of this policy may result in the abstract being withdrawn from the meeting and other measures deemed appropriate. Authors are responsible for notifying colleagues, institutions, communications firms, and all other stakeholders related to the development or promotion of the abstract about this policy. If you have questions about the ACR abstract embargo policy, please contact ACR abstracts staff at [email protected].

Wiley

  • Online Journal
  • Privacy Policy
  • Permissions Policies
  • Cookie Preferences

© Copyright 2025 American College of Rheumatology