Development of a Scoring System for Accurate Lupus Nephritis Case Identification in Real-World Databases

Zara Izadi¹, Alfredo Aguirre¹, Christine Anastasiou¹, Julia Kay¹, Gabriela Schmajuk² and Jinoos Yazdany³, ¹University of California San Francisco, San Francisco, CA, ²UCSF / SFVA, San Francisco, CA, ³University of California, General Department of Medicine, Division of Rheumatology, San Francisco, CA

Meeting: ACR Convergence 2023

Keywords: Bioinformatics, Epidemiology, Health Services Research, Nephritis, Systemic lupus erythematosus (SLE)

Session Information

Date: Tuesday, November 14, 2023

Title: (1840–1861) Health Services Research Poster III

Session Type: Poster Session C

Session Time: 9:00AM-11:00AM

Background/Purpose: Accurate identification of prevalent cases of lupus nephritis (LN) is essential for timely patient monitoring and treatment, advancing research, and informing public health initiatives for the management of LN. However, diagnosis codes for LN are generally underutilized, making identification of this patient population in real-world databases challenging. We developed a scoring system to quantify the probability of accurate LN case identification.

Methods: We used data from EHRs of two large health systems and included patients with ≥1 ICD9/10 codes for SLE from June 2012 to Jan 2022. Prevalent LN was defined as current active LN or a history of LN. We used regular expressions with negation to loosely tag LN within EHR notes, in a training set consisting of a balanced sample of 2038 patients from the larger health system. Testing sets included 100 patients randomly selected from each health system and were manually chart reviewed to classify patients as having ‘no LN’, ‘definite LN’ (biopsy report of Class III, IV or V LN), ‘potential LN’ (no biopsy report but physician diagnosed LN), and ‘diagnostic uncertainty’ (physician states LN is possible). A gradient boosting model (GBM) including 42 predictors that covered demographics, encounters, diagnosis and procedure codes, comorbidities, medications, and laboratory test results (e.g., serologies, urine studies, chemistries) was used for predictor selection. Predictive performance of a logit regression model (LRM) including key predictors from GBM was evaluated for identifying patients with a “strict” (definite LN) or an “inclusive” (definite LN, potential LN, or diagnostic uncertainty) definition of LN. A LRM-based scoring system was developed and calibrated.

Results: Table 1 includes demographics of the 4,522 patients meeting the eligibility criteria from both health systems. In addition to more specific diagnosis codes for LN, presence of diagnosis codes for acute or chronic kidney disease or proteinuria, younger age at first SLE diagnosis code, and use of mycophenolate mofetil or mycophenolic acid were identified as key predictors and included in the final LRM. Urine protein creatinine ratios (UPCR) >0.5, abnormal complement component 3 (C3) levels, any use of hydroxychloroquine, azathioprine, or rituximab, and glucocorticoid dose were also identified as important predictors but were omitted from the final LRM as their inclusion did not further improve performance. The final LRM had an area under the curve, sensitivity, and positive predictive value of 0.93, 0.88, and 0.84, respectively, for identifying LN using the inclusive definition, performed similarly with a strict LN definition, and had good external validity when tested in the second health system (Table 2). Predicted and observed probabilities had good calibration (Table 2).The scoring system was derived from this model (Table 3).

Conclusion: Prediction of prevalent LN using data elements available in EHR or claims data was feasible, had good accuracy and was validated externally. The scoring system has the potential to identify prevalent LN accurately across health systems.
Disclaimer: Aurinia provided funding for the study.

Table 1. Characteristics of the underlying population.

Table 2. Performance of the final logit regression model including key predictors.
LN: Lupus nephritis; AUC: area under the curve; PPV: positive predictive value; NPV: negative predictive value;
Strict definition of LN: definite LN; Inclusive definition of LN: definite LN, potential LN, or diagnostic uncertainty.

Table 3. The scoring system and interpretation.
Diagnosis codes for lupus nephritis included ICD10 codes: M32.14 or M32.15, or ICD9 code 710.0 in combination with ICD9 codes 583.81, 581.81, or 583.89. Diagnosis codes for acute or chronic kidney disease or proteinuria included ICD10: N00-N08, N17-N19, and R80, or ICD9 codes 580-586, and 791.0.

Disclosures: Z. Izadi: Bristol-Myers Squibb(BMS), 3; A. Aguirre: None; C. Anastasiou: None; J. Kay: Pfizer, 12, Own Stock; G. Schmajuk: None; J. Yazdany: AstraZeneca, 2, 5, Aurinia, 5, Gilead, 5, Pfizer, 2.

To cite this abstract in AMA style:

Izadi Z, Aguirre A, Anastasiou C, Kay J, Schmajuk G, Yazdany J. Development of a Scoring System for Accurate Lupus Nephritis Case Identification in Real-World Databases [abstract]. Arthritis Rheumatol. 2023; 75 (suppl 9). https://acrabstracts.org/abstract/development-of-a-scoring-system-for-accurate-lupus-nephritis-case-identification-in-real-world-databases/. Accessed .

« Back to ACR Convergence 2023

ACR Meeting Abstracts - https://acrabstracts.org/abstract/development-of-a-scoring-system-for-accurate-lupus-nephritis-case-identification-in-real-world-databases/