Session Information
Session Type: Poster Session C
Session Time: 9:00AM-11:00AM
Background/Purpose: Accurate identification of prevalent cases of lupus nephritis (LN) is essential for timely patient monitoring and treatment, advancing research, and informing public health initiatives for the management of LN. However, diagnosis codes for LN are generally underutilized, making identification of this patient population in real-world databases challenging. We developed a scoring system to quantify the probability of accurate LN case identification.
Methods: We used data from EHRs of two large health systems and included patients with ≥1 ICD9/10 codes for SLE from June 2012 to Jan 2022. Prevalent LN was defined as current active LN or a history of LN. We used regular expressions with negation to loosely tag LN within EHR notes, in a training set consisting of a balanced sample of 2038 patients from the larger health system. Testing sets included 100 patients randomly selected from each health system and were manually chart reviewed to classify patients as having ‘no LN’, ‘definite LN’ (biopsy report of Class III, IV or V LN), ‘potential LN’ (no biopsy report but physician diagnosed LN), and ‘diagnostic uncertainty’ (physician states LN is possible). A gradient boosting model (GBM) including 42 predictors that covered demographics, encounters, diagnosis and procedure codes, comorbidities, medications, and laboratory test results (e.g., serologies, urine studies, chemistries) was used for predictor selection. Predictive performance of a logit regression model (LRM) including key predictors from GBM was evaluated for identifying patients with a “strict” (definite LN) or an “inclusive” (definite LN, potential LN, or diagnostic uncertainty) definition of LN. A LRM-based scoring system was developed and calibrated.
Results: Table 1 includes demographics of the 4,522 patients meeting the eligibility criteria from both health systems. In addition to more specific diagnosis codes for LN, presence of diagnosis codes for acute or chronic kidney disease or proteinuria, younger age at first SLE diagnosis code, and use of mycophenolate mofetil or mycophenolic acid were identified as key predictors and included in the final LRM. Urine protein creatinine ratios (UPCR) >0.5, abnormal complement component 3 (C3) levels, any use of hydroxychloroquine, azathioprine, or rituximab, and glucocorticoid dose were also identified as important predictors but were omitted from the final LRM as their inclusion did not further improve performance. The final LRM had an area under the curve, sensitivity, and positive predictive value of 0.93, 0.88, and 0.84, respectively, for identifying LN using the inclusive definition, performed similarly with a strict LN definition, and had good external validity when tested in the second health system (Table 2). Predicted and observed probabilities had good calibration (Table 2).The scoring system was derived from this model (Table 3).
Conclusion: Prediction of prevalent LN using data elements available in EHR or claims data was feasible, had good accuracy and was validated externally. The scoring system has the potential to identify prevalent LN accurately across health systems.
Disclaimer: Aurinia provided funding for the study.
To cite this abstract in AMA style:
Izadi Z, Aguirre A, Anastasiou C, Kay J, Schmajuk G, Yazdany J. Development of a Scoring System for Accurate Lupus Nephritis Case Identification in Real-World Databases [abstract]. Arthritis Rheumatol. 2023; 75 (suppl 9). https://acrabstracts.org/abstract/development-of-a-scoring-system-for-accurate-lupus-nephritis-case-identification-in-real-world-databases/. Accessed .« Back to ACR Convergence 2023
ACR Meeting Abstracts - https://acrabstracts.org/abstract/development-of-a-scoring-system-for-accurate-lupus-nephritis-case-identification-in-real-world-databases/