62nd National Congress of the Italian Society of Rheumatology
Vol. 77 No. s1 (2025): Abstract book of the 62th Conference of the Italian Society for Rheumatology, Rimini, 26-29 November 2025

CO:05:1 | Predicting one-year disease activity in systemic lupus erythematosus using machine learning and explainable AI

Pier Giacomo Cerasuolo1, Augusta Ortolan1, Livia Lilli2, Laura Antenucci2, Carlotta Maschiocchi2, Jacopo Lenkowicz2, Lucia Lanzo1, Luca Petricca1, Maria Rita Gigante1, Vivianaantonella Pacucci1, Pierluigi Rizzuti1, Rebecca Ventura1, Silvia Piunno1, Cesare Gavotti1, Marco Gorini3, Gabriella Castellino3, Alfredo Cesario2, Stefano Patarnello2, Silvia Laura Bosello1, Maria Antonietta D'Agostino1. | 1UOC di Reumatologia ed Immunologia Clinica, Fondazione Policlinico A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Roma; 2Gemelli Generator, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Roma; 3AstraZeneca Italy, MIND, Milano, Italy

Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Published: 26 November 2025
99
Views
0
Downloads

Authors

Background: Systemic lupus erythematosus (SLE) is a chronic, multisystem autoimmune disease characterized by marked heterogeneity in clinical presentation, disease course, and response to therapy. This variability makes it particularly challenging to predict disease flare-ups in clinical practice, despite their profound implications on prognosis and quality of life.

 

Objectives: This study aimed to develop a machine learning model to predict the risk of disease activity events in SLE patients within a 12-month time frame, based on clinical status, recent history, and long-term disease trajectory. Methods: We built a structured dataset (SLE Data Mart) from electronic health records using data mining and NLP techniques. It includes demographic, therapeutic, clinical, and laboratory information from SLE patients seen at our University Hospital. Patients had at least one outpatient visit and one hospitalization coded as ICD9 710.0 or 695.4. The outcome—an activity event within 12 months—was defined as any of the following: neurological, renal, or vascular symptoms; new organ domain involvement; or hospitalization for SLE. Input features described current contact, past 12 months, and prior clinical history, expressed as binary or continuous variables. Univariable selection (Chi-Square, Mann-Whitney U tests) identified features significantly associated with the outcome (p < 0.05). Selected variables were used to train multiple models (Logistic Regression, SVC, Decision Tree, Random Forest, XGBoost), evaluated by 5-fold cross-validation. The model with the highest AUC was selected and interpreted using SHAP to identify key predictors.

 

Results: The dataset included 262 SLE patients and 5,952 contacts between 2012–2020. Most patients were female (88%), with a median age of 43 years and median follow-up of 6 years. Patients had a median of 16 contacts each. The cohort was split into training (70%) and testing (30%) sets. Overall, 48% of contacts were associated with an activity event. Univariable selection retained 125 of 192 variables (Table 1). Significant predictors included younger age at contact (median 44 years, p < 0.0001) and abnormal labs (e.g., proteinuria, albuminuria). The best-performing model (AUC = 0.70) identified renal involvement—especially proteinuria >=0.5 g/24h and albuminuria—as the most impactful predictors. SHAP analysis also highlighted the protective role of therapy step-down and of the current hematological or cutaneous involvement (Figure 1).

 

Conclusion: Renal abnormalities strongly predict future SLE activity, but other factors, including hematological and cutaneous involvement, prior therapy changes, and overall disease history, significantly contribute to to determine the risk for an activity event at 12 months.

Acknowledgements. This study was conducted with the unrestricted support of AstraZeneca.

mceclip0-d82fc6d8dcf4840cc32c374cbf1d696a.jpg

 

318_20250609112507.jpg

Downloads

Download data is not yet available.

Citations

How to Cite



1.
CO:05:1 | Predicting one-year disease activity in systemic lupus erythematosus using machine learning and explainable AI: Pier Giacomo Cerasuolo1, Augusta Ortolan1, Livia Lilli2, Laura Antenucci2, Carlotta Maschiocchi2, Jacopo Lenkowicz2, Lucia Lanzo1, Luca Petricca1, Maria Rita Gigante1, Vivianaantonella Pacucci1, Pierluigi Rizzuti1, Rebecca Ventura1, Silvia Piunno1, Cesare Gavotti1, Marco Gorini3, Gabriella Castellino3, Alfredo Cesario2, Stefano Patarnello2, Silvia Laura Bosello1, Maria Antonietta D’Agostino1. | 1UOC di Reumatologia ed Immunologia Clinica, Fondazione Policlinico A. Gemelli IRCCS, Università Cattolica del Sacro Cuore, Roma; 2Gemelli Generator, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Roma; 3AstraZeneca Italy, MIND, Milano, Italy. Reumatismo [Internet]. 2025 Nov. 26 [cited 2026 Jan. 22];77(s1). Available from: https://www.reumatismo.org/reuma/article/view/1970