DOAJ Open Access 2026

Accurate and interpretable prediction of chemical oxygen demand using explainable boosting algorithms with SHAP analysis

Khaled Merabet Sungwon Kim Salim Heddam Fabio Di Nunno Francesco Granata +4 lainnya

Abstrak

Abstract Accurate prediction of Chemical Oxygen Demand (COD) is vital for effective water quality management and pollution control. This study compares six ensemble boosting models, AdaBoost, CatBoost, XGBoost, LightGBM, HistGBRT, and NGBoost, for estimating COD from multiple water quality parameters, including pH, dissolved oxygen, suspended solids, and specific conductance. Data from two monitoring stations in South Korea (Toilchun and Hwangji) were used to train and validate the models. Model performance was evaluated using RMSE, MAE, R, NSE, and PBIAS, while interpretability was assessed through SHapley Additive exPlanations (SHAP). Results showed that NGBoost achieved the highest predictive accuracy at Toilchun (R = 0.979, NSE = 0.958, RMSE = 0.397 mg/L), while CatBoost performed best at Hwangji (R = 0.861, NSE = 0.733, RMSE = 0.477 mg/L). As NGBoost provides predictive probability distributions rather than single estimates, its results also reflect model uncertainty, supporting a more robust quantification of COD variability. SHAP analysis identified total organic carbon (TOC), biochemical oxygen demand (BOD₅), and suspended solids (SS) as the most influential variables controlling COD dynamics.

Topik & Kata Kunci

Penulis (9)

K

Khaled Merabet

S

Sungwon Kim

S

Salim Heddam

F

Fabio Di Nunno

F

Francesco Granata

O

Ozgur Kisi

R

Rana Muhammad Adnan

M

Mohammad Zounemat-Kermani

C

Christoph Külls

Format Sitasi

Merabet, K., Kim, S., Heddam, S., Nunno, F.D., Granata, F., Kisi, O. et al. (2026). Accurate and interpretable prediction of chemical oxygen demand using explainable boosting algorithms with SHAP analysis. https://doi.org/10.1038/s41598-026-38757-4

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1038/s41598-026-38757-4
Informasi Jurnal
Tahun Terbit
2026
Sumber Database
DOAJ
DOI
10.1038/s41598-026-38757-4
Akses
Open Access ✓