DOAJ Open Access 2025

An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China

Ziheng Shangguan

Abstrak

The global acceleration of population urbanization has transformed cities into primary spatial hubs of human activity. As urban populations continue to expand, identifying the socioeconomic drivers of urbanization and elucidating their underlying mechanisms are essential for achieving Sustainable Development Goal 11, established by the United Nations. This study leverages machine learning and big data to investigate the determinants of population urbanization in China over the period 1991–2023. Utilizing the XGBoost algorithm combined with SHAP (Shapley Additive Explanations), the analysis reveals a tripartite structure of key drivers encompassing industrial support, employment orientation, and infrastructure accessibility. Regional assessments indicate distinct urbanization patterns: Eastern coastal areas are predominantly driven by finance and service industries; central inland regions follow an investment-led trajectory anchored in infrastructure development and real estate expansion, while the western interior relies mainly on employment-centered strategies. Partial Dependence Plots (PDPs) highlighted spatial variations in the effects of sensitive factors, with interaction analyses revealing synergistic effects between tertiary sector shares and the working-age share in eastern coastlands, structural amplification by real estate investment with appropriate working-age population shares in the central inlands, and balancing interactions between GDP growth rates and tertiary sector shares in the western interior. These findings contribute to a more nuanced understanding of the socioeconomic forces shaping urbanization and offer evidence-based recommendations for policymakers in other developing countries seeking to foster sustainable urban growth.

Penulis (1)

Z

Ziheng Shangguan

Format Sitasi

Shangguan, Z. (2025). An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China. https://doi.org/10.3390/systems13080679

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3390/systems13080679
Informasi Jurnal
Tahun Terbit
2025
Sumber Database
DOAJ
DOI
10.3390/systems13080679
Akses
Open Access ✓