An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China
Abstrak
The global acceleration of population urbanization has transformed cities into primary spatial hubs of human activity. As urban populations continue to expand, identifying the socioeconomic drivers of urbanization and elucidating their underlying mechanisms are essential for achieving Sustainable Development Goal 11, established by the United Nations. This study leverages machine learning and big data to investigate the determinants of population urbanization in China over the period 1991–2023. Utilizing the XGBoost algorithm combined with SHAP (Shapley Additive Explanations), the analysis reveals a tripartite structure of key drivers encompassing industrial support, employment orientation, and infrastructure accessibility. Regional assessments indicate distinct urbanization patterns: Eastern coastal areas are predominantly driven by finance and service industries; central inland regions follow an investment-led trajectory anchored in infrastructure development and real estate expansion, while the western interior relies mainly on employment-centered strategies. Partial Dependence Plots (PDPs) highlighted spatial variations in the effects of sensitive factors, with interaction analyses revealing synergistic effects between tertiary sector shares and the working-age share in eastern coastlands, structural amplification by real estate investment with appropriate working-age population shares in the central inlands, and balancing interactions between GDP growth rates and tertiary sector shares in the western interior. These findings contribute to a more nuanced understanding of the socioeconomic forces shaping urbanization and offer evidence-based recommendations for policymakers in other developing countries seeking to foster sustainable urban growth.
Topik & Kata Kunci
Penulis (1)
Ziheng Shangguan
Akses Cepat
- Tahun Terbit
- 2025
- Sumber Database
- DOAJ
- DOI
- 10.3390/systems13080679
- Akses
- Open Access ✓