DOAJ Open Access 2024

Using Diverse Data Sources to Impute Missing Air Quality Data Collected in a Resource-Limited Setting

Moses Mogakolodi Kebalepile Loveness Nyaradzo Dzikiti Kuku Voyi

Abstrak

The sustainable operation of ambient air quality monitoring stations in developing countries is not always possible. Intermittent failures and breakdowns at air quality monitoring stations often affect the continuous measurement of data as required. These failures and breakdowns result in missing data. This study aimed to impute NO<sub>2</sub>, SO<sub>2</sub>, O<sub>3</sub>, and PM 10 to produce complete data sets of daily average exposures from 2010 to 2017. Models were built for (a) an individual pollutant at a monitoring station, (b) a combined model for the same pollutant from different stations, and (c) a data set with all the pollutants from all the monitoring stations. This study sought to evaluate the efficacy of the Multiple Imputation by Chain Equations (MICE) algorithm in successfully imputing air quality data that are missing at random. The application of classification and regression trees (CART) analysis using the MICE package in the R statistical programming language was compared with the predictive mean matching (PMM) method. The CART method performed better, with the pooled R-squared statistics of the imputed data ranging from 0.3 to 0.7, compared to a range of 0.02 to 0.25 for PMM. The MICE algorithm successfully resolved the incompleteness of the data. It was concluded that the CART method produced better reliable data than the PMM method. However, in this study, the pooled R<sup>2</sup> values were accurate for NO<sub>2</sub>, but not so much for other pollutants.

Topik & Kata Kunci

Penulis (3)

M

Moses Mogakolodi Kebalepile

L

Loveness Nyaradzo Dzikiti

K

Kuku Voyi

Format Sitasi

Kebalepile, M.M., Dzikiti, L.N., Voyi, K. (2024). Using Diverse Data Sources to Impute Missing Air Quality Data Collected in a Resource-Limited Setting. https://doi.org/10.3390/atmos15030303

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3390/atmos15030303
Informasi Jurnal
Tahun Terbit
2024
Sumber Database
DOAJ
DOI
10.3390/atmos15030303
Akses
Open Access ✓