Spatiotemporal modeling of air pollutant concentrations in Germany using machine learning

oleh: V. Balamurugan, J. Chen, A. Wenzel, F. N. Keutsch, F. N. Keutsch

Format: Article
Diterbitkan: Copernicus Publications 2023-09-01

Deskripsi

<p>Machine learning (ML) models are becoming a meaningful tool for modeling air pollutant concentrations. ML models are capable of learning and modeling complex nonlinear interactions between variables, and they require less computational effort than chemical transport models (CTMs). In this study, we used gradient-boosted tree (GBT) and multi-layer perceptron (MLP; neural network) algorithms to model near-surface nitrogen dioxide (<span class="inline-formula">NO<sub>2</sub></span>) and ozone (<span class="inline-formula">O<sub>3</sub></span>) concentrations over Germany at 0.1<span class="inline-formula"><sup>∘</sup></span> spatial resolution and daily intervals.</p> <p>We trained the ML models using TROPOspheric Monitoring Instrument (TROPOMI) satellite column measurements combined with information on emission sources, air pollutant precursors, and meteorology as feature variables. We found that the trained GBT model for <span class="inline-formula">NO<sub>2</sub></span> and <span class="inline-formula">O<sub>3</sub></span> explained a major portion of the observed concentrations (<span class="inline-formula"><i>R</i><sup>2</sup>=0.68</span>–0.88 and <span class="inline-formula">RMSE=4.77</span>–8.67 <span class="inline-formula">µg m<sup>−3</sup></span>; <span class="inline-formula"><i>R</i><sup>2</sup>=0.74</span>–0.92 and <span class="inline-formula">RMSE=8.53</span>–13.2 <span class="inline-formula">µg m<sup>−3</sup></span>, respectively). The trained MLP model performed worse than the trained GBT model for both <span class="inline-formula">NO<sub>2</sub></span> and <span class="inline-formula">O<sub>3</sub></span> (<span class="inline-formula"><i>R</i><sup>2</sup>=0.46</span>–0.82 and <span class="inline-formula"><i>R</i><sup>2</sup>=0.42</span>–0.9, respectively).</p> <p>Our <span class="inline-formula">NO<sub>2</sub></span> GBT model outperforms the CAMS model, a data-assimilated CTM but slightly underperforms for <span class="inline-formula">O<sub>3</sub></span>. However, our <span class="inline-formula">NO<sub>2</sub></span> and <span class="inline-formula">O<sub>3</sub></span> ML models require less computational effort than CTM. Therefore, we can analyze people's exposure to near-surface <span class="inline-formula">NO<sub>2</sub></span> and <span class="inline-formula">O<sub>3</sub></span> with significantly less effort. During the study period (30 April 2018 and 1 July 2021), it was found that around 36 % of people lived in locations where the World Health Organization (WHO) <span class="inline-formula">NO<sub>2</sub></span> limit was exceeded for more than 25 % of the days during the study period, while 90 % of the population resided in areas where the WHO <span class="inline-formula">O<sub>3</sub></span> limit was surpassed for over 25 % of the study days. Although metropolitan areas had high <span class="inline-formula">NO<sub>2</sub></span> concentrations, rural areas, particularly in southern Germany, had high <span class="inline-formula">O<sub>3</sub></span> concentrations.</p> <p>Furthermore, our ML models can be used to evaluate the effectiveness of mitigation policies. Near-surface <span class="inline-formula">NO<sub>2</sub></span> and <span class="inline-formula">O<sub>3</sub></span> concentration changes during the 2020 COVID-19 lockdown period over Germany were indeed reproduced by the GBT model, with meteorology-normalized near-surface <span class="inline-formula">NO<sub>2</sub></span> having significantly decreased (by <span class="inline-formula">23±5.3</span> %) and meteorology-normalized near-surface <span class="inline-formula">O<sub>3</sub></span> having slightly increased (by <span class="inline-formula">1±4.6</span> %) over 10 major German metropolitan areas when compared to 2019. Finally, our <span class="inline-formula">O<sub>3</sub></span> GBT model is highly transferable to neighboring countries and locations where no measurements are available (<span class="inline-formula"><i>R</i><sup>2</sup>=0.87</span>–0.94), whereas our <span class="inline-formula">NO<sub>2</sub></span> GBT model is moderately transferable (<span class="inline-formula"><i>R</i><sup>2</sup>=0.32</span>–0.64).</p>