This study proposed a random forest prediction model optimized with a fuzzy C-means clustering algorithm. The model utilized monitoring data for six air pollutants (O3, PM2.5, PM10, NO2, SO2, CO)along with weather forecast data from 2014 to 2020. Initially, two clustering factors were identified through cross-correlation analysis. O3 concentrations were then classified into three categories using the fuzzy C-means clustering algorithm. A random forest model was subsequently constructed to predict O3 concentrations, with its performance evaluated both before and after clustering. The results indicate that the previous day's O3 and PM10 concentrations have the most significant impact on the next day's O3 levels, and seasonal variations also play a critical role. Following fuzzy C-means clustering, the Mean Absolute Error (MAE) and Root Mean Square Error (RMSE) of the predicted O3_8 h concentrations decreased by 10.5% and 8.8%, respectively. Additionally, the coefficient of determination (R²) increased, confirming an improvement in prediction accuracy. These findings highlight the practical value of the proposed model for forecasting O3 pollution in Shanghai.
NuvoloneD, PetriD, VollerF.The effects of ozone on human health[J].Environ Sci Pollut Res,2018,25(9):8074-8088.
[2]
WangT, XueL K, FengZ Z,et al.Ground-level ozone pollution in China:a synthesis of recent findings on influencing factors and impacts[J].Environ Res Lett,2022,17(6):063003.
[3]
MENGXia, WANGWeidong, SHISu,et al.Evaluating the spatiotemporal ozone characteristics with high-resolution predictions in China's mainland,2013-2019[J].Environ Pollut,2022,299:118865.
GENGChunmei, WANGZongshuang, RENLihong,et al.Study on the impact of elevated atmospheric ozone on crop yield[J].Research of Environmental Sciences,2014,27(3):239-245.(in Chinese)
ZHUXinyue, ZHAOHuan, ZHANGXiaoling,et al.Analysis on the ozone pollution characteristics and its influencing factors in Chengdu[J].Advances in Geosciences,2023,13(10):1216-1229.(in Chinese)
SUNBaolei, SUNHao, ZHANGChaoneng,et al.Forecast of air pollutant concentrations by BP neural network[J].Acta Scientiae Circumstantiae,2017,37(5):1864-1871.(in Chinese)
SUNQuande, JIAORuili, XIAJiangjiang,et al.Adjusting wind speed prediction of numerical weather forecast model based on machine learning methods[J].Meteorological Monthly,2019,45(3):426-436.(in Chinese)
[12]
ZhangH, WuP B, YinA J,et al.Prediction of soil organic carbon in an intensively managed reclamation zone of eastern China:a comparison of multiple linear regressions and the random forest model[J].Sci Total Environ,2017,592:704-713.
SUNSuqi, WANGShigong, LUOBin,et al.Air pollution forecast in winter based on machine learning method in Chengdu[J].Journal of Meteorology and Environment,2020,36(2):98-104.(in Chinese)
[15]
Al-ShammariE T.Public warning systems for forecasting ambient ozone pollution in Kuwait[J].Environ Syst Res,2013,2(1):2.
DINGSu, CHENBaozhang, WANGJin,et al.An applied research of decision-tree based statistical model in forecasting the spatial-temporal distribution of O3 [J].Acta Scientiae Circumstantiae,2018,38(8):3229-3242.(in Chinese)
WANGWeijian, YAOZhanyu, JIAShuo,et al.Application research on random forest algorithm in the statistical test of rainfall enhancement effect[J].Meteorological and Environmental Sciences,2018,41(2):111-117.(in Chinese)
ZHUGuodong, ZHULei, WANGNan,et al.Research on airport temperature forecast method based on automatic machine learning[J].Desert and Oasis Meteorology,2021,15(6):113-119.(in Chinese)
CHENLei, MAJinghui, ZHENXinrong,et al.Variation characteristics and meteorological influencing factors of air pollution in Shanghai[J].Journal of Meteorology and Environment,2017,33(3):59-67.(in Chinese)
YUZhongqi, MAJinghui, MAOZhuocheng,et al.Study on the meteorological conditions and synoptic classifications of O3 pollution in Shanghai in 2017[J].Journal of Meteorology and Environment,2019,35(6):46-54.(in Chinese)
HOUSuxia, ZHANGJianda, LIJing.Analysis of spatiotemporal distribution and correlation factors of atmospheric pollutants in Shanghai city[J].Ecology and Environmental Sciences,2021,30(6):1220-1228.(in Chinese)
LIANGZe, WANGYueyao, YUEYuanwen,et al.A coupling model of genetic algorithm and RBF neural network for the prediction of PM2.5 concentration[J].China Environmental Science,2020,40(2):523-529.(in Chinese)
[32]
NaghibiS A, AhmadiK, DaneshiA.Application of support vector machine,random forest,and genetic algorithm optimized random forest models in groundwater potential mapping[J].Water Resour Manag,2017,31(9):2761-2775.
[33]
BelgiuM, DrăguţL.Random forest in remote sensing:a review of applications and future directions[J].ISPRS J Photogramm Remote Sens,2016,114:24-31.
[34]
ChenT, TrinderJ C, NiuR Q.Object-oriented landslide mapping using ZY-3 satellite imagery,random forest and mathematical morphology,for the three-gorges reservoir,China[J].Remote Sens,2017,9(4):333.
JIHaolin, PENGLiang.Research on short term wind speed forecasting of super closed ball algorithm based on clustering[J].Measurement Control Technology,2016,35(8):138-141, 145.(in Chinese)
[37]
IkotunA M, EzugwuA E, AbualigahL,et al.K-means clustering algorithms:a comprehensive review,variants analysis,and advances in the era of big data[J].Inform Sci,2023,622:178-210.
[38]
DunnJ C.A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters[J].J Cybernet,1973,3(3):32-57.
PENGLi, GAOWei, GENGFuhai,et al.Analysis of ozone vertical distribution in Shanghai area[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2011,47(5):805-811.(in Chinese)
ZOUXudong, YANGHongbin, ZHANGYunhai,et al.Changes of meteorological factors in Shenyang city during 1951-2012 and its relationship with air pollution[J].Ecology and Environmental Sciences,2015,24(1):76-83.(in Chinese)
MEIMei, ZHURong, SUNChaoyang.Study on meteorological conditions for heavy air pollution and its climatic characteristics in“2+26”cities around Beijing-Tianjin-Hebei region in autumn and winter[J].Climate Change Research,2019,15(3):270-281.(in Chinese)
[46]
SicardP, PaolettiE, AgathokleousE,et al.Ozone weekend effect in cities:deep insights for urban air pollution control[J].Environ Res,2020,191:110193.
[47]
LüY, JuQ R, LüF M,et al.Spatiotemporal variations of air pollutants and ozone prediction using machine learning algorithms in the Beijing-Tianjin-Hebei region from 2014 to 2021[J].Environ Pollut,2022,306:119420.
WANGZhanshan, LIYunting, DONGXin,et al.Analysis on weekend effect of air pollutants in urban atmosphere of Beijing[J].Journal of University of Chinese Academy of Sciences,2015,32(6):843-850.(in Chinese)