基于PU-Bagging负样本采样的白龙江流域泥石流易发性分级评价
万芹江 , 郑鸿超 , 王洪磊 , 吴彬 , 石振明 , 李元伟
地球科学 ›› 2025, Vol. 50 ›› Issue (10) : 4044 -4058.
基于PU-Bagging负样本采样的白龙江流域泥石流易发性分级评价
Classification Assessment of Debris Flow Susceptibility in Bailong River Basin Based on PU-Bagging Negative Sampling
,
为提高白龙江流域泥石流易发分区的可靠性,建立随机森林为基学习器的PU-Bagging负样本采样模型.选取高程、降水量等评价因子,使用逻辑回归、随机森林、支持向量机和XGBoost算法,构建白龙江流域泥石流易发性评价模型.根据混淆矩阵衍生的评价指标、ROC曲线和5种分级方法,对比分析了4种模型的性能,并利用SHAP分析评价因子对模型的贡献程度.结果表明:(1)支持向量机模型结合几何间隔分级方法的灾害识别精度提升了24%.(2)随机森林模型能够识别更多的潜在泥石流样本,而XGBoost模型可减少对非灾害样本的误判.(3)SHAP值对高程变化的敏感性间接反映了高差对泥石流发育的重要性.本研究可以为白龙江流域新型城镇化建设与泥石流防治工程的规划提供数据支撑.
To improve the reliability of the debris flow-prone zones in the Bailong River Basin, a PU-Bagging negative sampling model based on the random forest as the base learner is established. Evaluation factors such as elevation and precipitation were selected, and logistic regression, random forest, support vector machine and XGBoost algorithms were used to construct an evaluation model for the susceptibility of debris flows in the Bailong River Basin. Based on the evaluation indicators derived from the confusion matrix, the ROC curve and five classification methods, the performances of the four models were compared and analyzed, and the contribution degree of the evaluation factors to the model was analyzed by using SHAP. The results show follows. (1) The disaster identification accuracy of the support vector machine model combined with the geometric interval classification method has increased by 24%. (2) The random forest model can identify more potential debris flow samples, while the XGBoost model can reduce the misjudgment of non-disaster samples. (3) The sensitivity of SHAP values to elevation changes indirectly reflects the importance of height differences for the development of debris flows. This research can provide data support for the planning of the new urbanization construction and debris flow prevention and control project in the Bailong River Basin.
白龙江流域 / 负样本采样 / 易发性 / 泥石流 / 分级方法 / 工程地质学.
Bailong River Basin / negative sampling / susceptibility / debris flow / classification method / engineering geology
| [1] |
Breiman, L., 2001.Random Forests.Machine Learning, 45(1):5-32.https://doi.org/10.1023/A:1010933404324 |
| [2] |
Cao, S. A., Guo, Z., Chen, J. L., 2025. Geological Hazard Susceptibility Evaluation Based on Improved Information Model: A Case Study of the G219 National Highway in Zayu County, Xizang. Geological Bulletin of China,44(1): 185-200 (in Chinese with English abstract). |
| [3] |
Chao, Z. Z., 2023. Study on Erosion Characteristics of Different Cover Slopes in High Debris Flow Area of Bailong River Basin (Dissertation). Lanzhou University, Lanzhou (in Chinese with English abstract). |
| [4] |
Deng, M. D., Ju, N. P., Wu, T. W., et al., 2024. Evaluation of Susceptibility under Different Landslide Sample Points and Polygonal Expression Modes. Earth Science, 49(5): 1565-1583 (in Chinese with English abstract). |
| [5] |
Du, G. L., Yang, Z. H., Yuan, Y., et al., 2021. Landslide Susceptibility Mapping in the Sichuan-Tibet Traffic Corridor Using Logistic Regression-Information Value Method. Hydrogeology & Engineering Geology, 48(5): 102-111 (in Chinese with English abstract). |
| [6] |
Du, G. L., Zhang, Y. S., Iqbal, J., et al., 2017. Landslide Susceptibility Mapping Using an Integrated Model of Information Value Method and Logistic Regression in the Bailongjiang Watershed, Gansu Province, China.Journal of Mountain Science, 14(2): 249-268. https://doi.org/10.1007/s11629-016-4126-9 |
| [7] |
Esper Angillieri, M. Y., 2020. Debris Flow Susceptibility Mapping Using Frequency Ratio and Seed Cells, in a Portion of a Mountain International Route, Dry Central Andes of Argentina.CATENA, 189: 104504. https://doi.org/10.1016/j.catena.2020.104504 |
| [8] |
Gu, T. F., Duan, P., Wang, M. G., et al., 2024. Effects of Non-Landslide Sampling Strategies on Machine Learning Models in Landslide Susceptibility Mapping.Scientific Reports, 14(1): 7201. https://doi.org/10.1038/s41598-024-57964-5 |
| [9] |
Huang, F. M., Zhang, J., Zhou, C. B., et al., 2020. A Deep Learning Algorithm Using a Fully Connected Sparse Autoencoder Neural Network for Landslide Susceptibility Prediction. Landslides, 17(1): 217-229. https://doi.org/10.1007/s10346-019-01274-9 |
| [10] |
Huang, Q. L., Chen, W., Fu, X. D., 2018. AHP-RBF Assessment Model of Regional Debris Flow Hazard Supported by Unit Slope. Journal of Zhejiang University (Engineering Science), 52(9): 1667-1675 (in Chinese with English abstract). |
| [11] |
Huang, Y., Zhao, L., 2018. Review on Landslide Susceptibility Mapping Using Support Vector Machines.CATENA, 165: 520-529. https://doi.org/10.1016/j.catena.2018.03.003 |
| [12] |
Kong, J. X., Zhuang, J. Q., Peng, J. B., et al., 2023. Evaluation of Landslide Susceptibility in Chinese Loess Plateau Based on Ⅳ-RF and Ⅳ-CNN Coupling Models. Earth Science, 48(5): 1711-1729 (in Chinese with English abstract). |
| [13] |
Kumar, D., Thakur, M., Dubey, C. S., et al., 2017. Landslide Susceptibility Mapping & Prediction Using Support Vector Machine for Mandakini River Basin, Garhwal Himalaya, India.Geomorphology, 295: 115-125. https://doi.org/10.1016/j.geomorph.2017.06.013 |
| [14] |
Li, K., Zhao, J. S., Lin, Y. L., et al., 2022. Assessment of Debris Flow Susceptibility Based on Different Slope Unit Division Methods and BP Neural Network. Bulletin of Surveying and Mapping, (8): 68-74(in Chinese with English abstract). |
| [15] |
Li, Y. X., Zhang, Y., Su, X. J., et al., 2021. Early Identification and Characteristics of Potential Landslides in the Bailong River Basin Using InSAR Technique. National Remote Sensing Bulletin, 25(2): 677-690 (in Chinese with English abstract). |
| [16] |
Li, Z., Chen, N. S., Hou, R. N., et al., 2024. Susceptibility Assessment of Debris Flow Disaster Based on Machine Learning Models in the Loess Area along Yili Valley. The Chinese Journal of Geological Hazard and Control, 35(3): 129-140 (in Chinese with English abstract). |
| [17] |
Lin, X. X., Xiao, G. R., Zhou, H. B., 2023. Landslide Susceptibility Assessment Method Considering Land Use Dynamic Change. Journal of Geo-Information Science, 25(5): 953-966 (in Chinese with English abstract). |
| [18] |
Liu, B., Guo, H. X., Li, J. L., et al., 2024. Application and Interpretability of Ensemble Learning for Landslide Susceptibility Mapping along the Three Gorges Reservoir Area, China. Natural Hazards, 120(5): 4601-4632. https://doi.org/10.1007/s11069-023-06374-3 |
| [19] |
Liu, C. Z., Wang, J. X., 2024. Research on Classification of Collapse, Landslide and Debris Flow Disaster Chains. Journal of Engineering Geology, 32(5): 1573-1596 (in Chinese with English abstract). |
| [20] |
Liu, G. D., Qin, S. W., Meng, F. Q., et al., 2023. Application of Geographic Information Similarity Based Absence Sampling Method to Debris Flow Susceptibility Mapping. Journal of Engineering Geology, 31(2): 526-537 (in Chinese with English abstract). |
| [21] |
Liu, J., Li, S. L., Chen, T., 2018. Landslide Susceptibility Assesment Based on Optimized Random Forest Model. Geomatics and Information Science of Wuhan University, 43(7): 1085-1091 (in Chinese with English abstract). |
| [22] |
Lv, L., Chen, T., Dou, J., et al., 2022. A Hybrid Ensemble-Based Deep-Learning Framework for Landslide Susceptibility Mapping. International Journal of Applied Earth Observation and Geoinformation, 108: 102713. https://doi.org/10.1016/j.jag.2022.102713 |
| [23] |
Martinello, C., Cappadonia, C., Conoscenti, C., et al., 2021. Optimal Slope Units Partitioning in Landslide Susceptibility Mapping.Journal of Maps, 17(3): 152-162. https://doi.org/10.1080/17445647.2020.1805807 |
| [24] |
Merghadi, A., Yunus, A. P., Dou, J., et al., 2020. Machine Learning Methods for Landslide Susceptibility Studies: A Comparative Overview of Algorithm Performance. Earth-Science Reviews, 207: 103225. https://doi.org/10.1016/j.earscirev.2020.103225 |
| [25] |
Qing, F., Zhao, Y., Chong, Y., et al., 2024. Temporal and Spatial Regularity of Debris Flow Outbreak in Bailong River Basin and Hazard Prediction of River Blocking Disasters. Journal of Lanzhou University (Natural Sciences), 60(4): 488-493 (in Chinese with English abstract). |
| [26] |
Rao, S. S., Leng, X. P., 2024. Debris Flow Susceptibility Evaluation of Liangshan Prefecture Based on the RSIV-RF Model. Bulletin of Geological Science and Technology, 43(1): 275-287 (in Chinese with English abstract). |
| [27] |
Sun, D. L., Gu, Q. Y., Wen, H. J., et al., 2023. Assessment of Landslide Susceptibility along Mountain Highways Based on Different Machine Learning Algorithms and Mapping Units by Hybrid Factors Screening and Sample Optimization.Gondwana Research, 123: 89-106. https://doi.org/10.1016/j.gr.2022.07.013 |
| [28] |
Sun, D. L., Xu, J. H., Wen, H. J., et al., 2021. Assessment of Landslide Susceptibility Mapping Based on Bayesian Hyperparameter Optimization: A Comparison between Logistic Regression and Random Forest.Engineering Geology, 281: 105972. https://doi.org/10.1016/j.enggeo.2020.105972 |
| [29] |
Tian, Y., Gao, B., Yin, H., et al., 2024. Handling Imbalanced Samples in Landslide Susceptibility Evaluation. Hydrogeology & Engineering Geology, 51(6): 171-181 (in Chinese with English abstract). |
| [30] |
Bui, D.T., Tuan, T. A., Klempe, H., et al., 2016. Spatial Prediction Models for Shallow Landslide Hazards: A Comparative Assessment of the Efficacy of Support Vector Machines, Artificial Neural Networks, Kernel Logistic Regression, and Logistic Model Tree.Landslides, 13(2): 361-378. https://doi.org/10.1007/s10346-015-0557-6 |
| [31] |
Wang, Y., Cao, Y., Xu, F. D., et al., 2024. Reservoir Landslide Susceptibility Prediction Considering Non-Landslide Sampling and Ensemble Machine Learning Methods. Earth Science, 49(5): 1619-1635 (in Chinese with English abstract). |
| [32] |
Wu, B., Shi, Z. M., Zheng, H. C., et al., 2024. Impact of Sampling for Landslide Susceptibility Assessment Using Interpretable Machine Learning Models.Bulletin of Engineering Geology and the Environment, 83(11): 461. https://doi.org/10.1007/s10064-024-03980-8 |
| [33] |
Xiong, K., Adhikari, B. R., Stamatopoulos, C. A., et al., 2020. Comparison of Different Machine Learning Methods for Debris Flow Susceptibility Mapping: A Case Study in the Sichuan Province, China.Remote Sensing, 12(2): 295. https://doi.org/10.3390/rs12020295 |
| [34] |
Yang, C., Liu, L. L., Huang, F. M., et al., 2023. Machine Learning-Based Landslide Susceptibility Assessment with Optimized Ratio of Landslide to Non-Landslide Samples.Gondwana Research, 123: 198-216. https://doi.org/10.1016/j.gr.2022.05.012 |
| [35] |
Yu, H. K., Ouyang, J. F., Wang, B. Q., et al., 2024. Susceptibility Assessment of Regional Landslides under Different Sampling Strategies. Safety and Environmental Engineering, 31(5): 122-134, 162(in Chinese with English abstract). |
| [36] |
Zhou, C., Yin, K. L., Cao, Y., et al., 2020. Landslide Susceptibility Assessment by Applying the Coupling Method of Radial Basis Neural Network and Adaboost: A Case Study from the Three Gorges Reservoir Area. Earth Science, 45(6): 1865-1876 (in Chinese with English abstract). |
| [37] |
Zhou, X. Z., Wen, H. J., Zhang, Y. L., et al., 2021. Landslide Susceptibility Mapping Using Hybrid Random Forest with GeoDetector and RFE for Factor Optimization.Geoscience Frontiers, 12(5): 101211. https://doi.org/10.1016/j.gsf.2021.101211 |
| [38] |
Zhou, Y. Y., 2023. Risk Assessment and Prediction of Debris Flow Based on the Coupling Mechanism of Eco-Geological Environment in the Bailong River Basin (Dissertation). Lanzhou University, Lanzhou(in Chinese with English abstract). |
| [39] |
Zhu, H., Zhang, L. M., Xiao, T., et al., 2017. Enhancement of Slope Stability by Vegetation Considering Uncertainties in Root Distribution.Computers and Geotechnics, 85: 84-89. https://doi.org/10.1016/j.compgeo.2016.12.027 |
国家自然科学基金项目-面上项目(42477150)
/
| 〈 |
|
〉 |