首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于机器学习方法的小麦镉富集因子预测
引用本文:牛硕,李艳玲,杨阳,商艳萍,王天齐,陈卫平.基于机器学习方法的小麦镉富集因子预测[J].环境科学,2023,44(6):3619-3626.
作者姓名:牛硕  李艳玲  杨阳  商艳萍  王天齐  陈卫平
作者单位:郑州大学河南先进技术研究院, 郑州 450003;中国科学院生态环境研究中心城市与区域生态国家重点实验室, 北京 100085;中交天津航道局有限公司天津市疏浚工程技术企业重点实验室, 天津 300461;河南省济源市种子管理站, 济源 459000
基金项目:国家自然科学基金项目(41907353,41977146)
摘    要:应用机器学习方法解析区域土壤-小麦系统镉(Cd)富集特征有助于风险决策的准确性和科学性.基于区域调查,构建了Freundlich-type转移方程、随机森林(RF)模型和神经网络(BPNN)模型对小麦Cd富集因子(BCF-Cd)进行预测,验证不同模型的预测精度并评估其不确定性.结果表明,RF(R2=0.583)和BPNN(R2=0.490)模型预测性能均优于Freundlich转移方程(R2=0.410).重复训练结果显示RF和BPNN平均绝对误差(MAE)和均方根误差(RMSE)较为接近,但RF(R2为0.527~0.601)较BPNN(R2为0.432~0.661)模型精度和稳定性更高.特征变量重要性分析显示多重因素的共同作用导致小麦BCF-Cd的异质性,其中土壤磷(P)和锌(Zn)是影响小麦BCF-Cd变化的关键变量.参数优化可进一步提高模型精度、稳定性和泛化能力.

关 键 词:随机森林  神经网络  回归方程  富集系数  小麦
收稿时间:2022/7/25 0:00:00
修稿时间:2022/9/6 0:00:00

Prediction of Cadmium Uptake Factor in Wheat Based on Machine Learning
NIU Shuo,LI Yan-ling,YANG Yang,SHANG Yan-ping,WANG Tian-qi,CHEN Wei-ping.Prediction of Cadmium Uptake Factor in Wheat Based on Machine Learning[J].Chinese Journal of Environmental Science,2023,44(6):3619-3626.
Authors:NIU Shuo  LI Yan-ling  YANG Yang  SHANG Yan-ping  WANG Tian-qi  CHEN Wei-ping
Institution:Henan Institutes of Advanced Technology, Zhengzhou University, Zhengzhou 450003, China;State Key Laboratory of Urban and Regional Ecology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China;Tianjin Key Laboratory for Dredging Engineer Enterprises, China Communications Construction Company Tianjin Dredging Co., Ltd., Tianjin 300461;Henan Jiyuan County Seed Management Station, Jiyuan 459000, China
Abstract:Applying machine learning methods to resolve the cadmium (Cd) uptake characteristics of regional soil-wheat systems can contribute to the accuracy and rationality of risk decisions. Based on a regional survey, we constructed a Freundlich-type transfer equation, random forest (RF) model, and neural network (BPNN) model to predict wheat Cd enrichment factor (BCF-Cd); verified the prediction accuracy; and assessed the uncertainty of different models. The results showed that both RF (R2=0.583) and BPNN (R2=0.490) were better than the Freundlich transfer equation (R2=0.410). The RF and BPNN were further trained repeatedly, and the results showed that the mean absolute error (MAE) and root mean square error (RMSE) of RF and BPNN were close to each other. Additionally, the accuracy and stability of RF (R2=0.527-0.601) was higher than that of BPNN (R2=0.432-0.661). Feature importance analysis showed that multiple factors led to the heterogeneity of wheat BCF-Cd, in which soil phosphorus (P) and zinc (Zn) were the key variables affecting the change in wheat BCF-Cd. Parameter optimization can further improve the accuracy, stability, and generalization ability of the model.
Keywords:random forest  neural network  regression equation  uptake factors  wheat
点击此处可从《环境科学》浏览原始摘要信息
点击此处可从《环境科学》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号