首页 | 本学科首页   官方微博 | 高级检索  
     检索      

有机化合物生物富集因子的计算机预测研究
引用本文:孙露,陈英杰,吴曾睿,李卫华,刘桂霞,Philip W. Lee,唐赟.有机化合物生物富集因子的计算机预测研究[J].生态毒理学报,2015,10(2):173-182.
作者姓名:孙露  陈英杰  吴曾睿  李卫华  刘桂霞  Philip W. Lee  唐赟
作者单位:华东理工大学药学院上海市新药设计重点实验室,上海,200237
基金项目:国家自然科学基金(No. 81373329);学科创新引智计划即111计划(No. B07023)
摘    要:有机化合物在生物体内的富集,通常用生物富集因子(bioconcentration factor,简称BCF)来表达,这是化合物生态环境毒性评估的重要指标。为合理预测有机化合物是否易于生物富集,首先从美国环保局网站收集了624个具有不同BCF值的化合物,然后采用7种分子指纹结合5种机器学习方法(包括支持向量机、C4.5决策树、k最近邻法、随机森林法和朴素贝叶斯法),构建了化合物BCF的分类预测模型,所有模型均采用独立外部验证集进行验证。其中,使用Chemo Typer分子指纹结合支持向量机方法得到的二分类模型,整体预测准确度最好,达到了85.4%。通过采用信息增益、频率分析等方法,进一步确定了化合物中易于引起生物富集的关键子结构,包括芳基氯、二芳基醚、氯代烷烃等。研究中所用到的方法为有毒化学品的生态风险评价提供了良好可靠的预测工具。

关 键 词:生物富集因子  计算机预测  二分类模型  警示子结构  环境毒理学
收稿时间:2014/12/5 0:00:00
修稿时间:1/9/2015 12:00:00 AM

In Silico Prediction of Chemical Bioconcentration Factor
Sun Lu,Chen Yingjie,Wu Zengrui,Li Weihu,Liu Guixi,Philip W. Lee and Tang Yun.In Silico Prediction of Chemical Bioconcentration Factor[J].Asian Journal of Ecotoxicology,2015,10(2):173-182.
Authors:Sun Lu  Chen Yingjie  Wu Zengrui  Li Weihu  Liu Guixi  Philip W Lee and Tang Yun
Institution:Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China;Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, Shanghai 200237, China
Abstract:Bioconcentration is an important endpoint in evaluation of chemical adverse effects on ecosystems. In this study, in silico methods were used to predict chemical bioconcentration factor (BCF). At first a data set containing 624 chemicals with BCF values was collected from the Estimation Program Interface Suite of the U. S. Environmental Protection Agency. Using seven fingerprints to represent the molecules, binary classification models were developed with five machine learning methods, including support vector machine (SVM), C4.5 decision tree (C4.5 DT), k-nearest neighbors (kNN), random forest (RF), and Na?ve Bayes (NB). Reliable predictive models were then obtained and validated by 10-fold cross validation and external validation set. Among them, the model built by SVM with ChemoTyper fingerprint performed best, with predictive accuracy up to 85.4%. Moreover, some substructures were identified to be key for bioconcentration via several methods, such as arylchloride, diarylether, chloroalkene, and so on. The approaches used in this study provide a useful tool for environmental risk assessment of chemicals.
Keywords:bioconcentration factor  in silico prediction  binary classification models  substructural alerts  environmental toxicology
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《生态毒理学报》浏览原始摘要信息
点击此处可从《生态毒理学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号