Use of genetic algorithms to select input variables in decision tree models for the prediction of benthic macroinvertebrates
Authors: Tom D'heygere, Peter L. M. Goethals, Niels De Pauw
Institution: Laboratory of Environmental Toxicology and Aquatic Ecology, Ghent University, J. Plateaustraat 22, B-9000, Gent, Belgium
Abstract: Machine-learning predictions of freshwater organisms are becoming more reliable thanks to the availability of suitable datasets, advanced modelling techniques and steadily increasing computing capacity. A database of measurements collected at 360 sampling sites in non-navigable watercourses in Flanders was used to predict the presence or absence of benthic macroinvertebrate taxa by means of decision trees. The measured variables comprised physical–chemical variables (temperature, pH, dissolved oxygen concentration, conductivity, total organic carbon, Kjeldahl nitrogen and total phosphorus), structural variables (granulometric analysis of the sediment, and width, depth and flow velocity of the river) and two ecotoxicological variables. The predictive power of the decision trees was assessed on the basis of the number of Correctly Classified Instances (CCI). A genetic algorithm was introduced to compare the predictive power of different sets of input variables for the decision trees. The number of input variables could be reduced from 15 to between 2 and 8 without significantly affecting the predictive power of the decision trees. Furthermore, reducing the number of input variables made it easier to identify general trends in the data.
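The idea described in the abstract, a genetic algorithm searching over subsets of input variables and scoring each subset by the CCI of a decision tree, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration rather than the authors' setup: the data are synthetic (six made-up variables, of which only two determine the presence/absence label), the tree is a small hand-written Gini-based learner, and the names `build_tree`, `cci` and `select_variables`, the tournament selection, uniform crossover, mutation rate and tie-break toward fewer variables are all hypothetical choices.

```python
import random

def majority(labels):
    # Most frequent class label; ties resolved deterministically.
    return max(sorted(set(labels)), key=labels.count)

def gini(labels):
    n = len(labels)
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))

def build_tree(X, y, feats, depth):
    """Grow a small binary tree that may only split on the columns in `feats`."""
    if depth == 0 or len(set(y)) == 1 or not feats:
        return majority(y)
    best = None
    for f in feats:
        for t in sorted({row[f] for row in X})[:-1]:  # candidate thresholds
            left = [i for i, row in enumerate(X) if row[f] <= t]
            right = [i for i, row in enumerate(X) if row[f] > t]
            score = (len(left) * gini([y[i] for i in left])
                     + len(right) * gini([y[i] for i in right]))
            if best is None or score < best[0]:
                best = (score, f, t, left, right)
    if best is None:  # no column has more than one distinct value
        return majority(y)
    _, f, t, left, right = best
    return (f, t,
            build_tree([X[i] for i in left], [y[i] for i in left], feats, depth - 1),
            build_tree([X[i] for i in right], [y[i] for i in right], feats, depth - 1))

def predict(tree, row):
    while isinstance(tree, tuple):  # internal node: (feature, threshold, low, high)
        f, t, low, high = tree
        tree = low if row[f] <= t else high
    return tree

def cci(mask, train, test, depth=3):
    """Fraction of correctly classified test instances for a variable subset."""
    feats = [i for i, bit in enumerate(mask) if bit]
    if not feats:
        return 0.0
    tree = build_tree(train[0], train[1], feats, depth)
    return sum(predict(tree, r) == c for r, c in zip(*test)) / len(test[1])

def select_variables(train, test, n_vars, pop_size=20, generations=15,
                     p_mut=0.1, seed=0):
    rng = random.Random(seed)
    cache = {}

    def fitness(mask):
        # Higher CCI wins; on equal CCI, prefer fewer variables (assumed tie-break).
        if mask not in cache:
            cache[mask] = (cci(mask, train, test), -sum(mask))
        return cache[mask]

    # Start from the full variable set plus random subsets.
    pop = [(1,) * n_vars] + [tuple(rng.randint(0, 1) for _ in range(n_vars))
                             for _ in range(pop_size - 1)]
    for _ in range(generations):
        nxt = sorted(pop, key=fitness, reverse=True)[:2]  # elitism
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 3), key=fitness)  # tournament selection
            b = max(rng.sample(pop, 3), key=fitness)
            child = tuple(rng.choice(pair) for pair in zip(a, b))  # uniform crossover
            child = tuple(bit ^ (rng.random() < p_mut) for bit in child)  # bit-flip mutation
            nxt.append(child)
        pop = nxt
    return max(pop, key=fitness)

def make_data(n, rng, n_vars=6):
    # Synthetic stand-in data: only columns 0 and 2 influence the label.
    X = [[round(rng.random(), 1) for _ in range(n_vars)] for _ in range(n)]
    y = [int(row[0] > 0.5 and row[2] > 0.3) for row in X]
    return X, y

data_rng = random.Random(1)
train, test = make_data(80, data_rng), make_data(40, data_rng)
best = select_variables(train, test, n_vars=6)
print("selected mask:", best, "CCI:", cci(best, train, test))
```

Because the initial population contains the full variable set and the two best individuals always survive, the returned subset never scores a lower CCI on the evaluation data than the full set does. Note that scoring subsets on a single hold-out set, as this sketch does, gives an optimistic estimate; a separate validation scheme would be needed for an unbiased comparison.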
Keywords: Benthic macroinvertebrates; Predictive models; Genetic algorithm; Decision trees; Physical–chemical, ecotoxicological and structural variables
This article is indexed in ScienceDirect and other databases.