首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
We explored the effect of varying pseudo-absence data in species distribution modelling using empirical data for four real species and simulated data for two imaginary species. In all analyses we used a fixed study area, a fixed set of environmental predictors and a fixed set of presence observations. Next, we added pseudo-absence data generated by different sampling designs and in different numbers to assess their relative importance for the output from the species distribution model. The sampling design strongly influenced the predictive performance of the models while the number of pseudo-absences had minimal effect on the predictive performance. We attribute much of these results to the relationship between the environmental range of the pseudo-absences (i.e. the extent of the environmental space being considered) and the environmental range of the presence observations (i.e. under which environmental conditions the species occurs). The number of generated pseudo-absences had a direct effect on the predicted probability, which translated to different distribution areas. Pseudo-absence observations that fell within grid cells with presence observations were purposely included in our analyses. We discourage the practice of excluding certain pseudo-absence data because it involves arbitrary assumptions about what are (un)suitable environments for the species being modelled.  相似文献   

2.
Species distribution models (SDMs) based on statistical relationships between occurrence data and underlying environmental conditions are increasingly used to predict spatial patterns of biological invasions and prioritize locations for early detection and control of invasion outbreaks. However, invasive species distribution models (iSDMs) face special challenges because (i) they typically violate SDM's assumption that the organism is in equilibrium with its environment, and (ii) species absence data are often unavailable or believed to be too difficult to interpret. This often leads researchers to generate pseudo-absences for model training or utilize presence-only methods, and to confuse the distinction between predictions of potential vs. actual distribution. We examined the hypothesis that true-absence data, when accompanied by dispersal constraints, improve prediction accuracy and ecological understanding of iSDMs that aim to predict the actual distribution of biological invasions. We evaluated the impact of presence-only, true-absence and pseudo-absence data on model accuracy using an extensive dataset on the distribution of the invasive forest pathogen Phytophthora ramorum in California. Two traditional presence/absence models (generalized linear model and classification trees) and two alternative presence-only models (ecological niche factor analysis and maximum entropy) were developed based on 890 field plots of pathogen occurrence and several climatic, topographic, host vegetation and dispersal variables. The effects of all three possible types of occurrence data on model performance were evaluated with receiver operating characteristic (ROC) and omission/commission error rates. Results show that prediction of actual distribution was less accurate when we ignored true-absences and dispersal constraints. Presence-only models and models without dispersal information tended to over-predict the actual range of invasions. Models based on pseudo-absence data exhibited similar accuracies as presence-only models but produced spatially less feasible predictions. We suggest that true-absence data are a critical ingredient not only for accurate calibration but also for ecologically meaningful assessment of iSDMs that focus on predictions of actual distributions.  相似文献   

3.
Habitat classification models (HCMs) are invaluable tools for species conservation, land-use planning, reserve design, and metapopulation assessments, particularly at broad spatial scales. However, species occurrence data are often lacking and typically limited to presence points at broad scales. This lack of absence data precludes the use of many statistical techniques for HCMs. One option is to generate pseudo-absence points so that the many available statistical modeling tools can bb used. Traditional techniques generate pseudo-absence points at random across broadly defined species ranges, often failing to include biological knowledge concerning the species-habitat relationship. We incorporated biological knowledge of the species-habitat relationship into pseudo-absence points by creating habitat envelopes that constrain the region from which points were randomly selected. We define a habitat envelope as an ecological representation of a species, or species feature's (e.g., nest) observed distribution (i.e., realized niche) based on a single attribute, or the spatial intersection of multiple attributes. We created HCMs for Northern Goshawk (Accipiter gentilis atricapillus) nest habitat during the breeding season across Utah forests with extant nest presence points and ecologically based pseudo-absence points using logistic regression. Predictor variables were derived from 30-m USDA Landfire and 250-m Forest Inventory and Analysis (FIA) map products. These habitat-envelope-based models were then compared to null envelope models which use traditional practices for generating pseudo-absences. Models were assessed for fit and predictive capability using metrics such as kappa, threshold-independent receiver operating characteristic (ROC) plots, adjusted deviance (D(adj)2), and cross-validation, and were also assessed for ecological relevance. For all cases, habitat envelope-based models outperformed null envelope models and were more ecologically relevant, suggesting that incorporating biological knowledge into pseudo-absence point generation is a powerful tool for species habitat assessments. Furthermore, given some a priori knowledge of the species-habitat relationship, ecologically based pseudo-absence points can be applied to any species, ecosystem, data resolution, and spatial extent.  相似文献   

4.
5.
An important decision in presence-only species distribution modeling is how to select background (or pseudo-absence) localities for model parameterization. The selection of such localities may influence model parameterization and thus, can influence the appropriateness and accuracy of the model prediction when extrapolating the species distribution across time and space. We used 12 species from the Australian Wet Tropics (AWT) to evaluate the relationship between the geographic extent from which pseudo-absences are taken and model performance, and shape and importance of predictor variables using the MAXENT modeling method. Model performance is lower when pseudo-absence points are taken from either a restricted or broad region with respect to species occurrence data than from an intermediate region. Furthermore, variable importance (i.e., contribution to the model) changed such that, models became increasingly simplified, dominated by just two variables, as the area from which pseudo-absence points were drawn increased. Our results suggest that it is important to consider the spatial extent from which pseudo-absence data are taken. We suggest species distribution modeling exercises should begin with exploratory analyses evaluating what extent might provide both the most accurate results and biologically meaningful fit between species occurrence and predictor variables. This is especially important when modeling across space or time—a growing application for species distributional modeling.  相似文献   

6.
Obtaining Environmental Favourability Functions from Logistic Regression   总被引:6,自引:0,他引:6  
Logistic regression is a statistical tool widely used for predicting species’ potential distributions starting from presence/absence data and a set of independent variables. However, logistic regression equations compute probability values based not only on the values of the predictor variables but also on the relative proportion of presences and absences in the dataset, which does not adequately describe the environmental favourability for or against species presence. A few strategies have been used to circumvent this, but they usually imply an alteration of the original data or the discarding of potentially valuable information. We propose a way to obtain from logistic regression an environmental favourability function whose results are not affected by an uneven proportion of presences and absences. We tested the method on the distribution of virtual species in an imaginary territory. The favourability models yielded similar values regardless of the variation in the presence/absence ratio. We also illustrate with the example of the Pyrenean desman’s (Galemys pyrenaicus) distribution in Spain. The favourability model yielded more realistic potential distribution maps than the logistic regression model. Favourability values can be regarded as the degree of membership of the fuzzy set of sites whose environmental conditions are favourable to the species, which enables applying the rules of fuzzy logic to distribution modelling. They also allow for direct comparisons between models for species with different presence/absence ratios in the study area. This makes them more useful to estimate the conservation value of areas, to design ecological corridors, or to select appropriate areas for species reintroductions. Received: June 2005 / Revised: July 2005  相似文献   

7.
Eradication and control of invasive species are often possible only if populations are detected when they are small and localized. To be efficient, detection surveys should be targeted at locations where there is the greatest risk of incursions. We examine the utility of habitat suitability index (HSI) and particle dispersion models for targeting sampling for marine pests. Habitat suitability index models are a simple way to identify suitable habitat when species distribution data are lacking. We compared the performance of HSI models with statistical models derived from independent data from New Zealand on the distribution of two nonindigenous bivalves: Theora lubrica and Musculista senhousia. Logistic regression models developed using the HSI scores as predictors of the presence/absence of Theora and Musculista explained 26.7% and 6.2% of the deviance in the data, respectively. Odds ratios for the HSI scores were greater than unity, indicating that they were genuine predictors of the presence/ absence of each species. The fit and predictive accuracy of each logistic model were improved when simulated patterns of dispersion from the nearest port were added as a predictor variable. Nevertheless, the combined model explained, at best, 46.5% of the deviance in the distribution of Theora and correctly predicted 56% of true presences and 50% of all cases. Omission errors were between 6% and 16%. Although statistical distribution models built directly from environmental predictors always outperformed the equivalent HSI models, the gain in model fit and accuracy was modest. High residual deviance in both types of model suggests that the distributions realized by Theora and Musculista in the field data were influenced by factors not explicitly modeled as explanatory variables and by error in the environmental data used to project suitable habitat for the species. Our results highlight the difficulty of accurately predicting the distribution of invasive marine species that exhibit low habitat occupancy and patchy distributions in time and space. Although the HSI and statistical models had utility as predictors of the likely distribution of nonindigenous marine species, the level of spatial accuracy achieved with them may be well below expectations for sensitive surveillance programs.  相似文献   

8.
An important aspect of species distribution modelling is the choice of the modelling method because a suboptimal method may have poor predictive performance. Previous comparisons have found that novel methods, such as Maxent models, outperform well-established modelling methods, such as the standard logistic regression. These comparisons used training samples with small numbers of occurrences per estimated model parameter, and this limited sample size may have caused poorer predictive performance due to overfitting. Our hypothesis is that Maxent models would outperform a standard logistic regression because Maxent models avoid overfitting by using regularisation techniques and a standard logistic regression does not. Regularisation can be applied to logistic regression models using penalised maximum likelihood estimation. This estimation procedure shrinks the regression coefficients towards zero, causing biased predictions if applied to the training sample but improving the accuracy of new predictions. We used Maxent and logistic regression (standard and penalised) to analyse presence/pseudo-absence data for 13 tree species and evaluated the predictive performance (discrimination) using presence-absence data. The penalised logistic regression outperformed standard logistic regression and equalled the performance of Maxent. The penalised logistic regression may be considered one of the best methods to develop species distribution models trained with presence/pseudo-absence data, as it is comparable to Maxent. Our results encourage further use of the penalised logistic regression for species distribution modelling, especially in those cases in which a complex model must be fitted to a sample with a limited size.  相似文献   

9.
Empirical models for predicting the distribution of organisms from environmental data have often focused on principles of ecological niche theory. However, even at large scales, there is little agreement over how to represent the dimensions of a species’ niche. The performance of such models is greatly affected by the nature of species distributional and environmental data. Regional scale distribution models were developed for 30 willow species in Ontario to examine (i) the predictive ability of logistic regression analysis, and (ii) the effects of using different distributional and environmental data sets. Two original measures of model accuracy and over-prediction were employed and evaluated using independent data. Models based on unique combinations of monthly climate data predicted distributions most accurately for all species. Models based on a fixed set of variables, while generating the highest average probabilities of occurrence for certain species with limited ranges, resulted in the greatest under- and over-estimates of willow distributions. Comparisons of models demonstrated climatic patterns among willows of differing habit and habitat. The distribution of dwarf willow species, present only in the Ontario arctic, followed gradients of summer maximum temperatures. The distribution of the tree species in the southerly portions of the province followed gradients of fall and winter minimum temperatures. Regardless of distributional and environmental data input, no algorithm maximized model performance for all species. Individual species models require individual approaches; i.e., the variable selection technique, the set of environmental factors used as predictors, and the nature of species distributional data must be carefully matched to the intended application. An understanding of evolutionary processes enhances the meaningful interpretation of individual species models. Unless sampling bias and species prevalence can be accounted for, models based on collection point data are best used to guide field surveys. While inferred range data may be better suited to determine potential ecological niches, overestimation of species prevalence and environmental tolerance must be recognized. A combination of available distributional data types is recommended to best determine species niches, an important step in developing conservation strategies.  相似文献   

10.
Abstract: Distribution models are used increasingly for species conservation assessments over extensive areas, but the spatial resolution of the modeled data and, consequently, of the predictions generated directly from these models are usually too coarse for local conservation applications. Comprehensive distribution data at finer spatial resolution, however, require a level of sampling that is impractical for most species and regions. Models can be downscaled to predict distribution at finer resolutions, but this increases uncertainty because the predictive ability of models is not necessarily consistent beyond their original scale. We analyzed the performance of downscaled, previously published models of environmental favorability (a generalized linear modeling technique) for a restricted endemic insectivore, the Iberian desman (Galemys pyrenaicus), and a more widespread carnivore, the Eurasian otter (Lutra lutra), in the Iberian Peninsula. The models, built from presence–absence data at 10 × 10 km resolution, were extrapolated to a resolution 100 times finer (1 × 1 km). We compared downscaled predictions of environmental quality for the two species with published data on local observations and on important conservation sites proposed by experts. Predictions were significantly related to observed presence or absence of species and to expert selection of sampling sites and important conservation sites. Our results suggest the potential usefulness of downscaled projections of environmental quality as a proxy for expensive and time‐consuming field studies when the field studies are not feasible. This method may be valid for other similar species if coarse‐resolution distribution data are available to define high‐quality areas at a scale that is practical for the application of concrete conservation measures.  相似文献   

11.
Abstract: Species distribution models are critical tools for the prediction of invasive species spread and conservation of biodiversity. The majority of species distribution models have been built with environmental data. Community ecology theory suggests that species co‐occurrence data could also be used to predict current and potential distributions of species. Species assemblages are the products of biotic and environmental constraints on the distribution of individual species and as a result may contain valuable information for niche modeling. We compared the predictive ability of distribution models of annual grassland plants derived from either environmental or community‐composition data. Composition‐based models were built with the presence or absence of species at a site as predictors of site quality, whereas environment‐based models were built with soil chemistry, moisture content, above‐ground biomass, and solar radiation as predictors. The reproductive output of experimentally seeded individuals of 4 species and the abundance of 100 species were used to evaluate the resulting models. Community‐composition data were the best predictors of both the site‐specific reproductive output of sown individuals and the site‐specific abundance of existing populations. Successful community‐based models were robust to omission of data on the occurrence of rare species, which suggests that even very basic survey data on the occurrence of common species may be adequate for generating such models. Our results highlight the need for increased public availability of ecological survey data to facilitate community‐based modeling at scales relevant to conservation.  相似文献   

12.
Predicting species distributions from samples collected along roadsides   总被引:1,自引:0,他引:1  
Predictive models of species distributions are typically developed with data collected along roads. Roadside sampling may provide a biased (nonrandom) sample; however, it is currently unknown whether roadside sampling limits the accuracy of predictions generated by species distribution models. We tested whether roadside sampling affects the accuracy of predictions generated by species distribution models by using a prospective sampling strategy designed specifically to address this issue. We built models from roadside data and validated model predictions at paired locations on unpaved roads and 200 m away from roads (off road), spatially and temporally independent from the data used for model building. We predicted species distributions of 15 bird species on the basis of point-count data from a landbird monitoring program in Montana and Idaho (U.S.A.). We used hierarchical occupancy models to account for imperfect detection. We expected predictions of species distributions derived from roadside-sampling data would be less accurate when validated with data from off-road sampling than when it was validated with data from roadside sampling and that model accuracy would be differentially affected by whether species were generalists, associated with edges, or associated with interior forest. Model performance measures (kappa, area under the curve of a receiver operating characteristic plot, and true skill statistic) did not differ between model predictions of roadside and off-road distributions of species. Furthermore, performance measures did not differ among edge, generalist, and interior species, despite a difference in vegetation structure along roadsides and off road and that 2 of the 15 species were more likely to occur along roadsides. If the range of environmental gradients is surveyed in roadside-sampling efforts, our results suggest that surveys along unpaved roads can be a valuable, unbiased source of information for species distribution models.  相似文献   

13.
Although long-lived tree species experience considerable environmental variation over their life spans, their geographical distributions reflect sensitivity mainly to mean monthly climatic conditions. We introduce an approach that incorporates a physiologically based growth model to illustrate how a half-dozen tree species differ in their responses to monthly variation in four climatic-related variables: water availability, deviations from an optimum temperature, atmospheric humidity deficits, and the frequency of frost. Rather than use climatic data directly to correlate with a species’ distribution, we assess the relative constraints of each of the four variables as they affect predicted monthly photosynthesis for Douglas-fir, the most widely distributed species in the region. We apply an automated regression-tree analysis to create a suite of rules, which differentially rank the relative importance of the four climatic modifiers for each species, and provide a basis for predicting a species’ presence or absence on 3737 uniformly distributed U.S. Forest Services’ Forest Inventory and Analysis (FIA) field survey plots. Results of this generalized rule-based approach were encouraging, with weighted accuracy, which combines the correct prediction of both presence and absence on FIA survey plots, averaging 87%. A wider sampling of climatic conditions throughout the full range of a species’ distribution should improve the basis for creating rules and the possibility of predicting future shifts in the geographic distribution of species.  相似文献   

14.
Maximum entropy modeling of species geographic distributions   总被引:94,自引:0,他引:94  
The availability of detailed environmental data, together with inexpensive and powerful computers, has fueled a rapid increase in predictive modeling of species environmental requirements and geographic distributions. For some species, detailed presence/absence occurrence data are available, allowing the use of a variety of standard statistical techniques. However, absence data are not available for most species. In this paper, we introduce the use of the maximum entropy method (Maxent) for modeling species geographic distributions with presence-only data. Maxent is a general-purpose machine learning method with a simple and precise mathematical formulation, and it has a number of aspects that make it well-suited for species distribution modeling. In order to investigate the efficacy of the method, here we perform a continental-scale case study using two Neotropical mammals: a lowland species of sloth, Bradypus variegatus, and a small montane murid rodent, Microryzomys minutus. We compared Maxent predictions with those of a commonly used presence-only modeling method, the Genetic Algorithm for Rule-Set Prediction (GARP). We made predictions on 10 random subsets of the occurrence records for both species, and then used the remaining localities for testing. Both algorithms provided reasonable estimates of the species’ range, far superior to the shaded outline maps available in field guides. All models were significantly better than random in both binomial tests of omission and receiver operating characteristic (ROC) analyses. The area under the ROC curve (AUC) was almost always higher for Maxent, indicating better discrimination of suitable versus unsuitable areas for the species. The Maxent modeling approach can be used in its present form for many applications with presence-only datasets, and merits further research and development.  相似文献   

15.
There is increasing interestin broad-scale analysis, modeling, and prediction of the distribution and composition of plant species assemblages under climatic, environmental, and biotic change, particularly for conservation purposes. We devised a method to reliably predict the impact of climate change on large assemblages of plant communities, while also considering competing biotic and environmental factors. To this purpose, we first used multilabel algorithms in order to convert the task of explaining a large assemblage of plant communities into a classification framework able to capture with high cross-validated accuracy the pattern of species distributions under a composite set of biotic and abiotic factors. We applied our model to a large set of plant communities in the Swiss Alps. Our model explained presences and absences of 175 plant species in 608 plots with >87% cross-validated accuracy, predicted decreases in α, β, and γ diversity by 2040 under both moderate and extreme climate scenarios, and identified likely advantaged and disadvantaged plant species under climate change. Multilabel variable selection revealed the overriding importance of topography, soils, and temperature extremes (rather than averages) in determining the distribution of plant species in the study area and their response to climate change. Our method addressed a number of challenging research problems, such as scaling to large numbers of species, considering species relationships and rarity, and addressing an overwhelming proportion of absences in presence–absence matrices. By handling hundreds to thousands of plants and plots simultaneously over large areas, our method can inform broad-scale conservation of plant species under climate change because it allows species that require urgent conservation action (assisted migration, seed conservation, and ex situ conservation) to be detected and prioritized. Our method also increases the practicality of assisted colonization of plant species by helping to prevent ill-advised introduction of plant species with limited future survival probability.  相似文献   

16.
Predators and prey assort themselves relative to each other, the availability of resources and refuges, and the temporal and spatial scale of their interaction. Predictive models of predator distributions often rely on these relationships by incorporating data on environmental variability and prey availability to determine predator habitat selection patterns. This approach to predictive modeling holds true in marine systems where observations of predators are logistically difficult, emphasizing the need for accurate models. In this paper, we ask whether including prey distribution data in fine-scale predictive models of bottlenose dolphin (Tursiops truncatus) habitat selection in Florida Bay, Florida, U.S.A., improves predictive capacity. Environmental characteristics are often used as predictor variables in habitat models of top marine predators with the assumption that they act as proxies of prey distribution. We examine the validity of this assumption by comparing the response of dolphin distribution and fish catch rates to the same environmental variables. Next, the predictive capacities of four models, with and without prey distribution data, are tested to determine whether dolphin habitat selection can be predicted without recourse to describing the distribution of their prey. The final analysis determines the accuracy of predictive maps of dolphin distribution produced by modeling areas of high fish catch based on significant environmental characteristics. We use spatial analysis and independent data sets to train and test the models. Our results indicate that, due to high habitat heterogeneity and the spatial variability of prey patches, fine-scale models of dolphin habitat selection in coastal habitats will be more successful if environmental variables are used as predictor variables of predator distributions rather than relying on prey data as explanatory variables. However, predictive modeling of prey distribution as the response variable based on environmental variability did produce high predictive performance of dolphin habitat selection, particularly foraging habitat.  相似文献   

17.
《Ecological modelling》2004,175(2):137-149
Bird species are selective on the vegetation types in which they are found but predictive models of bird distribution based on variables derived from land-use/land-cover maps tend to have limited success. It has been suggested that accuracy of existing maps used to derive predictors is in part responsible for the limited success of bird distribution models. In two areas of 4900 km2 of Western Andalusia, Spain, we compared the predictive ability of bird distribution models derived from two existing general-purpose land-use/land-cover maps, which differ in their resolution and accuracy: a coarse scale vegetation map of Europe, the CORINE land-cover map, and a detailed regional map, the 1995 land-use/land-cover map of Andalusia from the SINAMBA (Consejerı́a de Medio Ambiente, Junta de Andalucı́a). We compared the bird distribution models derived from these general-purpose vegetation maps with models derived from two more accurate structural vegetation maps built considering directly variables that influence bird habitat selection, one built from satellite images for this study and another obtained by improving the resolution and accuracy of the SINAMBA map with satellite data. We sampled the presence/absence of bird species at 857 points using 15-min point surveys. Predictive models for 54 bird species were built with generalised additive models (GAMs), using as potential predictors the same set of landscape and vegetation structure variables measured on each map. We compared for each bird species the predictive accuracy of the best model derived from each map. Vegetation structure measured at bird sample points was used as ground-truth for comparing the accuracy of vegetation maps. Although maps differed in their resolution and accuracy, the results show that all of them produced similarly accurate bird distribution models, with a mixed map produced with both thematic and satellite information being the best. The models derived from the more accurate vegetation structure maps obtained from satellite data were not more accurate than those derived directly from the SINAMBA or CORINE maps. Our results suggest that some general-purpose land-use/land-cover maps are accurate enough to derive bird distribution models. There is a certain limit to improve vegetation maps above which there is no effect in their power to predict bird distribution.  相似文献   

18.
Developing robust species distribution models is important as model outputs are increasingly being incorporated into conservation policy and management decisions. A largely overlooked component of model assessment and refinement is whether to include historic species occurrence data in distribution models to increase the data sample size. Data of different temporal provenance often differ in spatial accuracy and precision. We test the effect of inclusion of historic coarse-resolution occurrence data on distribution model outputs for 187 species of birds in Australian tropical savannas. Models using only recent (after 1990), fine-resolution data had significantly higher model performance scores measured with area under the receiver operating characteristic curve (AUC) than models incorporating both fine- and coarse-resolution data. The drop in AUC score is positively correlated with the total area predicted to be suitable for the species (R2 = 0.163-0.187, depending on the environmental predictors in the model), as coarser data generally leads to greater predicted areas. The remaining unexplained variation is likely to be due to the covariate errors resulting from resolution mismatch between species records and environmental predictors. We conclude that decisions regarding data use in species distribution models must be conscious of the variation in predictions that mixed-scale datasets might cause.  相似文献   

19.
Planning land-use for biodiversity conservation frequently involves computer-assisted reserve selection algorithms. Typically such algorithms operate on matrices of species presence–absence in sites, or on species-specific distributions of model predicted probabilities of occurrence in grid cells. There are practically always errors in input data—erroneous species presence–absence data, structural and parametric uncertainty in predictive habitat models, and lack of correspondence between temporal presence and long-run persistence. Despite these uncertainties, typical reserve selection methods proceed as if there is no uncertainty in the data or models. Having two conservation options of apparently equal biological value, one would prefer the option whose value is relatively insensitive to errors in planning inputs. In this work we show how uncertainty analysis for reserve planning can be implemented within a framework of information-gap decision theory, generating reserve designs that are robust to uncertainty. Consideration of uncertainty involves modifications to the typical objective functions used in reserve selection. Search for robust-optimal reserve structures can still be implemented via typical reserve selection optimization techniques, including stepwise heuristics, integer-programming and stochastic global search.  相似文献   

20.
We assessed the occurrence of a common river bird, the Plumbeous Redstart Rhyacornis fuliginosus, along 180 independent streams in the Indian and Nepali Himalaya. We then compared the performance of multiple discrimant analysis (MDA), logistic regression (LR) and artificial neural networks (ANN) in predicting this species’ presence or absence from 32 variables describing stream altitude, slope, habitat structure, chemistry and invertebrate abundance. Using the entire data (=training set) and a threshold for accepting presence in ANN and LR set to P≥0.5, ANN correctly classified marginally more cases (88%) than either LR (83%) or MDA (84%). Model performance was assessed from two methods of data partitioning. In a ‘leave-one-out’ approach, LR correctly predicted more cases (82%) than MDA (73%) or ANN (69%). However, in a holdout procedure, all the methods performed similarly (73–75%). All methods predicted true absence (i.e. specificity in holdout: 81–85%) better than true presence (i.e. sensitivity: 57–60%). These effects reflect species’ prevalence (=frequency of occurrence), but are seldom considered in distribution modelling. Despite occurring at only 36% of the sites, Plumbeous Redstarts are one of the most common Himalayan river birds, and problems will be greater with less common species. Both LR and ANN require an arbitrary threshold probability (often P=0.5) at which to accept species presence from model prediction. Simulations involving varied prevalence revealed that LR was particularly sensitive to threshold effects. ROC plots (received operating characteristic) were therefore used to compare model performance on test data at a range of thresholds; LR always outperformed ANN. This case study supports the need to test species’ distribution models with independent data, and to use a range of criteria in assessing model performance. ANN do not yet have major advantages over conventional multivariate methods for assessing bird distributions. LR and MDA were both more efficient in the use of computer time than ANN, and also more straightforward in providing testable hypotheses about environmental effects on occurrence. However, LR was apparently subject to chance significant effects from explanatory variables, emphasising the well-known risks of models based purely on correlative data.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号