首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 62 毫秒
1.
In this work we present a Bayesian analysis in linear regression models with spatially varying coefficients for modeling and inference in spatio-temporal processes. This kind of model is particularly appealing in situations where the effect of one or more explanatory processes on the response present substantial spatial heterogeneity. We describe for this model how to make inference about the regression coefficients and response processes under two scenarios: when the explanatory processes are known throughout the study region, and when they are known only at the sampling locations. Using a simulation experiment we investigate how parameter inference and interpolation performance are affected by some features of the data and prior distribution that is used. The proposed methodology is used to model the dataset on PM10 levels in the metropolitan region of Rio de Janeiro presented in Paez and Gamerman (2003).  相似文献   

2.
At the time of European settlement, land surveys were conducted progressively westward throughout the United States. Outside of the original 13 colonies, surveys generally followed the Public Land Survey system in which trees, called witness trees, were regularly recorded at 1 mi by 1 mi grid intersections. This unintentional sampling provides insight into the composition and structure of pre-European settlement forests, which is used as baseline data to assess forest change following settlement. In this paper, a model for the Public Land Surveys of east central Alabama is developed. Assuming that the locations of trees of each species are realized from independent Poisson processes whose respective log intensities are linear functions of environmental covariates (i.e., elevation, landform, and physiographic province), the species observed at the survey grid intersections are independently sampled from a generalized logistic regression model. If all 68 species found in the survey were included, the model would be highly over-parameterized, so only the distribution of the most common taxon, pines, will be considered at this time. To assess the impact of environmental factors not included in the model, a hidden Gaussian random field shall be added as a random effect. A Markov Chain Monte Carlo algorithm is developed for Bayesian inference on model parameters, and for Bayes posterior prediction of the spatial distribution of pines in east central Alabama. Received: June 2004 / Revised: November 2004  相似文献   

3.
In the framework of generalized extreme value (GEV) distribution, the frequentist and Bayesian methods have been used to analyse the extremes of annual maxima wind speed recorded by automatic weather stations in Cape Town, Western Cape, South Africa. In the frequentist approach, the GEV distribution parameters were estimated using maximum likelihood, whereas in the Bayesian method the Markov Chain Monte Carlo technique with the Metropolis–Hastings algorithm was used. The results show that the GEV model with trend in the location parameter appears to be a better model for annual maxima data. The paper also discusses a method to construct informative priors empirically using historical data of the underlying process from other weather stations. The results from the Bayesian analysis show that posterior inference might be affected by the choice of priors and hence by the distance between a weather station used to formulate the priors and the point of interest.  相似文献   

4.
Spatial autocorrelation in wildlife observation data arises when extrinsic environmental processes and patterns that influence the spatial distribution of wildlife are themselves spatially structured, or when species are subject to intrinsic population processes, causing contagion or dispersion effects. Territoriality, Allee effects, dispersal limitations, and social clustering are examples of intrinsic processes. Both forms of autocorrelation can violate the assumptions of generalized linear regression models, resulting in biased estimation of model coefficients and diminished predictive performance. Such consequences may be avoided for extrinsic autocorrelation when autocorrelated environmental variables are available for use as model covariates, whereas intrinsic spatial autocorrelation requires an alternative modeling approach. The autologistic model provides an approach suited to the binary observations often obtained in wildlife surveys, but its performance has not been tested across widely varying sampling intensities or strengths of intrinsic spatial structure. Here we use simulated data to test the autologistic model under a range of sampling conditions. The autologistic model obtains better fits and substantially better predictive performance than the standard logistic regression model over the full range of sampling designs and intensities tested. We provide a simple Bayesian implementation of the autologistic model, which until now has not been achieved with standard statistical software alone. A step-by-step procedure is given for characterizing and modeling spatial autocorrelation in binary observation data, along with computer code for fitting autologistic models in WinBUGS, a freeware Bayesian analysis package. This approach avoids normal approximations to the pseudo-likelihood, in contrast to previous Bayesian applications of the autologistic model. We provide a sample application of the autologistic model, fitted to survey data for a gliding marsupial in southeastern Australia.  相似文献   

5.
A Bayesian framework for stable isotope mixing models   总被引:1,自引:0,他引:1  
Stable isotope sourcing is used to estimate proportional contributions of sources to a mixture, such as in the analysis of animal diets and plant nutrient use. Statistical methods for inference on the diet proportions using stable isotopes have focused on the linear mixing model. Existing frequentist methods provide inferences when the diet proportion vector can be uniquely solved for in terms of the isotope ratios. Bayesian methods apply for arbitrary numbers of isotopes and diet sources but existing models are somewhat limited as they assume that trophic fractionation or discrimination is estimated without error or that isotope ratios are uncorrelated. We present a Bayesian model for the estimation of mean diet that accounts for uncertainty in source means and discrimination and allows correlated isotope ratios. This model is easily extended to allow the diet proportion vector to depend on covariates, such as time. Two data sets are used to illustrate the methodology. Code is available for selected analyses.  相似文献   

6.
Bayesian methods incorporate prior knowledge into a statistical analysis. This prior knowledge is usually restricted to assumptions regarding the form of probability distributions of the parameters of interest, leaving their values to be determined mainly through the data. Here we show how a Bayesian approach can be applied to the problem of drawing inference regarding species abundance distributions and comparing diversity indices between sites. The classic log series and the lognormal models of relative- abundance distribution are apparently quite different in form. The first is a sampling distribution while the other is a model of abundance of the underlying population. Bayesian methods help unite these two models in a common framework. Markov chain Monte Carlo simulation can be used to fit both distributions as small hierarchical models with shared common assumptions. Sampling error can be assumed to follow a Poisson distribution. Species not found in a sample, but suspected to be present in the region or community of interest, can be given zero abundance. This not only simplifies the process of model fitting, but also provides a convenient way of calculating confidence intervals for diversity indices. The method is especially useful when a comparison of species diversity between sites with different sample sizes is the key motivation behind the research. We illustrate the potential of the approach using data on fruit-feeding butterflies in southern Mexico. We conclude that, once all assumptions have been made transparent, a single data set may provide support for the belief that diversity is negatively affected by anthropogenic forest disturbance. Bayesian methods help to apply theory regarding the distribution of abundance in ecological communities to applied conservation.  相似文献   

7.
A spatial zero-inflated poisson regression model for oak regeneration   总被引:1,自引:0,他引:1  
Ecological counts data are often characterized by an excess of zeros and spatial dependence. Excess zeros can occur in regions outside the range of the distribution of a given species. A zero-inflated Poisson regression model is developed, under which the species range is determined by a spatial probit model, including physical variables as covariates. Within that range, species counts are independently drawn from a Poisson distribution whose mean depends on biotic variables. Bayesian inference for this model is illustrated using data on oak seedling counts. Received: May 2004 / Revised: December 2004  相似文献   

8.
There is an increasing interest in the quality of soil, especially for small geographical areas. We present a method to estimate the percent of the area in a county or hydrological basin that is eroded. There are sample data (for several counties in eastern Iowa) from the National Resources Inventory and population data on land use, land capability class, rainfall and slope length and steepness. Using the Gibbs sampler we perform Bayesian predictive inference to obtain estimates for the non-sampled units. These estimates, together with the sample data, provide an estimate of the proportion of the total area that is eroded. We assess the quality of fit of our model using two cross-validation exercises and graphical methods.  相似文献   

9.
For modeling spatial processes, we propose a rich parametric class of stationary range anisotropic covariance structures that, when applied in R2, greatly increases the scope of variogram contors. Geometric anisotropy, which provides the most common generalization of isotropy within stationarity, is a special case. Our class is built from monotonic isotropic correlation functions and special cases include the Matérn and the general exponential functions. As a result, our range anisotropic correlation specification can be attached to a second order stationary spatial process model, unlike ad hoc approaches to range anisotropy in the literature. We adopt a Bayesian perspective to obtain full inference and demonstrate how to fit the resulting model using sampling-based methods. In the presence of measurement error/microscale effect, we can obtain both the usual predictive as well as the noiseless predictive distribution. We analyze a data set of scallop catches under the general exponential range anisotropic model, withholding ten sites to compare the accuracy and precision of the standard and noiseless predictive distributions.  相似文献   

10.
《Ecological modelling》2005,187(4):369-388
Ecosystems exhibit nonlinear dynamics that are often difficult to capture in models. Consequently, linearization is commonly applied to remove some of the uncertainties associated with the nonlinear terms. However, since the true model is unknown and the operating point to linearize the model about is uncertain, developing linear ecosystems models is non-trivial. To develop a linear ecosystem model, we assume that the annual mean state of an ecosystem is a minor bias from the long-term mean state. A first order approximation inverse model to govern the year-to-year dynamics of ecosystems whose characteristic time scales are less than 1 year is developed, through theoretically formulation, on the basis of steady state analysis, time scale separation and nondimensionalization. The approach is adept at predicting year-to-year variations and to tracking system response to changes in environmental drivers when compared to data generated with a standard nonlinear NPZD model.  相似文献   

11.
Global and regional numerical models for terrestrial ecosystem dynamics require fine spatial resolution and temporally complete historical climate fields as input variables. However, because climate observations are unevenly spaced and have incomplete records, such fields need to be estimated. In addition, uncertainty in these fields associated with their estimation are rarely assessed. Ecological models are usually driven with a geostatistical model's mean estimate (kriging) of these fields without accounting for this uncertainty, much less evaluating such errors in terms of their propagation in ecological simulations. We introduce a Bayesian statistical framework to model climate observations to create spatially uniform and temporally complete fields, taking into account correlation in time and space, spatial heterogeneity, lack of normality, and uncertainty about all these factors. A key benefit of the Bayesian model is that it generates uncertainty measures for the generated fields. To demonstrate this method, we reconstruct historical monthly precipitation fields (a driver for ecological models) on a fine resolution grid for a climatically heterogeneous region in the western United States. The main goal of this work is to evaluate the sensitivity of ecological models to the uncertainty associated with prediction of their climate drivers. To assess their numerical sensitivity to predicted input variables, we generate a set of ecological model simulations run using an ensemble of different versions of the reconstructed fields. We construct such an ensemble by sampling from the posterior predictive distribution of the climate field. We demonstrate that the estimated prediction error of the climate field can be very high. We evaluate the importance of such errors in ecological model experiments using an ensemble of historical precipitation time series in simulations of grassland biogeochemical dynamics with an ecological numerical model, Century. We show how uncertainty in predicted precipitation fields is propagated into ecological model results and that this propagation had different modes. Depending on output variable, the response of model dynamics to uncertainty in inputs ranged from uncertainty in outputs that matched that of inputs to those that were muted or that were biased, as well as uncertainty that was persistent in time after input errors dropped.  相似文献   

12.
Abstract:   In conservation biology, uncertainty about the choice of a statistical model is rarely considered. Model-selection uncertainty occurs whenever one model is chosen over plausible alternative models to represent understanding about a process and to make predictions about future observations. The standard approach to representing prediction uncertainty involves the calculation of prediction (or confidence) intervals that incorporate uncertainty about parameter estimates contingent on the choice of a "best" model chosen to represent truth. However, this approach to prediction based on statistical models tends to ignore model-selection uncertainty, resulting in overconfident predictions. Bayesian model averaging (BMA) has been promoted in a range of disciplines as a simple means of incorporating model-selection uncertainty into statistical inference and prediction. Bayesian model averaging also provides a formal framework for incorporating prior knowledge about the process being modeled. We provide an example of the application of BMA in modeling and predicting the spatial distribution of an arboreal marsupial in the Eden region of southeastern Australia. Other approaches to estimating prediction uncertainty are discussed.  相似文献   

13.
Recently, public health professionals and other geostatistical researchers have shown increasing interest in boundary analysis, the detection or testing of zones or boundaries that reveal sharp changes in the values of spatially oriented variables. For areal data (i.e., data which consist only of sums or averages over geopolitical regions), Lu and Carlin (Geogr Anal 37: 265–285, 2005) suggested a fully model-based framework for areal wombling using Bayesian hierarchical models with posterior summaries computed using Markov chain Monte Carlo (MCMC) methods, and showed the approach to have advantages over existing non-stochastic alternatives. In this paper, we develop Bayesian areal boundary analysis methods that estimate the spatial neighborhood structure using the value of the process in each region and other variables that indicate how similar two regions are. Boundaries may then be determined by the posterior distribution of either this estimated neighborhood structure or the regional mean response differences themselves. Our methods do require several assumptions (including an appropriate prior distribution, a normal spatial random effect distribution, and a Bernoulli distribution for a set of spatial weights), but also deliver more in terms of full posterior inference for the boundary segments (e.g., direct probability statements regarding the probability that a particular border segment is part of the boundary). We illustrate three different remedies for the computing difficulties encountered in implementing our method. We use simulation to compare among existing purely algorithmic approaches, the Lu and Carlin (2005) method, and our new adjacency modeling methods. We also illustrate more practical modeling issues (e.g., covariate selection) in the context of a breast cancer late detection data set collected at the county level in the state of Minnesota.  相似文献   

14.
The stock and recruitment relationship is fundamental to the management of fishery natural resources. However, inferring stock-recruitment relationships is a challenging problem because of the limited available data, the collection of plausible models, and the biological characteristics that should be reflected in the model. Motivated by limitations of traditional parametric stock-recruitment models, we propose a Bayesian nonparametric approach based on a mixture model for the joint distribution of log-reproductive success and stock biomass. Flexible mixture modeling for this bivariate distribution yields rich inference for the stock-recruitment relationship through the implied conditional distribution of log-reproductive success given stock biomass. The method is illustrated with cod data from six regions of the North Atlantic, including comparison with simpler Bayesian parametric and semiparametric models.  相似文献   

15.
Zero-inflated data arise in many contexts. In this paper, we develop a zero-inflated Bayesian hierarchical model which deals with spatial effects, correlation among near-locating measurements as well as excess zeros simultaneously. Inference, including the sampling from the posterior distributions, predictions at new locations, and model selection, is carried out by using computationally efficient Markov chain Monte Carlo techniques. The posterior distributions are simulated using a Gibbs sampler with the embedded ratio-of-uniform method and the slice sampling algorithm. The approach is illustrated via an application to herbaceous data collected in the Missouri Ozark Forest Ecosystem Project. The results from the proposed model are compared with those generated from a non-zero inflated model. The proposed model fully incorporates the information from data collection and provides more reliable inference. A predictive $p$ value is computed for model checking and it indicates that the proposed model fits the data well.  相似文献   

16.
We develop regional-scale eutrophication models for lakes, ponds, and reservoirs to investigate the link between nutrients and chlorophyll-a. The Bayesian TREED (BTREED) model approach allows association of multiple environmental stressors with biological responses, and quantification of uncertainty sources in the empirical water quality model. Nutrient data for lakes, ponds, and reservoirs across the United States were obtained from the Environmental Protection Agency (EPA) National Nutrient Criteria Database. The nutrient data consist of measurements for both stressor variables (such as total nitrogen and total phosphorus), and response variables (such as chlorophyll-a), used in the BTREED model. Markov chain Monte Carlo (McMC) posterior exploration guides a stochastic search through a rich suite of candidate trees toward models that better fit the data. The Bayes factor provides a goodness of fit criterion for comparison of resultant models. We randomly split the data into training and test sets; the training data were used in model estimation, and the test data were used to evaluate out-of-sample predictive performance of the model. An average relative efficiency of 1.02 between the training and test data for the four highest log-likelihood models suggests good out-of-sample predictive performance. Reduced model uncertainty relative to over-parameterized alternative models makes the BTREED models useful for nutrient criteria development, providing the link between nutrient stressors and meaningful eutrophication response.  相似文献   

17.
Fire is both a widespread natural disturbance that affects the distribution of species and a tool that can be used to manage habitats for species. Knowledge of temporal changes in the occurrence of species after fire is essential for conservation management in fire-prone environments. Two key issues are: whether postfire responses of species are idiosyncratic or if multiple species show a limited number of similar responses; and whether such responses to time since fire can predict the occurrence of species across broad spatial scales. We examined the response of bird species to time since fire in semiarid shrubland in southeastern Australia using data from surveys at 499 sites representing a 100-year chronosequence. We used nonlinear regression to model the probability of occurrence of 30 species with time since fire in two vegetation types, and compared species' responses with generalized response shapes from the literature. The occurrence of 16 species was significantly influenced by time since fire: they displayed six main responses consistent with generalized response shapes. Of these 16 species, 15 occurred more frequently in mid- or later-successional vegetation (> 20 years since fire), and only one species occurred more often in early succession (< 5 years since fire). The models had reasonable predictive ability for eight species, some predictive ability for seven species, and were little better than random for one species. Bird species displayed a limited range of responses to time since fire; thus a small set of fire ages should allow the provision of habitat for most species. Postfire successional changes extend for decades and management of the age class distribution of vegetation will need to reflect this timescale. Response curves revealed important seral stages for species and highlighted the importance of mid- to late-successional vegetation (> 20 years). Although time since fire clearly influences the distribution of numerous bird species, predictive models of the spatial distribution of species in fire-prone landscapes need to incorporate other factors in addition to time since fire.  相似文献   

18.
《Ecological modelling》2005,186(2):154-177
In recent years alternative modeling techniques have been used to account for spatial autocorrelations among data observations. They include linear mixed model (LMM), generalized additive model (GAM), multi-layer perceptron (MLP) neural network, radial basis function (RBF) neural network, and geographically weighted regression (GWR). Previous studies show these models are robust to the violation of model assumptions and flexible to nonlinear relationships among variables. However, many of them are non-spatial in nature. In this study, we utilize a local spatial analysis method (i.e., local Moran coefficient) to investigate spatial distribution and heterogeneity in model residuals from those modeling techniques with ordinary least-squares (OLS) as the benchmark. The regression model used in this study has tree crown area as the response variable, and tree diameter and the coordinates of tree locations as the predictor variables. The results indicate that LMM, GAM, MLP and RBF may improve model fitting to the data and provide better predictions for the response variable, but they generate spatial patterns for model residuals similar to OLS. The OLS, LMM, GAM, MLP and RBF models yield more residual clusters of similar values, indicating that trees in some sub-areas are either all underestimated or all overestimated for the response variable. In contrast, GWR estimates model coefficients at each location in the study area, and produces more accurate predictions for the response variable. Furthermore, the residuals of the GWR model have more desirable spatial distributions than the ones derived from the OLS, LMM, GAM, MLP and RBF models.  相似文献   

19.
Models of the geographic distributions of species have wide application in ecology. But the nonspatial, single-level, regression models that ecologists have often employed do not deal with problems of irregular sampling intensity or spatial dependence, and do not adequately quantify uncertainty. We show here how to build statistical models that can handle these features of spatial prediction and provide richer, more powerful inference about species niche relations, distributions, and the effects of human disturbance. We begin with a familiar generalized linear model and build in additional features, including spatial random effects and hierarchical levels. Since these models are fully specified statistical models, we show that it is possible to add complexity without sacrificing interpretability. This step-by-step approach, together with attached code that implements a simple, spatially explicit, regression model, is structured to facilitate self-teaching. All models are developed in a Bayesian framework. We assess the performance of the models by using them to predict the distributions of two plant species (Proteaceae) from South Africa's Cape Floristic Region. We demonstrate that making distribution models spatially explicit can be essential for accurately characterizing the environmental response of species, predicting their probability of occurrence, and assessing uncertainty in the model results. Adding hierarchical levels to the models has further advantages in allowing human transformation of the landscape to be taken into account, as well as additional features of the sampling process.  相似文献   

20.
New approaches to modelling fish-habitat relationships   总被引:1,自引:0,他引:1  
Ecologists often develop models that describe the relationship between faunal communities and their habitat. Coral reef fishes have been the focus of numerous such studies, which have used a wide range of statistical tools to answer an equally wide range of questions. Here, we apply a series of both conventional statistical techniques (linear and generalized additive regression models) and novel machine-learning techniques (the support vector machine and three ensemble techniques used with regression trees) to predict fish species richness, biomass, and diversity from a range of habitat variables. We compare the techniques in terms of their predictive performance, and we compare a subset of the models in terms of the influence each habitat variable has for the predictions. Prediction errors are estimated by cross-validation, and variable importance is assessed using permutations of individual variable values. For predictions of species richness and diversity the tree-based models generally and the random forest model specifically are superior (produce the lowest errors). These model types are all able to model both nonlinear and interaction effects. The linear model, unable to model either effect type, performs the worst (produces the highest errors). For predictions of biomass, the generalized additive model is superior, and the support vector machine performs the worst. Depth range, the difference between maximum and minimum water depth at a given site, is identified as the most important variable in the majority of models predicting the three fish community variables. However, variable importance is highly dependent upon model type, which leads to questions regarding the interpretation of variable importance and its proper use as an indicator of causality. The representation of ecological relationships by tree-based ensemble learners will improve predictive performance, and provide a new avenue for exploring ecological relationships, both statistical and causal.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号