首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 171 毫秒
1.
The mean of a balanced ranked set sample is more efficient than the mean of a simple random sample of equal size and the precision of ranked set sampling may be increased by using an unbalanced allocation when the population distribution is highly skewed. The aim of this paper is to show the practical benefits of the unequal allocation in estimating simultaneously the means of more skewed variables through real data. In particular, the allocation rule suggested in the literature for a single skewed distribution may be easily applied when more than one skewed variable are of interest and an auxiliary variable correlated with them is available. This method can lead to substantial gains in precision for all the study variables with respect to the simple random sampling, and to the balanced ranked set sampling too.  相似文献   

2.
The issue of variances of different soil variables prevailing at different sampling scales is addressed. This topic is relevant for soil science, agronomy and landscape ecology. In multi-stage sampling there are randomness components in each stage of sampling which can be taken into account by introducing random effects in analysis through the use of hierarchical linear mixed models (HLMM). Due to the nested sampling scheme, there are several hierarchical sub-models. The selection of the best model can be carried out through likelihood ratio tests (LRTs) or Wald tests, which are asymptotically equivalent under standard conditions. However, when the comparison leads to a restricted hypothesis of variance components, standard conditions are not maintained, which leads to more elaborated versions of LRTs. These versions are not disseminated among environmental scientists. The present study shows the modeling of soil data from a sampling where sites, fields within sites, transects within fields, and sampling points within transects were selected in order to take samples from different vegetation types (open and shade). For soil data, several sub-models were compared using Wald tests, classic LRTs and adjusted LRTs where the distribution of the test statistic under the null hypothesis is the Chi-square mixture of Chi-square distributions. The inclusion of random effects via HLMM and suggested by the latest version of LRT allowed us to detect effects of vegetation type on soil properties that were not detected under a classical ANOVA.  相似文献   

3.
We consider the selection of samples in ranked set sampling when several attributes of each sample are of interest. We describe approaches that have appeared previously in the literature and present a novel method that seeks to achieve samples that are nearly balanced with respect to the ranks of all attributes. This method is shown to result in very little loss of precision compared to problems in which only a single sample attribute is of interest.  相似文献   

4.
Ranked set sampling can provide an efficient basis for estimating parameters of environmental variables, particularly when sampling costs are intrinsically high. Various ranked set estimators are considered for the population mean and contrasted in terms of their efficiencies and useful- ness, with special concern for sample design considerations. Specifically, we consider the effects of the form of the underlying random variable, optimisation of efficiency and how to allocate sampling effort for best effect (e.g. one large sample or several smaller ones of the same total size). The various prospects are explored for two important positively skew random variables (lognormal and extreme value) and explicit results are given for these cases. Whilst it turns out that the best approach is to use the largest possible single sample and the optimal ranked set best linear estimator (ranked set BLUE), we find some interesting qualitatively different conclusions for the two skew distributions  相似文献   

5.
Active female sampling occurs in the fiddler crab Uca annulipes. Females sample the burrows of several males before remaining to mate in the burrow of the chosen partner. Females time larval release to coincide with the following nocturnal spring tide and must therefore leave sufficient time for embryonic development after mating. Here we show how this temporal constraint on search time affects female choosiness. We found that, at the start of the sampling period (when time constraints are minimal), females selectively sample the larger males in the population. Towards the end of the sampling period (when the temporal constraints increase the costs of sampling), females are less selective. Furthermore, we suggest that the number of males sampled (and other indices of ‘‘sampling effort’’) may not be reliable indicators of female choosiness and may not reflect the strength of female mating preferences under certain conditions. Burrow quality also emerged as an important criterion in final mate choice. Burrow structure potentially influences reproductive success, and mate acceptance based on burrow structure appears to involve a relatively invariant threshold criterion. Since there is no relationship between male size and burrow quality, females are using at least two independent criteria when choosing potential mates. We envisage mate choice as a two-stage process. First, females select which males to sample based on male size. They then decide whether or not to mate with a male based on burrow features. This sampling process explains how two unrelated variables can both predict male mating success. Received: 23 March 1995/Accepted after revision: 14 January 1996  相似文献   

6.
The paper deals with sampling from a finite population that is distributed over space and has a highly uneven spatial distribution. It suggests a sampling design that allocates a portion of the sample units that are well spread over the population and sequentially selects the remaining units in sub-areas that appear to be of more interest according to the study variable values observed during the survey. In order to estimate the population mean while using this sampling design, a computationally intense estimator, obtained via the Rao–Blackwell approach, is proposed and a resampling method is used that makes the inference computationally feasible. The whole sampling strategy is evaluated through several Monte Carlo experiments.  相似文献   

7.
Kernel density estimators are often used to estimate the utilization distributions (UDs) of animals. Kernel UD estimates have a strong theoretical basis and perform well, but are usually reported without estimates of error or uncertainty. It is intuitively and theoretically appealing to estimate the sampling error in kernel UD estimates using bootstrapping. However, standard equations for kernel density estimates are complicated and computationally expensive. Bootstrapping requires computing hundreds or thousands of probability densities and is impractical when the number of observations, or the area of interest is large. We used the fast Fourier transform (FFT) and discrete convolution theorem to create a bootstrapping algorithm fast enough to run on commonly available desktop or laptop computers. Application of the FFT method to a large (n>20,000) set of radio telemetry data would provide a 99.6% reduction in computation time (i.e., 1.6 as opposed to 444 hours) for 1000 bootstrap UD estimates. Bootstrap error contours were computed using data from a radio-collared polar bear (Ursus maritimus) in the Beaufort Sea north of Alaska.  相似文献   

8.
Adaptive cluster sampling (ACS) is an adaptive sampling scheme which operates under the rule that when the observed value of an initially selected sampling unit satisfies some condition of interest, C, other additional units in some pre-defined accompanying neighborhood are also added to the sample. In turn, if any of these additional units satisfy C, then their corresponding unit neighborhoods are added to the sample as well, and so on. This process stops when no additional units satisfying C are encountered. This paper will provide a review of the major developments and issues in ACS since its introduction by Thompson (1990) [Journal of the American Statistical Association, 85, 1050–1059].  相似文献   

9.
Chelgren ND  Adams MJ  Bailey LL  Bury RB 《Ecology》2011,92(2):408-421
Studies of the distribution of elusive forest wildlife have suffered from the confounding of true presence with the uncertainty of detection. Occupancy modeling, which incorporates probabilities of species detection conditional on presence, is an emerging approach for reducing observation bias. However, the current likelihood modeling framework is restrictive for handling unexplained sources of variation in the response that may occur when there are dependence structures such as smaller sampling units that are nested within larger sampling units. We used multilevel Bayesian occupancy modeling to handle dependence structures and to partition sources of variation in occupancy of sites by terrestrial salamanders (family Plethodontidae) within and surrounding an earlier wildfire in western Oregon, USA. Comparison of model fit favored a spatial N-mixture model that accounted for variation in salamander abundance over models that were based on binary detection/non-detection data. Though catch per unit effort was higher in burned areas than unburned, there was strong support that this pattern was due to a higher probability of capture for individuals in burned plots. Within the burn, the odds of capturing an individual given it was present were 2.06 times the odds outside the burn, reflecting reduced complexity of ground cover in the burn. Ther was weak support that true occupancy was lower within the burned area. While the odds of occupancy in the burn were 0.49 times the odds outside the burn among the five species, the magnitude of variation attributed to the burn was small in comparison to variation attributed to other landscape variables and to unexplained, spatially autocorrelated random variation. While ordinary occupancy models may separate the biological pattern of interest from variation in detection probability when all sources of variation are known, the addition of random effects structures for unexplained sources of variation in occupancy and detection probability may often more appropriately represent levels of uncertainty.  相似文献   

10.
Objects in the terrestrial environment interact differentially with electromagnetic radiation according to their essential physical, chemical and biological properties. This differential interaction is manifest as variability in scattered radiation according to wavelength, location, time, geometries of illumination and observation and polarization. If the population of scattered radiation could be measured, then estimation of these essential properties would be straightforward. The only problem would be linking such estimates to environmental variables of interest. This review paper is divided into three parts. Part 1 is an overview of the attempts that have been made to sample the five domains of scattered radiation (spectral, spatial, temporal, geometrical, polarization) and then to use the results of this sampling to estimate environmental variables of interest. Part one highlights three issues: first, that relationships between remotely sensed data and environmental variables of interest are indirect; second, our ability to estimate these environmental variables is dependent upon our ability to capture a sound representation of variability in scattered radiation and third, a considerable portion of the useful information in remotely sensed images resides in the spatial domain (within the relations between the pixels in the image). This final point is developed in Part 2 that explores ways in which the spatial domain is utilized to describe spatial variation in remotely sensed and ground data; to design optimum sampling schemes for image data and ground data and to increase the accuracy with which remotely sensed data can be used to estimate both discontinuous and continuous variables. Part 3 outlines two specific uses of information in the spatial domain; first, to select an optimum spatial resolution and second, to inform an image classification.  相似文献   

11.
Randomized graph sampling (RGS) is an approach for sampling populations associated with or describable as graphs, when the structure of the graph is known and the parameter of interest is the total weight of the graph. RGS is related to, but distinct from, other graph-based approaches such as snowball and network sampling. Graph elements are clustered into walks that reflect the structure of the graph, as well as operational constraints on sampling. The basic estimator in RGS can be constructed as a Horvitz-Thompson estimator. I prove it to be design-unbiased, and also show design-unbiasedness of an estimator of the sample variance when walks are sampled with replacement. Covariates can be employed for variance reduction either through improved assignment of selection probabilities to walks in the design step, or through the use of alternative estimators during analysis. The approach is illustrated with a trail maintenance example, which demonstrates that complicated approaches to assignment of selection probabilities can be counterproductive. I describe conditions under which RGS may be efficient in practice, and suggest possible applications.  相似文献   

12.
We describe a probabilistic sampling design of circular permanent plots for the long-term monitoring of protected dry grasslands in Switzerland. The population under study is defined by the perimeter of a national inventory. The monitoring focus is on the species composition of the protected grassland vegetation and derived conservation values. Efficient trend estimations are required for the whole country and for some predefined target groups (six biogeographical regions and eleven vegetation types). The target groups are equally important regardless of their size. Consequently, intensified sampling of the less frequent groups is essential for sample efficiency. The prior information needed to draw a targeted sample is obtained from the sampling frame and external databases. The logistics and generalized delineation of the target population may pose further problems. Thus, investments in fieldwork and travel time should be well balanced by selecting a cluster sample. Second, any access problems in the field and non-target units in the sample should be compensated for by selecting reserve plots as they otherwise may considerably reduce the effective sample size. Finally, the design has to be flexible as the sampling frame may change over time and sampling intensity might have to be adjusted to redefined budgets or requirements. Likewise, the variables and biological items of interest may change. To fulfil all these constraints and to optimally use the available prior information, we propose a multi-stage self-weighted unequal probability sampling design. The design uses modern techniques such as: balanced sampling, spreading, stratified balancing, calibration, unequal probability sampling and power allocation. This sampling design meets the numerous requirements of this study and provides a very efficient estimator.  相似文献   

13.
Knowledge of animal abundance is fundamental to many ecological studies. Frequently, researchers cannot determine true abundance, and so must estimate it using a method such as mark-recapture or distance sampling. Recent advances in abundance estimation allow one to model heterogeneity with individual covariates or mixture distributions and to derive multimodel abundance estimators that explicitly address uncertainty about which model parameterization best represents truth. Further, it is possible to borrow information on detection probability across several populations when data are sparse. While promising, these methods have not been evaluated using mark-recapture data from populations of known abundance, and thus far have largely been overlooked by ecologists. In this paper, we explored the utility of newly developed mark-recapture methods for estimating the abundance of 12 captive populations of wild house mice (Mus musculus). We found that mark-recapture methods employing individual covariates yielded satisfactory abundance estimates for most populations. In contrast, model sets with heterogeneity formulations consisting solely of mixture distributions did not perform well for several of the populations. We show through simulation that a higher number of trapping occasions would have been necessary to achieve good estimator performance in this case. Finally, we show that simultaneous analysis of data from low abundance populations can yield viable abundance estimates.  相似文献   

14.
We developed a method to estimate population abundance from simultaneous counts of unmarked individuals over multiple sites. We considered that at each sampling occasion, individuals in a population could be detected at 1 of the survey sites or remain undetected and used either multinomial or binomial simultaneous-count models to estimate abundance, the latter being equivalent to an N-mixture model with one site. We tested model performance with simulations over a range of detection probabilities, population sizes, growth rates, number of years, sampling occasions, and sites. We then applied our method to 3 critically endangered vulture species in Cambodia to demonstrate the real-world applicability of the model and to provide the first abundance estimates for these species in Cambodia. Our new approach works best when existing methods are expected to perform poorly (i.e., few sites and large variation in abundance among sites) and if individuals may move among sites between sampling occasions. The approach performed better when there were >8 sampling occasions and net probability of detection was high (>0.5). We believe our approach will be useful in particular for simultaneous surveys at aggregation sites, such as roosts. The method complements existing approaches for estimating abundance of unmarked individuals and is the first method designed specifically for simultaneous counts.  相似文献   

15.
Weeds are species of interest for ecologists because they are competitors of the crop for resources but they also play an important role in maintaining biodiversity in agroecosystems. To study their spatial distribution at the field scale, only sampled observations are available due to the cost of sampling. Weeds sampling strategies are static. However, in the domain of spatial sampling, adaptive strategies have also been developed with, for some of them, an important on-line or off-line computational cost. In this article we provide answers to the following question: Are the current adaptive sampling methods efficient enough to motivate a wider use in practice when sampling a weed species at a field scale? We provide a comparison of the behaviour of 8 static strategies and 3 adaptive ones on four criteria: density class estimation, map restoration, spatial aggregation estimation, and sampling duration. From two weeds data sets, we estimated six contrasted Markov Random Field (MRF) models of weed density class spatial distribution and a model for sampling duration. The MRF models were then used to compare the strategies on a large set of simulated maps. Our main finding was that there is no clear gain in using adaptive sampling strategies rather than static ones for the three first criteria, and adaptive strategies were associated to longer sampling duration. This conclusion points out that for weed mapping, it is more important to build a good model of spatial distribution, than to propose complex adaptive sampling strategies.  相似文献   

16.
Abstract:  Multivariate classifications of environmental factors are used as frameworks for conservation management. Although classification performance is likely to be sensitive to choice of input variables, these choices have been subjective in most previous studies. We used the Mantel test on a limited set of sites for which biological data were available to iteratively seek a definition of environmental space (i.e., intersite distances calculated with a set of appropriately transformed and weighted environmental variables) that had maximal correlation with the same sites described in a biological space. The procedure was used to select input variables for a classification of New Zealand's rivers that discriminates variation in fish communities for biodiversity management. The classification performed (i.e., discriminated biological variation) better than classifications with subjectively chosen variables. The inherently linear measures of environmental distance that underlie multivariate environmental classifications mean that they will perform best if they are defined based on variables for which there is a linear variation in the biological community throughout the entire range of the variable. Classification performance will therefore be improved when variables that have nonlinear relationships with biological variation are transformed to make their relationship with biological turnover more linear and when the contributions of environmental factors that have particularly strong relationships with biological variation are increased by weighting. Our results indicate that attention to the manner in which environmental space is defined improves the efficacy of multivariate classification and other techniques in which the environment is used as a surrogate for biological variation.  相似文献   

17.
A new spatially balanced sampling design for environmental surveys is introduced, called Halton iterative partitioning (HIP). The design draws sample locations that are well spread over the study area. Spatially balanced designs are known to be efficient when surveying natural resources because nearby locations tend to be similar. The HIP design uses structural properties of the Halton sequence to partition a resource into nested boxes. Sample locations are then drawn from specific boxes in the partition to ensure spatial diversity. The method is conceptually simple and computationally efficient, draws spatially balanced samples in two or more dimensions and uses standard design-based estimators. Furthermore, HIP samples have an implicit ordering that can be used to define spatially balanced over-samples. This feature is particularly useful when sampling natural resources because we can dynamically add spatially balanced units from the over-sample to the sample as non-target or inaccessible units are discovered. We use several populations to show that HIP sampling draws spatially balanced samples and gives precise estimates of population totals.  相似文献   

18.
Random forests for classification in ecology   总被引:27,自引:0,他引:27  
Cutler DR  Edwards TC  Beard KH  Cutler A  Hess KT  Gibson J  Lawler JJ 《Ecology》2007,88(11):2783-2792
Classification procedures are some of the most widely used statistical methods in ecology. Random forests (RF) is a new and powerful statistical classifier that is well established in other disciplines but is relatively unknown in ecology. Advantages of RF compared to other statistical classifiers include (1) very high classification accuracy; (2) a novel method of determining variable importance; (3) ability to model complex interactions among predictor variables; (4) flexibility to perform several types of statistical data analysis, including regression, classification, survival analysis, and unsupervised learning; and (5) an algorithm for imputing missing values. We compared the accuracies of RF and four other commonly used statistical classifiers using data on invasive plant species presence in Lava Beds National Monument, California, USA, rare lichen species presence in the Pacific Northwest, USA, and nest sites for cavity nesting birds in the Uinta Mountains, Utah, USA. We observed high classification accuracy in all applications as measured by cross-validation and, in the case of the lichen data, by independent test data, when comparing RF to other common classification methods. We also observed that the variables that RF identified as most important for classifying invasive plant species coincided with expectations based on the literature.  相似文献   

19.
20.
Wildlife sampling for habitat selection often combines a random background sample with a random sample of used sites, because the background sample could contain too few used sites to be informative for rare species. This approach is referred to as use-availability sampling. Two variants are considered where there is: (1) a random background sample including used and unused sites augmented with a sample of used sites, and (2) a sample of used sites augmented with a contaminated background sample, i.e. use is not recorded. A weighted estimator first proposed by Manski and Lerman (Econometrica 45(8):1977?C1988, 1977) forms the basis for our suggested approach. The weighted estimator has been shown to perform better than the usual unweighted approach with uncontaminated data and mis-specified logit models (Xie and Manski in Sociol Methods Res 17(3):283?C302, 1989). A weighted EM algorithm is developed for use with contaminated background data. We show that the weighted estimator continues to perform well with contaminated data and maintains its robustness to model mis-specification. The weighted estimator has not been previously used for use-availability sampling due to reliance on the assumption that only the intercept is biased, which is valid for a correct logit model. We show that adjusting the intercept may not eliminate the bias with an incorrect logit model. In this case, the weighted estimator is a relatively simple and effective alternative.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号