首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Ranked set sampling can provide an efficient basis for estimating parameters of environmental variables, particularly when sampling costs are intrinsically high. Various ranked set estimators are considered for the population mean and contrasted in terms of their efficiencies and useful- ness, with special concern for sample design considerations. Specifically, we consider the effects of the form of the underlying random variable, optimisation of efficiency and how to allocate sampling effort for best effect (e.g. one large sample or several smaller ones of the same total size). The various prospects are explored for two important positively skew random variables (lognormal and extreme value) and explicit results are given for these cases. Whilst it turns out that the best approach is to use the largest possible single sample and the optimal ranked set best linear estimator (ranked set BLUE), we find some interesting qualitatively different conclusions for the two skew distributions  相似文献   

2.
When sample observations are expensive or difficult to obtain, ranked set sampling is known to be an efficient method for estimating the population mean, and in particular to improve on the sample mean estimator. Using best linear unbiased estimators, this paper considers the simple linear regression model with replicated observations. Use of a form of ranked set sampling is shown to be markedly more efficient for normal data when compared with the traditional simple linear regression estimators.  相似文献   

3.
Ranked set sampling was developed for situations where measurement cost is expensive compared with unit acquisition. This paper presents results of simulations and theory examining the impact of balanced ranked set sampling on the relative efficiencies of the slope and intercept estimators of an ordinary least squares regression. Perfect ranking of either the independent or the dependent variable is assumed throughout. In contradistinction to most of the published ranked set sampling work, it is demonstrated that balanced ranked set sampling offers at most little improvement in the relative efficiencies of the slope estimator at any sample size.  相似文献   

4.
We consider the selection of samples in ranked set sampling when several attributes of each sample are of interest. We describe approaches that have appeared previously in the literature and present a novel method that seeks to achieve samples that are nearly balanced with respect to the ranks of all attributes. This method is shown to result in very little loss of precision compared to problems in which only a single sample attribute is of interest.  相似文献   

5.
Rank-based sampling designs are powerful alternatives to simple random sampling (SRS) and often provide large improvements in the precision of estimators. In many environmental, ecological, agricultural, industrial and/or medical applications the interest lies in sampling designs that are cheaper than SRS and provide comparable estimates. In this paper, we propose a new variation of ranked set sampling (RSS) for estimating the population mean based on the random selection technique to measure a smaller number of observations than RSS design. We study the properties of the population mean estimator using the proposed design and provide conditions under which the mean estimator performs better than SRS and some existing rank-based sampling designs. Theoretical results are augmented with some numerical studies and a real-life example, where we also study the performance of our proposed design under perfect and imperfect ranking situations.  相似文献   

6.
A ranked set sampling protocol is proposed when an auxiliary variable is available in addition to the target variable in sample surveys. The protocol may be practically carried out without additional sampling effort or costs. Under the suggested sampling scheme, the estimators usually adopted in surveys with auxiliary information - such as the ratio estimator or the regression estimator - display surprising theoretical properties as well as high performance in practice.  相似文献   

7.
Ranked set sampling can be useful when measurements are expensive but units from the popu- lation can be easily ranked. In this situation one may draw k units from the population, rank them, select one on which to make the expensive measurement, draw another k units, rank them, select one, and so on. The method was originally suggested by McIntyre (1952) in connection with pasture yields and is obviously applicable in other situations as well. Dell and Clutter (1972) and Patil et al. (1994) explain the basics from a classical point of view. Our aim is to examine the procedure from a Bayesian point of view, determine whether ranked set sampling provides advantages over simple random sampling and explore some optimality questions  相似文献   

8.
In this paper, we consider design-based estimation using ranked set sampling (RSS) in finite populations. We first derive the first and second-order inclusion probabilities for an RSS design and present two Horvitz–Thompson type estimators using these inclusion probabilities. We also develop an alternate Hansen–Hurwitz type estimator and investigate its properties. In particular, we show that this alternate estimator always outperforms the usual Hansen–Hurwitz type estimator in the simple random sampling with replacement design with comparable sample size. We also develop formulae for ratio estimator for all three developed estimators. The theoretical results are augmented by numerical and simulation studies as well as a case study using a well known data set. These show that RSS design can yield a substantial improvement in efficiency over the usual simple random sampling design in finite populations.  相似文献   

9.
A new spatially balanced sampling design for environmental surveys is introduced, called Halton iterative partitioning (HIP). The design draws sample locations that are well spread over the study area. Spatially balanced designs are known to be efficient when surveying natural resources because nearby locations tend to be similar. The HIP design uses structural properties of the Halton sequence to partition a resource into nested boxes. Sample locations are then drawn from specific boxes in the partition to ensure spatial diversity. The method is conceptually simple and computationally efficient, draws spatially balanced samples in two or more dimensions and uses standard design-based estimators. Furthermore, HIP samples have an implicit ordering that can be used to define spatially balanced over-samples. This feature is particularly useful when sampling natural resources because we can dynamically add spatially balanced units from the over-sample to the sample as non-target or inaccessible units are discovered. We use several populations to show that HIP sampling draws spatially balanced samples and gives precise estimates of population totals.  相似文献   

10.
The objective of a long-term soil survey is to determine the mean concentrations of several chemical parameters for the pre-defined soil layers and to compare them with the corresponding values in the past. A two-stage random sampling procedure is used to achieve this goal. In the first step, n subplots are selected from N subplots by simple random sampling without replacement; in the second step, m sampling sites are chosen within each of the n selected subplots. Thus n · m soil samples are collected for each soil layer. The idea of the composite sample design comes from the challenge of reducing very expensive laboratory analyses: m laboratory samples from one subplot and one soil layer are physically mixed to form a composite sample. From each of the n selected subplots, one composite sample per soil layer is analyzed in the laboratory, thus n per soil layer in total. In this paper we show that the cost is reduced by the factor m — 1 when instead of the two-stage sampling its composite sample alternative is used; however, the variance of the composite sample mean is increased. In the case of positive intraclass correlation the increase is less than 12.5%; in the case of negative intraclass correlation the increase depends on the properties of the variable as well. For the univariate case we derive the optimal number of subplots and sampling sites. A case study is discussed at the end.  相似文献   

11.
Ranked set sampling is a simple idea of great use. It was proposed half a century ago. The last 15 years or so have witnessed considerable development in the research and applications of ranked set sampling. In this paper, we give an overview on ranked set sampling. We review several variants of ranked set sampling developed since the original idea was proposed. We discuss the essence and the theoretical foundation of ranked set sampling. We present some novel applications of ranked set sampling in areas such as clinical trials, genetic quantitative trait loci mappings and others. By doing so, we wish to provide the reader with a philosophical view on ranked set sampling and shed some lights on a broader range of its applications.  相似文献   

12.
We describe a probabilistic sampling design of circular permanent plots for the long-term monitoring of protected dry grasslands in Switzerland. The population under study is defined by the perimeter of a national inventory. The monitoring focus is on the species composition of the protected grassland vegetation and derived conservation values. Efficient trend estimations are required for the whole country and for some predefined target groups (six biogeographical regions and eleven vegetation types). The target groups are equally important regardless of their size. Consequently, intensified sampling of the less frequent groups is essential for sample efficiency. The prior information needed to draw a targeted sample is obtained from the sampling frame and external databases. The logistics and generalized delineation of the target population may pose further problems. Thus, investments in fieldwork and travel time should be well balanced by selecting a cluster sample. Second, any access problems in the field and non-target units in the sample should be compensated for by selecting reserve plots as they otherwise may considerably reduce the effective sample size. Finally, the design has to be flexible as the sampling frame may change over time and sampling intensity might have to be adjusted to redefined budgets or requirements. Likewise, the variables and biological items of interest may change. To fulfil all these constraints and to optimally use the available prior information, we propose a multi-stage self-weighted unequal probability sampling design. The design uses modern techniques such as: balanced sampling, spreading, stratified balancing, calibration, unequal probability sampling and power allocation. This sampling design meets the numerous requirements of this study and provides a very efficient estimator.  相似文献   

13.
14.
Sampling from partially rank-ordered sets   总被引:1,自引:0,他引:1  
In this paper we introduce a new sampling design. The proposed design is similar to a ranked set sampling (RSS) design with a clear difference that rankers are allowed to declare any two or more units are tied in ranks whenever the units can not be ranked with high confidence. These units are replaced in judgment subsets. The fully measured units are then selected from these partially ordered judgment subsets. Based on this sampling scheme, we develop unbiased estimators for the population mean and variance. We show that the proposed sampling procedure has some advantages over standard ranked set sampling.  相似文献   

15.
This paper presents a ranked set sample allocation procedure that is optimal for a number of nonparametric test procedures. We define a function that measures the amount of information provided by each observation given the actual joint ranking of all the units in a set. The optimal ranked set sample allocates order statistics by maximizing this information function. This paper shows that the optimal allocation of order statistics in a ranked set sample is determined by the location of the mode(s) of the underlying distribution. For unimodal, symmetric distributions, optimal allocation always quantifies the middle observation(s). If the underlying distribution with cdf F is a multi-modal distribution with modes , then the optimal allocation procedure quantifies observations at in a set of size m. We provide similar results for unimodal, asymmetric distributions. We also propose a new sign test which considers the relative positions of the quantified observations from the same cycle in a ranked set sample. The proposed sign test provides improvement in the Pitman efficiency over the ranked set sample sign test of Hettmansperger (1995). It is shown that the information optimal allocation procedure induced by Pitman efficiency is equivalent to the optimal allocation procedure induced by the information criteria. We show that the finite sample distribution of the proposed test based on this optimal design is binomial.  相似文献   

16.
Consider a survey of a plant or animal species in which abundance or presence/absence will be recorded. Further assume that the presence of the plant or animal is rare and tends to cluster. A sampling design will be implemented to determine which units to sample within the study region. Adaptive cluster sampling designs Thompson (1990) are sampling designs that are implemented by first selecting a sample of units according to some conventional probability sampling design. Then, whenever a specified criterion is satisfied upon measuring the variable of interest, additional units are adaptively sampled in neighborhoods of those units satisfying the criterion. The success of these adaptive designs depends on the probabilities of finding the rare clustered events, called networks. This research uses combinatorial generating functions to calculate network inclusion probabilities associated with a simple Latin square sample. It will be shown that, in general, adaptive simple Latin square sampling when compared to adaptive simple random sampling will (i) yield higher network inclusion probabilities and (ii) provide Horvitz-Thompson estimators with smaller variability.  相似文献   

17.
Classical sampling methods can be used to estimate the mean of a finite or infinite population. Block kriging also estimates the mean, but of an infinite population in a continuous spatial domain. In this paper, I consider a finite population version of block kriging (FPBK) for plot-based sampling. The data are assumed to come from a spatial stochastic process. Minimizing mean-squared-prediction errors yields best linear unbiased predictions that are a finite population version of block kriging. FPBK has versions comparable to simple random sampling and stratified sampling, and includes the general linear model. This method has been tested for several years for moose surveys in Alaska, and an example is given where results are compared to stratified random sampling. In general, assuming a spatial model gives three main advantages over classical sampling: (1) FPBK is usually more precise than simple or stratified random sampling, (2) FPBK allows small area estimation, and (3) FPBK allows nonrandom sampling designs.  相似文献   

18.
Abundance vector estimation is a well investigated problem in statistical ecology. The use of simple random sampling with replacement or replicated sampling ensures good asymptotic properties of the abundance vector estimators. However, real surveys are based on small sample sizes, and assuming any specific distribution of the abundance vector estimator may be hazardous.In this paper we focus our attention on situations where the population is not too large and the sample size is small. We propose bootstrap multivariate confidence regions based on data depth. Data depth is a geometrical concept of ordering data from the center outwardly in higher dimensions. The Simplicial depth, the Tukey's depth and the Mahalanobis depth are presented. In order to build confidence regions in the presence of a skewed distribution of the abundance vector estimator, the use of Tukey's depth is suggested. The proposed method has been applied to the benthic community of Lake Lesina. A comparison with Mahalanobis depth and standard existing methods is reported.  相似文献   

19.
Adaptive two-stage one-per-stratum sampling   总被引:1,自引:0,他引:1  
We briefly describe adaptive cluster sampling designs in which the initial sample is taken according to a Markov chain one-per-stratum design (Breidt, 1995) and one or more secondary samples are taken within strata if units in the initial sample satisfy a given condition C. An empirical study of the behavior of the estimation procedure is conducted for three small artificial populations for which adaptive sampling is appropriate. The specific sampling strategy used in the empirical study was a single random-start systematic sample with predefined systematic samples within strata when the initially sampled unit in that stratum satisfies C. The bias of the Horvitz-Thompson estimator for this design is usually very small when adaptive sampling is conducted in a population for which it is suited. In addition, we compare the behavior of several alternative estimators of the standard error of the Horvitz-Thompson estimator of the population total. The best estimator of the standard error is population-dependent but it is not unreasonable to use the Horvitz-Thompson estimator of the variance. Unfortunately, the distribution of the estimator is highly skewed hence the usual approach of constructing confidence intervals assuming normality cannot be used here.  相似文献   

20.
Gray BR  Burlew MM 《Ecology》2007,88(9):2364-2372
Ecologists commonly use grouped or clustered count data to estimate temporal trends in counts, abundance indices, or abundance. For example, the U.S. Breeding Bird Survey data represent multiple counts of birds from within each of multiple, spatially defined routes. Despite a reliance on grouped counts, analytical methods for prospectively estimating precision of trend estimates or statistical power to detect trends that explicitly acknowledge the characteristics of grouped count data are undescribed. These characteristics include the fact that the sampling variance is an increasing function of the mean, and that sampling and group-level variance estimates are generally estimated on different scales (the sampling and log scales, respectively). We address these issues for repeated sampling of a single population using an analytical approach that has the flavor of a generalized linear mixed model, specifically that of a negative binomial-distributed count variable with random group effects. The count mean, including grand intercept, trend, and random group effects, is modeled linearly on the log scale, while sampling variance of the mean is estimated on the log scale via the delta method. Results compared favorably with those derived using Monte Carlo simulations. For example, at trend = 5% per temporal unit, differences in standard errors and in power were modest relative to those estimated by simulation (< or = /11/% and < or = /16/%, respectively), with relative differences among power estimates decreasing to < or = /7/% when power estimated by simulations was > or = 0.50. Similar findings were obtained using data from nine surveys of fingernail clams in the Mississippi River. The proposed method is suggested (1) where simulations are not practical and relative precision or power is desired, or (2) when multiple precision or power calculations are required and where the accuracy of a fraction of those calculations will be confirmed using simulations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号