首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 687 毫秒
1.
Sampling from partially rank-ordered sets   总被引:1,自引:0,他引:1  
In this paper we introduce a new sampling design. The proposed design is similar to a ranked set sampling (RSS) design with a clear difference that rankers are allowed to declare any two or more units are tied in ranks whenever the units can not be ranked with high confidence. These units are replaced in judgment subsets. The fully measured units are then selected from these partially ordered judgment subsets. Based on this sampling scheme, we develop unbiased estimators for the population mean and variance. We show that the proposed sampling procedure has some advantages over standard ranked set sampling.  相似文献   

2.
Nonparametric mean estimation using partially ordered sets   总被引:2,自引:0,他引:2  
In ranked-set sampling (RSS), the ranker must give a complete ranking of the units in each set. In this paper, we consider a modification of RSS that allows the ranker to declare ties. Our sampling method is simply to break the ties at random so that we obtain a standard ranked-set sample, but also to record the tie structure for use in estimation. We propose several different nonparametric mean estimators that incorporate the tie information, and we show that the best of these estimators is substantially more efficient than estimators that ignore the ties. As part of our comparison of estimators, we develop new results about models for ties in rankings. We also show that there are settings where, to achieve more efficient estimation, ties should be declared not just when the ranker is actually unsure about how units rank, but also when the ranker is sure about the ranking, but believes that the units are close.  相似文献   

3.
Rank-based sampling designs are powerful alternatives to simple random sampling (SRS) and often provide large improvements in the precision of estimators. In many environmental, ecological, agricultural, industrial and/or medical applications the interest lies in sampling designs that are cheaper than SRS and provide comparable estimates. In this paper, we propose a new variation of ranked set sampling (RSS) for estimating the population mean based on the random selection technique to measure a smaller number of observations than RSS design. We study the properties of the population mean estimator using the proposed design and provide conditions under which the mean estimator performs better than SRS and some existing rank-based sampling designs. Theoretical results are augmented with some numerical studies and a real-life example, where we also study the performance of our proposed design under perfect and imperfect ranking situations.  相似文献   

4.
In this paper, we consider design-based estimation using ranked set sampling (RSS) in finite populations. We first derive the first and second-order inclusion probabilities for an RSS design and present two Horvitz–Thompson type estimators using these inclusion probabilities. We also develop an alternate Hansen–Hurwitz type estimator and investigate its properties. In particular, we show that this alternate estimator always outperforms the usual Hansen–Hurwitz type estimator in the simple random sampling with replacement design with comparable sample size. We also develop formulae for ratio estimator for all three developed estimators. The theoretical results are augmented by numerical and simulation studies as well as a case study using a well known data set. These show that RSS design can yield a substantial improvement in efficiency over the usual simple random sampling design in finite populations.  相似文献   

5.
This paper develops statistical inference for population mean and total using stratified judgment post-stratified (SJPS) samples. The SJPS design selects a judgment post-stratified sample from each stratum. Hence, in addition to stratum structure, it induces additional ranking structure within stratum samples. SJPS is constructed from a finite population using either a with or without replacement sampling design. Inference is constructed under both randomization theory and a super population model. In both approaches, the paper shows that the estimators of population mean and total are unbiased. The paper also constructs unbiased estimators for the variance (mean square prediction error) of the sample mean (predictor of population mean), and develops confidence and prediction intervals for the population mean. The empirical evidence shows that the proposed estimators perform better than their competitors in the literature.  相似文献   

6.
In this paper we address the problem of estimation of the variance of a normal population based on a balanced as well as an unbalanced ranked set sample (RSS), which is a modification of the original RSS of McIntyre (1952).We have proposed several methods of estimation of variance by combining different unbiased between and within estimators, and compared their performances  相似文献   

7.
Thompson (1990) introduced the adaptive cluster sampling design and developed two unbiased estimators, the modified Horvitz-Thompson (HT) and Hansen-Hurwitz (HH) estimators, for this sampling design and noticed that these estimators are not a function of the minimal sufficient statistics. He applied the Rao-Blackwell theorem to improve them. Despite having smaller variances, these latter estimators have not received attention because a suitable method or algorithm for computing them was not available. In this paper we obtain closed forms of the Rao-Blackwell versions which can easily be computed. We also show that the variance reduction for the HH estimator is greater than that for the HT estimator using Rao-Blackwell versions. When the condition for extra samples is 0$$ " align="middle" border="0"> , one can expect some Rao-Blackwell improvement in the HH estimator but not in the HT estimator. Two examples are given.  相似文献   

8.
Aranked set sample (RSS), if not balanced, is simply a sample of independent order statistics gener- ated from the same underlying distribution F. Kvam and Samaniego (1994) derived maximum likelihood estimates of F for a general RSS. In many applications, including some in the environ- mental sciences, prior information about F is available to supplement the data-based inference. In such cases, Bayes estimators should be considered for improved estimation. Bayes estimation (using the squared error loss function) of the unknown distribution function F is investigated with such samples. Additionally, the Bayes generalized maximum likelihood estimator (GMLE) is derived. An iterative scheme based on the EM Algorithm is used to produce the GMLE of F. For the case of squared error loss, simple solutions are uncommon, and a procedure to find the solution to the Bayes estimate using the Gibbs sampler is illustrated. The methods are illustrated with data from the Natural Environmental Research Council of Great Britain (1975), representing water discharge of floods on the Nidd River in Yorkshire, England  相似文献   

9.
A design-based strategy for estimating wildlife ungulate abundance in a Mediterranean protected area (Maremma Regional Park) is considered. The estimation is based on pellet group count (clearance count technique) in a set of plots, whose size and number is established on the basis of practical considerations and available resources. The sampling scheme involves a preliminary stratification and subsequent two-stage sampling. In the first stage, large strata (defined through habitat features) are partitioned into spatial units and a sample of units is selected by means of a sampling scheme ensuring inclusion probabilities proportional to unit size, but avoiding the selection of contiguous units. Then, the abundances of the selected units are estimated in a second stage, in which plots are located using a random scheme ensuring an even coverage of the units. In small strata, only the second stage is performed. Unbiased estimators of abundance and conservative estimators of their variances are derived for each strata and for the whole study area. The proposed strategy has been applied since the Summer of 2006 and the estimation results reveal substantial improvement with respect to the previous results obtained by means of an alternative strategy.  相似文献   

10.
This paper reviews design-based estimators for two- and three-stage sampling designs to estimate the mean of finite populations. This theory is then extended to spatial populations with continuous, infinite populations of sampling units at the latter stages. We then assume that the spatial pattern is the result of a spatial stochastic process, so the sampling variance of the estimators can be predicted from the variogram. A realistic cost function is then developed, based on several factors including laboratory analysis, time of fieldwork, and numbers of samples. Simulated annealing is used to find designs with minimum sampling variance for a fixed budget. The theory is illustrated with a real-world problem dealing with the volume of contaminated bed sediments in a network of watercourses. Primary sampling units are watercourses, secondary units are transects perpendicular to the axis of the watercourse, and tertiary units are points. Optimal designs had one point per transect, from one to three transects per watercourse, and the number of watercourses varied depending on the budget. However, if laboratory costs are reduced by grouping all samples within a watercourse into one composite sample, it appeared to be efficient to sample more transects within a watercourse.  相似文献   

11.
Thompson (1990) introduced the adaptive cluster sampling design. This sampling design has been shown to be a useful sampling method for parameter estimation of a clustered and scattered population (Roesch, 1993; Smith et al., 1995; Thompson and Seber, 1996). Two estimators, the modified Hansen-Hurwitz (HH) and Horvitz-Thompson (HT) estimators, are available to estimate the mean or total of a population. Empirical results from previous researches indicate that the modified HT estimator has smaller variance than the modified HH estimator. We analytically compare the properties of these two estimators. Some results are obtained in favor of the modified HT estimator so that practitioners are strongly recommended to use the HT estimator despite easiness of computations for the HH estimator.  相似文献   

12.
Ranked set sampling (RSS) is a sampling procedure that has been shown to provide more efficient procedures than simple random sampling, in particular the Mann-Whitney-Wilcoxon (MWW) statistic and the empirical distribution function (EDF). We briefly review the work of Bohn (1992) and Stokes and Sager (1988) on the effect of imperfect ranking on the RSS-based MWW test and on the RSS-based EDF, respectively. We propose a model for a ranking error probability matrix which we hope will become a useful tool for evaluating RSS-based statistical procedures  相似文献   

13.
Forest surveys performed over a large scale (e.g. national inventories) involve several phases of sampling. The first phase is usually performed by means of a systematic search of the study region, in which the region is partitioned into regular polygons of the same size and points are randomly or systematically selected, one per polygon. In most cases, first-phase points are selected and recognized in orthophotos or very high resolution satellite images available for the whole study area. Disregarding the subsequent phases, the first phase of sampling can be effectively adopted to select small woodlots and tree rows, in the sense that a unit is selected when at least one first-phase point falls within it. On the basis of such a scheme of sampling, approximately unbiased estimators of abundance, coverage and other physical attributes readily measurable from orthophotos (e.g. tree-row length) are proposed, together with estimators of the corresponding variances. A simulation study is performed in order to check the performance of the estimators under several distributions of units over the study area (random, clustered, spatially trended).  相似文献   

14.
A fundamental challenge to estimating population size with mark-recapture methods is heterogeneous capture probabilities and subsequent bias of population estimates. Confronting this problem usually requires substantial sampling effort that can be difficult to achieve for some species, such as carnivores. We developed a methodology that uses two data sources to deal with heterogeneity and applied this to DNA mark-recapture data from grizzly bears (Ursus arctos). We improved population estimates by incorporating additional DNA "captures" of grizzly bears obtained by collecting hair from unbaited bear rub trees concurrently with baited, grid-based, hair snag sampling. We consider a Lincoln-Petersen estimator with hair snag captures as the initial session and rub tree captures as the recapture session and develop an estimator in program MARK that treats hair snag and rub tree samples as successive sessions. Using empirical data from a large-scale project in the greater Glacier National Park, Montana, USA, area and simulation modeling we evaluate these methods and compare the results to hair-snag-only estimates. Empirical results indicate that, compared with hair-snag-only data, the joint hair-snag-rub-tree methods produce similar but more precise estimates if capture and recapture rates are reasonably high for both methods. Simulation results suggest that estimators are potentially affected by correlation of capture probabilities between sample types in the presence of heterogeneity. Overall, closed population Huggins-Pledger estimators showed the highest precision and were most robust to sparse data, heterogeneity, and capture probability correlation among sampling types. Results also indicate that these estimators can be used when a segment of the population has zero capture probability for one of the methods. We propose that this general methodology may be useful for other species in which mark-recapture data are available from multiple sources.  相似文献   

15.
Many animal societies contain few breeders and many helpers. In some species, individuals can become either helpers or breeders, and breeding is often restricted to high rankers in a dominance hierarchy. In social Hymenoptera with no morphological queen caste, all females have the potential to become dominant and reproduce. In our model, females involved in the hierarchy do not work and gain direct fitness if they become alpha following death of the current alpha, while females not in the hierarchy gain indirect fitness by working. Using an inclusive-fitness model combined with models of alpha replacement in slightly different types of near-linear hierarchies, we determined the critical rank (i.e., optimal hierarchy length) for which the gain in fitness by entering the hierarchy as a hopeful reproductive equals the cost caused by a reduced worker force. High relatedness decreases the hierarchy length by lowering the potential benefit of entering the hierarchy. In contrast, large colony size increases the hierarchy length because the cost of each non-working high ranker is relatively lower. The model predicts a hierarchy of five workers in the queenless ant Dinoponera quadriceps, close to empirical data which show that in a colony of about 100 females, the top 5 carry out 72.8% of the aggressive interactions. The model is also used to make several testable predictions about the effect on hierarchy length of (1) variation in colony size and (2) variation in worker reproductive options between species of the queenless ants genus Diacamma. Received: 16 March 1999 / Received in revised form: 30 June 1999 / Accepted: 11 July 1999  相似文献   

16.
Although not design-unbiased, the ratio estimator is recognized as more efficient when a certain degree of correlation exists between the variable of primary interest and the auxiliary variable. Meanwhile, the Rao–Blackwell method is another commonly used procedure to improve estimation efficiency. Various improved ratio estimators under adaptive cluster sampling (ACS) that make use of the auxiliary information together with the Rao–Blackwellized univariate estimators have been proposed in past research studies. In this article, the variances and the associated variance estimators of these improved ratio estimators are proposed for a thorough framework of statistical inference under ACS. Performance of the proposed variance estimators is evaluated in terms of the absolute relative percentage bias and the empirical mean-squared error. As expected, results show that both the absolute relative percentage bias and the empirical mean-squared error decrease as the initial sample size increases for all the variance estimators. To evaluate the confidence intervals based on these variance estimators and the finite-population Central Limit Theorem, the coverage rate and the interval width are used. These confidence intervals suffer a disadvantage similar to that of the conventional ratio estimator. Hence, alternative confidence intervals based on a certain type of adjusted variance estimators are constructed and assessed in this article.  相似文献   

17.
The exchange of genetic information between coral reefs through the transport of larvae can be described in terms of networks that capture the linkages between distant populations. A key question arising from these networks is the determination of the highly connected modules (communities). Communities can be defined using genetic similarity or distance statistics between multiple samples but due to limited specimen sampling capacity the boundaries of the communities for the known coral reefs in the seascape remain unresolved. In this study we use the microsatellite composition of individual corals to compare sample populations using a genetic dissimilarity measure (FST) which is then used to create a complex network. This network involved sampling 1025 colonies from 22 collection sites and examining 10 microsatellites loci. The links between each sampling site were given a strength that was created from the pair wise FST values. The result is an undirected weighted network describing the genetic dissimilarity between each sampled population. From this network we then determined the community structure using a leading eigenvector algorithm within graph theory. However, given the relatively limited sampling conducted, the representation of the regional genetic structure was incomplete. To assist with defining the boundaries of the genetically based communities we also integrated the communities derived from a hydrodynamic and distance based networks. The hydrodynamic network, though more comprehensive, was of smaller spatial extent than our genetic sampling. A Bayesian Belief network was developed to integrate the overlapping communities. The results indicate the genetic population structure of the Great Barrier Reef and provide guidance on where future genetic sampling should take place to complete the genetic diversity mapping.  相似文献   

18.
The ranked-set sampling (RSS) is applicable in practical problems where the variable of interest for an observed item is costly or time-consuming but the ranking of a set of items according to the variable can be easily done without actual measurement. In the context of RSS, the need for density estimation arises in certain statistical procedures. The density estimation also has its own interest. In this article, we develop a method for the density estimation using RSS data. We derive the properties of the resulted density estimate and compare it with its counterpart in simple random sampling (SRS). It is shown that the density estimate using RSS data provides a better estimate of the density than the usual density estimate using SRS data. The density estimate developed in this article can well serve various purposes in the context of RSS.  相似文献   

19.
Order restricted randomized designs and two sample inference   总被引:1,自引:0,他引:1  
This paper develops a new design that relies on subjective judgment ranking to compare subsets of experimental units. This judgment ranking is used along with restricted randomization to improve statistical inference for the contrast between two levels of a treatment. The new design assigns the judgment ranked units in a subset to different treatments. Such an assignment translates the positive dependence among units within each subset into negative dependence for the estimators of treatment means, and hence leads to a reduction in variance for the contrast. For the proposed design, a test for the difference in means of two treatment levels is developed along with an associated confidence interval. It is shown that the null distribution of the proposed test is approximated reasonably well with the Student’s t-distribution for sample sizes as small as 6. A simulation study indicates that the proposed design is advantageous compared to its competitors in the literature for both high and low quality rankings. The new design’s advantage increases with the quality of rankings.  相似文献   

20.
In this article we consider asymptotic properties of the Horvitz-Thompson and Hansen-Hurwitz types of estimators under the adaptive cluster sampling variants obtained by selecting the initial sample by simple random sampling without replacement and by unequal probability sampling with replacement. We develop an asymptotic framework, which basically assumes that the number of units in the initial sample, as well as the number of units and networks in the population tend to infinity, but that the network sizes are bounded. Using this framework we prove that under each of the two variants of adaptive sampling above mentioned, both the Horvitz-Thompson and Hansen-Hurwitz types of estimators are design-consistent and asymptotically normally distributed. In addition we show that the ordinary estimators of their variances are also design-consistent estimators.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号