首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 27 毫秒
1.
The ranked-set sampling (RSS) is applicable in practical problems where the variable of interest for an observed item is costly or time-consuming but the ranking of a set of items according to the variable can be easily done without actual measurement. In the context of RSS, the need for density estimation arises in certain statistical procedures. The density estimation also has its own interest. In this article, we develop a method for the density estimation using RSS data. We derive the properties of the resulted density estimate and compare it with its counterpart in simple random sampling (SRS). It is shown that the density estimate using RSS data provides a better estimate of the density than the usual density estimate using SRS data. The density estimate developed in this article can well serve various purposes in the context of RSS.  相似文献   

2.
In this paper, we consider design-based estimation using ranked set sampling (RSS) in finite populations. We first derive the first and second-order inclusion probabilities for an RSS design and present two Horvitz–Thompson type estimators using these inclusion probabilities. We also develop an alternate Hansen–Hurwitz type estimator and investigate its properties. In particular, we show that this alternate estimator always outperforms the usual Hansen–Hurwitz type estimator in the simple random sampling with replacement design with comparable sample size. We also develop formulae for ratio estimator for all three developed estimators. The theoretical results are augmented by numerical and simulation studies as well as a case study using a well known data set. These show that RSS design can yield a substantial improvement in efficiency over the usual simple random sampling design in finite populations.  相似文献   

3.
Aranked set sample (RSS), if not balanced, is simply a sample of independent order statistics gener- ated from the same underlying distribution F. Kvam and Samaniego (1994) derived maximum likelihood estimates of F for a general RSS. In many applications, including some in the environ- mental sciences, prior information about F is available to supplement the data-based inference. In such cases, Bayes estimators should be considered for improved estimation. Bayes estimation (using the squared error loss function) of the unknown distribution function F is investigated with such samples. Additionally, the Bayes generalized maximum likelihood estimator (GMLE) is derived. An iterative scheme based on the EM Algorithm is used to produce the GMLE of F. For the case of squared error loss, simple solutions are uncommon, and a procedure to find the solution to the Bayes estimate using the Gibbs sampler is illustrated. The methods are illustrated with data from the Natural Environmental Research Council of Great Britain (1975), representing water discharge of floods on the Nidd River in Yorkshire, England  相似文献   

4.
Nonparametric mean estimation using partially ordered sets   总被引:2,自引:0,他引:2  
In ranked-set sampling (RSS), the ranker must give a complete ranking of the units in each set. In this paper, we consider a modification of RSS that allows the ranker to declare ties. Our sampling method is simply to break the ties at random so that we obtain a standard ranked-set sample, but also to record the tie structure for use in estimation. We propose several different nonparametric mean estimators that incorporate the tie information, and we show that the best of these estimators is substantially more efficient than estimators that ignore the ties. As part of our comparison of estimators, we develop new results about models for ties in rankings. We also show that there are settings where, to achieve more efficient estimation, ties should be declared not just when the ranker is actually unsure about how units rank, but also when the ranker is sure about the ranking, but believes that the units are close.  相似文献   

5.
Sampling from partially rank-ordered sets   总被引:1,自引:0,他引:1  
In this paper we introduce a new sampling design. The proposed design is similar to a ranked set sampling (RSS) design with a clear difference that rankers are allowed to declare any two or more units are tied in ranks whenever the units can not be ranked with high confidence. These units are replaced in judgment subsets. The fully measured units are then selected from these partially ordered judgment subsets. Based on this sampling scheme, we develop unbiased estimators for the population mean and variance. We show that the proposed sampling procedure has some advantages over standard ranked set sampling.  相似文献   

6.
The United States Environmental Protection Agency's Environmental Monitoring and Assessment Program (EMAP) is designed to describe status, trends and spatial pattern of indicators of condition of the nation's ecological resources. The proposed sampling design for EMAP is based on a triangular systematic grid and employs both variable probability and double sampling. The Horvitz-Thompson estimator provides the foundation of the design-based estimation strategy used in EMAP. However, special features of EMAP designed to accommodate the complexity of sampling environmental resources on a national scale require modifications of standard variance estimation procedures as well as development of new techniques. An overview of variance estimation methods proposed for application to EMAP's sampling strategy for discrete resources is presented.  相似文献   

7.
An estimating function approach to the inference of catch-effort models   总被引:1,自引:0,他引:1  
A class of catch-effort models, which allows for heterogeneous removal probabilities, is proposed for closed populations. The model includes three types of removal probabilities: multiplicative, Poisson and logistic. The usual removal and generalized removal models then become special cases. The equivalence of the proposed model and a special type of capture-recapture model is discussed. A unified estimating function approach is used to estimate the initial population size. For the homogeneous model, the resulting population size estimator based on optimal estimating functions is asymptotically equivalent to the maximum likelihood estimator. One advantage for our approach is that it can be extended to handle the heterogeneous populations in which the maximum likelihood estimators do not exist. The bootstrap method is applied to construct variance estimators and confidence intervals. We illustrate the method by two real data examples. Results of a simulation study investigating the performance of the proposed estimation procedure are presented.  相似文献   

8.
Estimates of a population’s growth rate and process variance from time-series data are often used to calculate risk metrics such as the probability of quasi-extinction, but temporal correlations in the data from sampling error, intrinsic population factors, or environmental conditions can bias process variance estimators and detrimentally affect risk predictions. It has been claimed (McNamara and Harding, Ecol Lett 7:16–20, 2004) that estimates of the long-term variance that incorporate observed temporal correlations in population growth are unaffected by sampling error; however, no estimation procedures were proposed for time-series data. We develop a suite of such long-term variance estimators, and use simulated data with temporally autocorrelated population growth and sampling error to evaluate their performance. In some cases, we get nearly unbiased long-term variance estimates despite ignoring sampling error, but the utility of these estimators is questionable because of large estimation uncertainty and difficulties in estimating correlation structure in practice. Process variance estimators that ignored temporal correlations generally gave more precise estimates of the variability in population growth and of the probability of quasi-extinction. We also found that the estimation of probability of quasi-extinction was greatly improved when quasi-extinction thresholds were set relatively close to population levels. Because of precision concerns, we recommend using simple models for risk estimates despite potential biases, and limiting inference to quantifying relative risk; e.g., changes in risk over time for a single population or comparative risk among populations.  相似文献   

9.
An analysis of counts of sample size N=2 arising from a survey of the grass Bromus commutatus identified several factors which might seriously affect the estimation of parameters of Taylor's power law for such small sample sizes. The small sample estimation of Taylor's power law was studied by simulation. For each of five small sample sizes, N=2, 3, 5, 15 and 30, samples were simulated from populations for which the underlying known relationship between variance and mean was given by 2 = cd. One thousand samples generated from the negative binomial distribution were simulated for each of the six combinations of c=1,2 and 11, and d=1, 2, at each of four mean densities, =0.5, 1, 10 and 100, giving 4000 samples for each combination. Estimates of Taylor's power law parameters were obtained for each combination by regressing log10 s 2 on log10 m, where s 2 and m are the sample variance and mean, respectively. Bias in the parameter estimates, b and log10 a, reduced as N increased and increased with c for both values of d and these relationships were described well by quadratic response surfaces. The factors which affect small-sample estimation are: (i) exclusion of samples for which m = s 2 = 0; (ii) exclusion of samples for which s 2 = 0, but m > 0; (iii) correlation between log10 s 2 and log10 m; (iv) restriction on the maximum variance expressible in a sample; (v) restriction on the minimum variance expressible in a sample; (vi) underestimation of log10 s 2 for skew distributions; and (vii) the limited set of possible values of m and s 2. These factors and their effect on the parameter estimates are discussed in relation to the simulated samples. The effects of maximum variance restriction and underestimation of log10 s 2 were found to be the most severe. We conclude that Taylor's power law should be used with caution if the majority of samples from which s 2 and m are calculated have size, N, less than 15. An example is given of the estimated effect of bias when Taylor's power law is used to derive an efficient sampling scheme.  相似文献   

10.
Estimating the movement of dissolved contaminants in heterogeneous aquifers is very important for the monitoring design, risk assessment, and remediation of contaminated aquifers. This work explored the influence of source size, monitoring distance, and aquifer heterogeneity on the accuracy of contaminant mass discharge (CMD) estimation using leaching surface approach as well as on the plume spread uncertainty in a 2-D heterogeneous aquifer. The interaction among source size, monitoring distance, and aquifer heterogeneity regarding the accuracy of CMD estimation and the plume spread uncertainty at downstream of the contaminated aquifer was extensively investigated. The transient leaking of a contaminated aquifer in a saturated heterogeneous aquifer under steady-state flow conditions was simulated. The effect of aquifer heterogeneity on the CMD uncertainty was evaluated through the expected values and variance. The results showed that the CMD estimation error varied from underestimation in the mildly heterogeneous aquifer, over accurate estimation in the medium heterogeneous aquifer to overestimation in the highly heterogeneous aquifer. Additionally, the results illustrated that the mean and variance of the transverse spatial extent of the peak concentrations for the plume at the control plane were very sensitive to the aquifer heterogeneity and detectable concentrations of contaminants.  相似文献   

11.
Adaptive two-stage one-per-stratum sampling   总被引:1,自引:0,他引:1  
We briefly describe adaptive cluster sampling designs in which the initial sample is taken according to a Markov chain one-per-stratum design (Breidt, 1995) and one or more secondary samples are taken within strata if units in the initial sample satisfy a given condition C. An empirical study of the behavior of the estimation procedure is conducted for three small artificial populations for which adaptive sampling is appropriate. The specific sampling strategy used in the empirical study was a single random-start systematic sample with predefined systematic samples within strata when the initially sampled unit in that stratum satisfies C. The bias of the Horvitz-Thompson estimator for this design is usually very small when adaptive sampling is conducted in a population for which it is suited. In addition, we compare the behavior of several alternative estimators of the standard error of the Horvitz-Thompson estimator of the population total. The best estimator of the standard error is population-dependent but it is not unreasonable to use the Horvitz-Thompson estimator of the variance. Unfortunately, the distribution of the estimator is highly skewed hence the usual approach of constructing confidence intervals assuming normality cannot be used here.  相似文献   

12.
Iwao's quadratic regression or Taylor's Power Law (TPL) are commonly used to model the variance as a function of the mean for sample counts of insect populations which exhibit spatial aggregation. The modeled variance and distribution of the mean are typically used in pest management programs to decide if the population is above the action threshold in any management unit (MU) (e.g., orchard, forest compartment). For nested or multi-level sampling the usual two-stage modeling procedure first obtains the sample variance for each MU and sampling level using ANOVA and then fits a regression of variance on the mean for each level using either Iwao or TPL variance models. Here this approach is compared to the single-stage procedure of fitting a generalized linear mixed model (GLMM) directly to the count data with both approaches demonstrated using 2-level sampling. GLMMs and additive GLMMs (AGLMMs) with conditional Poisson variance function as well as the extension to the negative binomial are described. Generalization to more than two sampling levels is outlined. Formulae for calculating optimal relative sample sizes (ORSS) and the operating characteristic curve for the control decision are given for each model. The ORSS are independent of the mean in the case of the AGLMMs. The application described is estimation of the variance of the mean number of leaves per shoot occupied by immature stages of a defoliator of eucalypts, the Tasmanian Eucalyptus leaf beetle, based on a sample of trees within plots from each forest compartment. Historical population monitoring data were fitted using the above approaches.  相似文献   

13.
In the mid nineteen eighties the Dutch NOx air quality monitoring network was reduced from 73 to 32 rural and city background stations, leading to higher spatial uncertainties. In this study, several other sources of information are being used to help reduce uncertainties in parameter estimation and spatial mapping. For parameter estimation, we used Bayesian inference. For mapping, we used kriging with external drift (KED) including secondary information from a dispersion model. The methods were applied to atmospheric NOx concentrations on rural and urban scales. We compared Bayesian estimation with restricted maximum likelihood estimation and KED with universal kriging. As a reference we also included ordinary least squares (OLS). Comparison of several parameter estimation and spatial interpolation methods was done by cross-validation. Bayesian analysis resulted in an error reduction of 10 to 20% as compared to restricted maximum likelihood, whereas KED resulted in an error reduction of 50% as compared to universal kriging. Where observations were sparse, the predictions were substantially improved by inclusion of the dispersion model output and by using available prior information. No major improvement was observed as compared to OLS, the cause presumably being that much good information is contained in the dispersion model output, so that no additional spatial residual random field is required to explain the data. In all, we conclude that reduction in the monitoring network could be compensated by modern geostatistical methods, and that a traditional simple statistical model is of an almost equal quality.
Jan van de KassteeleEmail:
  相似文献   

14.
Although not design-unbiased, the ratio estimator is recognized as more efficient when a certain degree of correlation exists between the variable of primary interest and the auxiliary variable. Meanwhile, the Rao–Blackwell method is another commonly used procedure to improve estimation efficiency. Various improved ratio estimators under adaptive cluster sampling (ACS) that make use of the auxiliary information together with the Rao–Blackwellized univariate estimators have been proposed in past research studies. In this article, the variances and the associated variance estimators of these improved ratio estimators are proposed for a thorough framework of statistical inference under ACS. Performance of the proposed variance estimators is evaluated in terms of the absolute relative percentage bias and the empirical mean-squared error. As expected, results show that both the absolute relative percentage bias and the empirical mean-squared error decrease as the initial sample size increases for all the variance estimators. To evaluate the confidence intervals based on these variance estimators and the finite-population Central Limit Theorem, the coverage rate and the interval width are used. These confidence intervals suffer a disadvantage similar to that of the conventional ratio estimator. Hence, alternative confidence intervals based on a certain type of adjusted variance estimators are constructed and assessed in this article.  相似文献   

15.
Judgment post stratified (JPS) and ranked set sampling (RSS) designs rely on the ability of a ranker to assign ranks to potential observations on available experimental units. In many settings, there are often more than one rankers available and each of these rankers provide judgment ranks. This paper proposes two sampling schemes, one for JPS and the other for RSS, to combine the judgment ranks of these rankers to produce a strength of agreement measure for each fully measured unit. This strength measure is used to draw inference for the population mean and cumulative distribution function. The paper shows that the estimators constructed based on this strength measure provide a substantial improvement over the same estimators based on judgment ranking information of a single best ranker.  相似文献   

16.
Rank-based sampling designs are powerful alternatives to simple random sampling (SRS) and often provide large improvements in the precision of estimators. In many environmental, ecological, agricultural, industrial and/or medical applications the interest lies in sampling designs that are cheaper than SRS and provide comparable estimates. In this paper, we propose a new variation of ranked set sampling (RSS) for estimating the population mean based on the random selection technique to measure a smaller number of observations than RSS design. We study the properties of the population mean estimator using the proposed design and provide conditions under which the mean estimator performs better than SRS and some existing rank-based sampling designs. Theoretical results are augmented with some numerical studies and a real-life example, where we also study the performance of our proposed design under perfect and imperfect ranking situations.  相似文献   

17.
Developmental toxicity experiments are designed to assess potential adverse effects of drugs and other exposures on developing fetuses from pregnant dams. Extrapolation to humans is a very difficult problem. An important issue here is whether risk assessment should be based on the fetus or the litter level. In this paper, fetus and litter-based risks that properly account for cluster size are defined and compared for the beta-binomial model and a conditional model for clustered binary data. It is shown how the hierarchical structure of non-viable implants and viable but malformed offspring can be incorporated. Risks based on a joint model for death/resorption and malformation are contrasted with risks based on an adverse event defined as either death/resorption or malformation. The estimation of safe exposure levels for all risk types is discussed and it is shown how estimation of the cluster size distribution affects variance estimation. The methods are applied to data collected under the National Toxicology Program and in large sample simulations.  相似文献   

18.
Ratio estimation of the parametric mean for a characteristic measured on plants sampled by a line intercept method is presented and evaluated via simulation using different plant dispersion patterns (Poisson, regular cluster, and Poisson cluster), plant width variances, and numbers of lines. The results indicate that on average the estimates are close to the parametric mean under all three dispersion patterns. Given a fixed number of lines, variability of the estimates is similar across dispersion patterns with variability under the Poisson pattern slightly smaller than varia-bility under the cluster patterns. No variance estimates were negative under the Poisson pattern, but some estimates were negative under the cluster patterns for smaller numbers of lines. Variance estimates become closer to zero similarly for all spatial patterns as the number of lines increases. Ratio estimation of the parametric mean in line intercept sampling works better, from the viewpoint of approximate unbiasedness and variability of estimates, under the Poisson pattern with larger numbers of lines than other combinations of spatial patterns, plant width variances and numbers of lines.  相似文献   

19.
Ranked set sampling (RSS) is a sampling procedure that has been shown to provide more efficient procedures than simple random sampling, in particular the Mann-Whitney-Wilcoxon (MWW) statistic and the empirical distribution function (EDF). We briefly review the work of Bohn (1992) and Stokes and Sager (1988) on the effect of imperfect ranking on the RSS-based MWW test and on the RSS-based EDF, respectively. We propose a model for a ranking error probability matrix which we hope will become a useful tool for evaluating RSS-based statistical procedures  相似文献   

20.
Abstract:  Soberón and Llorente (1993) proposed pure-birth stochastic processes as theoretical models for species-accumulation curves, and these processes have frequently been used to describe the progress of biological inventories. We describe, in algorithmic form, an alternative statistical analysis based on a likelihood approach ( Díaz-Francés & Gorostiza 2002 ) that provides mathematical rigor to the ideas in Soberón and Llorente (1993) and improves the estimation of the models by incorporating the facts that the variance of the error is not constant and that the observations are correlated. Additionally, we used the likelihood ratios between candidate models as an objective procedure for model selection, allowing comparison between the goodness of fit of various models. The software for these statistical methods can now be downloaded off the Internet. We used two examples of butterfly data sets to illustrate the use of the methods and the software.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号