Informatics and Applications
2015, Volume 9, Issue 3, pp 2-16
ANALYSIS OF SURVEY DATA CONTAINING ROUNDED CENSORING INTERVALS
- Yu. K. Belyaev
- B. Kristrom
Abstract
This paper makes a contribution towards the statistical analysis of data sets containing intervals, that naturally arises in survey contexts. The suggested approach is sufficiently general to cover most cases where interval data are used. Interval data appear in many contexts, such as in reliability studies and survival analysis, in medicine and economics, in opinion elicit surveys, etc. There are several reasons for the extensive use of interval data, perhaps, the most common being one of necessity; exact values of the underlying observations are censored. The nature of the intervals analyzed here is somewhat unusual. The self-selected intervals (SeSeI) are (freely) chosen by the subjects. A generalization of the influential approach has been suggested to the statistical analysis of general censoring introduced by B. W Turnbull. A key independence assumption in Turnbull's analysis has been explained and generalized. A sampling stopping rule based on the coverage probability has been suggested and the properties of a two-step estimator, based on the idea of asking two questions, where the second involves a way of fine-graining the information, has been discussed. This paper provides several informatics methods for SeSeI, targeting the problem of partial nonparametric identification. The properties of the suggested statistical models are stated, including a recursion for easy numerical calculations. An extensive simulation study, displaying, inter alia, the usefulness of the proposed resampling methods for the situation under study, completes the paper.
[+] References (24)
- Manski, C. F 1999. Identification problems in the social sciences. Harvard University Press. 194 p.
- Morgan, M. G., and M. Henrion. 1990. Uncertainty: A guide to dealing with uncertainty in quantitative risk and policy analysis. Cambridge University Press. 325 p.
- Billard, L., and E. Diday. 2006. Symbolic data analysis: Conceptual statistics and data mining. Wiley ser. in compu-tational statistics. Wiley. Vol. 636. 330 p.
- Manski, C. F., and F Molinari. 2010. Rounding probabilistic expectations in surveys. J. Bus. Econ. Stat. 28:219- 231.
- Johansson, P.-O., and B. Kristrom. 2012. The economics of evaluating water projects. Hydroelectricity versus other uses. Heidelberg: Springer. 135 p.
- Belyaev, Y., and B. Kristrom. 2010. Approach to analysis of self-selected interval data. Umea: CERE. Working Paper 2010:2. 1-34. Available at: http:// www.cere.se/se/forskning/publikationer/155-approach- to-analysis-of-self-selected-interval-data.html (accessed September 8, 2015).
- Belyaev, Y., and B. Kristrom. 2012. Two-step approach to self-selected interval data in elicitation surveys. Umea: CERE. Working Paper 2012:10. 1-46. Available at: http://www.cere.se/se/forskning/publikationer/386- two-step-approach-to-self-selected-interval-data-in- elicitation-surveys.html (accessed September 8, 2015).
- Turnbull, B. W 1974. Nonparametric estimation of a sur-vivorship function with doubly censored data. J. Am. Stat. Assoc. 69:169-173.
- Turnbull, B.W. 1976. The empirical distribution function with arbitrarily grouped, censored and truncated data. J. Roy. Stat. Soc. B 38:290-295.
- Manski, C. F 2003. Partial identification of probability dis-tributions. Springer ser. in statistics. Springer. 196 p.
- Carson, R. 2012. Contingent valuation: A comprehensive bibliography and history. Edward Elgar Publishing. 464 p.
- Hakansson, C. 2008. A new valuation question - analysis of and insights from interval open ended data in contingent valuation. Environ. Resour. Econ. 39(2):175-188.
- Broberg, T, and R. Brannlund. 2008. An alternative in-terpretation of multiple bounded WTP data-certainty de-pendent payment card intervals. Energy Resour. Econ. 30:555-567.
- Mahieu, P., P. Riera, B. Kristroom, R. Broannlund, and M. Giergiczny. 2014. Exploring the determinants of uncertainty in contingent valuation surveys. J. Environ. Econ. Policy 3(2):186-200. Available at: http://dx.doi.org/10.1080/21606544.2013.876941 (accessed September 3, 2015).
- Good, I. J. 1953. The population frequencies of species and the estimation of population parameters. Biometrika 40(3-4):237-264.
- Belyaev, Y., and B. Kristrom. 2013. Analysis of contingent valuation data with self-selected rounded WTP-intervals collected by two-steps sampling plans. 9th Tartu Conference on Multivariate Statistics and 20th IWMSProceedings. Tartu: World Scientific. 48-60.
- Gomez, J., M. Calle, and R. Oller. 2004. Frequentist and Bayesian approaches for interval-censored data. Stat. Pap. 45:139-173.
- Gomez, J., M. Calle, R. Oller, and K. Langhor. 2009. Tutorial on methods for interval-censored data and their implementations in R. Stat. Model. 9(4):259-297.
- Gentleman, R., and C. J. Geyer. 1994. Maximum likelihood for interval censored data: Consistency and computation. Biometrika 81(3):618-623.
- Brinkhuis, J., and V. Tihomirov. 2005. Optimization: In-sights and applications. Princeton-Oxford: Princeton Uni-versity Press. 680 p.
- Jammalamadaka, S. R., and V. Mangalam. 2003. Non-parametric estimation for middle-censored data. J. Non- parametr. Stat. 15:253-265.
- Belyaev, Y. K., and L. Nilsson. 1997. Parametric maximum likelihood estimators. Department ofMathematical Statistics, Umea University. Research Report 1997-15. 1-28.
- Belyaev, Y. K. 2003. Necessary and sufficient conditions for consistency of resampling. Sweden: Centre of Biostochastics, Swedish University of Agricultural Sciences. Research Report 2003-1. 1-26. Available at: http://biostochastics.slu.se/publikationer/ dokument/Report2003_l.pdf (accessed September 3, 2015).
- Klein, J. P., and M. L. Moeschberger. 2003. Survival anal-ysis: Techniques for censored and truncated data. New York, N.Y.: Springer-Verlag. 536 p.
[+] About this article
Title
ANALYSIS OF SURVEY DATA CONTAINING ROUNDED CENSORING INTERVALS
Journal
Informatics and Applications
2015, Volume 9, Issue 3, pp 2-16
Cover Date
2015-02-30
DOI
10.14357/19922264150301
Print ISSN
1992-2264
Publisher
Institute of Informatics Problems, Russian Academy of Sciences
Additional Links
Key words
elicitation surveys; random sampling; rounding; anchoring; coverage probability; likelihood; recursion; maximization; resampling
Authors
Yu. K. Belyaev and B. Kristrom
Author Affiliations
Department of Mathematics and Mathematical Statistics, Ume a University, Ume a SE-901 87, Sweden, yuri.belyaev@umu.se
Center for Environmental and Resource Economics (CERE), Swedish University of Agricultural Sciences, Ume a SE-901 83, Sweden,
bengt.kristrom@umu.se
|