Abstract Length-biased data are often encountered in cross-sectional surveys and prevalent-cohort studies on disease durations. Under length-biased sampling subjects with longer disease durations have greater chance to be observed. As… Click to show full abstract
Abstract Length-biased data are often encountered in cross-sectional surveys and prevalent-cohort studies on disease durations. Under length-biased sampling subjects with longer disease durations have greater chance to be observed. As a result, covariate values linked to the longer survivors are favored by the sampling mechanism. When the sampled durations are also subject to right censoring, the censoring is informative. Modeling dependence structure without adjusting for these issues leads to biased results. In this article, we consider copulas for modeling dependence when the collected data are length-biased and account for both informative censoring and covariate bias that are naturally linked to length-biased sampling. We address nonparametric estimation of the bivariate distribution, copula function and its density, and Kendall and Spearman measures for right-censored length-biased data. The proposed estimator for the bivariate cdf is a Hadamard-differentiable functional of two MLEs (Kaplan–Meier and empirical cdf) and inherits their efficiency. Based on this estimator, we devise two estimators for copula function and a local-polynomial estimator for copula density that accounts for boundary bias. The limiting processes of the estimators are established by deriving their iid representations. As a by-product, we establish the oscillation behavior of the bivariate cdf estimator. In addition, we introduce estimators for Kendall and Spearman measures and study their weak convergence. The proposed method is applied to analyze a set of right-censored length-biased data on survival with dementia, collected as part of a nationwide study in Canada.
               
Click one of the above tabs to view related content.