Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates

研究成果: Article査読

抄録

Issues regarding missing data are critical in observational and experimental research. Recently, for datasets with mixed continuous–discrete variables, multiple imputation by chained equation (MICE) has been widely used, although MICE may yield severely biased estimates. We propose a new semiparametric Bayes multiple imputation approach that can deal with continuous and discrete variables. This enables us to overcome the shortcomings of MICE; they must satisfy strong conditions (known as compatibility) to guarantee obtained estimators are consistent. Our simulation studies show the coverage probability of 95% interval calculated using MICE can be less than 1%, while the MSE of the proposed can be less than one-fiftieth. We applied our method to the Alzheimer’s Disease Neuroimaging Initiative (ADNI) dataset, and the results are consistent with those of the previous works that used panel data other than ADNI database, whereas the existing methods, such as MICE, resulted in inconsistent results.

本文言語English
ページ(範囲)803-825
ページ数23
ジャーナルAnnals of the Institute of Statistical Mathematics
72
3
DOI
出版ステータスPublished - 2020 6 1

ASJC Scopus subject areas

  • Statistics and Probability

フィンガープリント 「Semiparametric Bayesian multiple imputation for regression models with missing mixed continuous–discrete covariates」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル