TY - JOUR
T1 - Efficient testing and effect size estimation for set-based genetic association inference via semiparametric multilevel mixture modeling
AU - Sugasawa, Shonosuke
AU - Noma, Hisashi
N1 - Funding Information:
This research was supported by CREST (Grant number: JPMJCR1412) from JST, JSPS KAKENHI (Grant numbers: JP16H07406, JP18K12757, JP17H01557), and the Practical Research for Innovative Cancer Control (Grant number: 17ck0106266) from the Japan Agency for Medical Research.
Funding Information:
This research was supported by CREST (Grant number: JPMJCR1412) from JST, JSPS KAKENHI (Grant numbers: JP16H07406, JP18K12757, JP17H01557), and the Practical Research for Innovative Cancer Control (Grant number: 17ck0106266) from the Japan Agency for Medical Research.
Publisher Copyright:
© 2022 Wiley-VCH GmbH.
PY - 2022/8
Y1 - 2022/8
N2 - In genetic association studies, rare variants with extremely low allele frequencies play a crucial role in complex traits. Therefore, set-based testing methods that jointly assess the effects of groups of single nucleotide polymorphisms (SNPs) were developed to increase the powers of the association tests. However, these powers are still insufficient, and precise estimations of the effect sizes of individual SNPs are largely impossible. In this article, we provide an efficient set-based statistical inference framework that addresses both of these important issues simultaneously using an empirical Bayes method with semiparametric multilevel mixture modeling. We propose to utilize the hierarchical model that incorporates variations in set-specific effects and to apply the optimal discovery procedure (ODP) that achieves the largest overall power in multiple significance testing. In addition, we provide an optimal “set-based” estimator of the empirical distribution of effect sizes. The efficiency of the proposed methods is demonstrated through application to a genome-wide association study of coronary artery disease and through simulation studies. The results demonstrated numerous rare variants with large effect sizes for coronary artery disease, and the number of significant sets detected by the ODP was much greater than those identified by existing methods.
AB - In genetic association studies, rare variants with extremely low allele frequencies play a crucial role in complex traits. Therefore, set-based testing methods that jointly assess the effects of groups of single nucleotide polymorphisms (SNPs) were developed to increase the powers of the association tests. However, these powers are still insufficient, and precise estimations of the effect sizes of individual SNPs are largely impossible. In this article, we provide an efficient set-based statistical inference framework that addresses both of these important issues simultaneously using an empirical Bayes method with semiparametric multilevel mixture modeling. We propose to utilize the hierarchical model that incorporates variations in set-specific effects and to apply the optimal discovery procedure (ODP) that achieves the largest overall power in multiple significance testing. In addition, we provide an optimal “set-based” estimator of the empirical distribution of effect sizes. The efficiency of the proposed methods is demonstrated through application to a genome-wide association study of coronary artery disease and through simulation studies. The results demonstrated numerous rare variants with large effect sizes for coronary artery disease, and the number of significant sets detected by the ODP was much greater than those identified by existing methods.
KW - effect size estimation
KW - empirical Bayes
KW - genome-wide association study
KW - optimal discovery procedure
UR - http://www.scopus.com/inward/record.url?scp=85129724358&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129724358&partnerID=8YFLogxK
U2 - 10.1002/bimj.202100234
DO - 10.1002/bimj.202100234
M3 - Article
C2 - 35543501
AN - SCOPUS:85129724358
SN - 0323-3847
VL - 64
SP - 1142
EP - 1152
JO - Biometrische Zeitschrift
JF - Biometrische Zeitschrift
IS - 6
ER -