Statistical screening method for genetic factors influencing susceptibility to common diseases in a two-stage genome-wide association study

Yasunori Sato, Nan Laird, Hideki Suganami, Chikuma Hamada, Naoto Niki, Isao Yoshimura, Teruhiko Yoshida

Research output: Contribution to journalArticle

Abstract

A genome-wide association study (GWAS) is a standard strategy for detecting disease susceptibility genes, despite unsettled controversies on many aspects, including optimal study design and statistical analysis. As for study design, a two-stage design has been applied to maximize cost-effectiveness. However, there has been little consensus on appropriate statistical analysis for two-stage design. Thereby perplexing the researchers as to which statistical measures should be applied at the first stage, and how to determine the significance level of the differences at the second stage. Here, using simulation studies, we compared statistical operating characteristics of the screening in a two-stage GWAS by taking into consideration the proper balance of false-positive and false-negative error. As a result, the lower bound of confidence interval for odds ratios is recommended as the first stage measure, and then the second stage criteria should primarily depend on the purpose of the genome screen or its role in the overall gene-hunting scheme. Based on the simulation study, we suggest rules of thumb about which statistics to use in a given situation. An application of all operating characteristics of the screening method to an actual GWAS for gastric cancer illustrates the practical relevance of our discussion.

Original languageEnglish
Article number46
JournalStatistical Applications in Genetics and Molecular Biology
Volume8
Issue number1
DOIs
Publication statusPublished - 2009 Dec 1
Externally publishedYes

Fingerprint

Genome-Wide Association Study
Genetic Testing
Susceptibility
Screening
Genome
Genes
Two-stage Design
Operating Characteristics
Statistical Analysis
Disease Susceptibility
Simulation Study
Gene
Stomach Neoplasms
Cost-Benefit Analysis
Cost-effectiveness
Significance level
Odds Ratio
Statistical methods
False Positive
Research Personnel

Keywords

  • False discovery rate (FDR)
  • Gastric cancer susceptibility genes
  • Single nucleotide polymorphisms (SNPs)
  • Statistical screening method

ASJC Scopus subject areas

  • Statistics and Probability
  • Molecular Biology
  • Genetics
  • Computational Mathematics

Cite this

Statistical screening method for genetic factors influencing susceptibility to common diseases in a two-stage genome-wide association study. / Sato, Yasunori; Laird, Nan; Suganami, Hideki; Hamada, Chikuma; Niki, Naoto; Yoshimura, Isao; Yoshida, Teruhiko.

In: Statistical Applications in Genetics and Molecular Biology, Vol. 8, No. 1, 46, 01.12.2009.

Research output: Contribution to journalArticle

@article{43660ae819d04002836e36d05b5cf064,
title = "Statistical screening method for genetic factors influencing susceptibility to common diseases in a two-stage genome-wide association study",
abstract = "A genome-wide association study (GWAS) is a standard strategy for detecting disease susceptibility genes, despite unsettled controversies on many aspects, including optimal study design and statistical analysis. As for study design, a two-stage design has been applied to maximize cost-effectiveness. However, there has been little consensus on appropriate statistical analysis for two-stage design. Thereby perplexing the researchers as to which statistical measures should be applied at the first stage, and how to determine the significance level of the differences at the second stage. Here, using simulation studies, we compared statistical operating characteristics of the screening in a two-stage GWAS by taking into consideration the proper balance of false-positive and false-negative error. As a result, the lower bound of confidence interval for odds ratios is recommended as the first stage measure, and then the second stage criteria should primarily depend on the purpose of the genome screen or its role in the overall gene-hunting scheme. Based on the simulation study, we suggest rules of thumb about which statistics to use in a given situation. An application of all operating characteristics of the screening method to an actual GWAS for gastric cancer illustrates the practical relevance of our discussion.",
keywords = "False discovery rate (FDR), Gastric cancer susceptibility genes, Single nucleotide polymorphisms (SNPs), Statistical screening method",
author = "Yasunori Sato and Nan Laird and Hideki Suganami and Chikuma Hamada and Naoto Niki and Isao Yoshimura and Teruhiko Yoshida",
year = "2009",
month = "12",
day = "1",
doi = "10.2202/1544-6115.1490",
language = "English",
volume = "8",
journal = "Statistical Applications in Genetics and Molecular Biology",
issn = "1544-6115",
publisher = "Berkeley Electronic Press",
number = "1",

}

TY - JOUR

T1 - Statistical screening method for genetic factors influencing susceptibility to common diseases in a two-stage genome-wide association study

AU - Sato, Yasunori

AU - Laird, Nan

AU - Suganami, Hideki

AU - Hamada, Chikuma

AU - Niki, Naoto

AU - Yoshimura, Isao

AU - Yoshida, Teruhiko

PY - 2009/12/1

Y1 - 2009/12/1

N2 - A genome-wide association study (GWAS) is a standard strategy for detecting disease susceptibility genes, despite unsettled controversies on many aspects, including optimal study design and statistical analysis. As for study design, a two-stage design has been applied to maximize cost-effectiveness. However, there has been little consensus on appropriate statistical analysis for two-stage design. Thereby perplexing the researchers as to which statistical measures should be applied at the first stage, and how to determine the significance level of the differences at the second stage. Here, using simulation studies, we compared statistical operating characteristics of the screening in a two-stage GWAS by taking into consideration the proper balance of false-positive and false-negative error. As a result, the lower bound of confidence interval for odds ratios is recommended as the first stage measure, and then the second stage criteria should primarily depend on the purpose of the genome screen or its role in the overall gene-hunting scheme. Based on the simulation study, we suggest rules of thumb about which statistics to use in a given situation. An application of all operating characteristics of the screening method to an actual GWAS for gastric cancer illustrates the practical relevance of our discussion.

AB - A genome-wide association study (GWAS) is a standard strategy for detecting disease susceptibility genes, despite unsettled controversies on many aspects, including optimal study design and statistical analysis. As for study design, a two-stage design has been applied to maximize cost-effectiveness. However, there has been little consensus on appropriate statistical analysis for two-stage design. Thereby perplexing the researchers as to which statistical measures should be applied at the first stage, and how to determine the significance level of the differences at the second stage. Here, using simulation studies, we compared statistical operating characteristics of the screening in a two-stage GWAS by taking into consideration the proper balance of false-positive and false-negative error. As a result, the lower bound of confidence interval for odds ratios is recommended as the first stage measure, and then the second stage criteria should primarily depend on the purpose of the genome screen or its role in the overall gene-hunting scheme. Based on the simulation study, we suggest rules of thumb about which statistics to use in a given situation. An application of all operating characteristics of the screening method to an actual GWAS for gastric cancer illustrates the practical relevance of our discussion.

KW - False discovery rate (FDR)

KW - Gastric cancer susceptibility genes

KW - Single nucleotide polymorphisms (SNPs)

KW - Statistical screening method

UR - http://www.scopus.com/inward/record.url?scp=73849149839&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=73849149839&partnerID=8YFLogxK

U2 - 10.2202/1544-6115.1490

DO - 10.2202/1544-6115.1490

M3 - Article

C2 - 19954418

AN - SCOPUS:73849149839

VL - 8

JO - Statistical Applications in Genetics and Molecular Biology

JF - Statistical Applications in Genetics and Molecular Biology

SN - 1544-6115

IS - 1

M1 - 46

ER -