|Title||Covariate-modulated local false discovery rate for genome-wide association studies.|
|Publication Type||Journal Article|
|Year of Publication||2014|
|Authors||Zablocki RW, Schork AJ, Levine RA, Andreassen OA, Dale AM, Thompson WK|
|Date Published||2014 Aug 1|
|Keywords||Analysis of Variance, Bayes Theorem, Computational Biology, False Positive Reactions, Genome-Wide Association Study, Humans, Polymorphism, Single Nucleotide|
MOTIVATION: Genome-wide association studies (GWAS) have largely failed to identify most of the genetic basis of highly heritable diseases and complex traits. Recent work has suggested this could be because many genetic variants, each with individually small effects, compose their genetic architecture, limiting the power of GWAS, given currently obtainable sample sizes. In this scenario, Bonferroni-derived thresholds are severely underpowered to detect the vast majority of associations. Local false discovery rate (fdr) methods provide more power to detect non-null associations, but implicit assumptions about the exchangeability of single nucleotide polymorphisms (SNPs) limit their ability to discover non-null loci.
METHODS: We propose a novel covariate-modulated local false discovery rate (cmfdr) that incorporates prior information about gene element-based functional annotations of SNPs, so that SNPs from categories enriched for non-null associations have a lower fdr for a given value of a test statistic than SNPs in unenriched categories. This readjustment of fdr based on functional annotations is achieved empirically by fitting a covariate-modulated parametric two-group mixture model. The proposed cmfdr methodology is applied to a large Crohn's disease GWAS.
RESULTS: Use of cmfdr dramatically improves power, e.g. increasing the number of loci declared significant at the 0.05 fdr level by a factor of 5.4. We also demonstrate that SNPs were declared significant using cmfdr compared with usual fdr replicate in much higher numbers, while maintaining similar replication rates for a given fdr cutoff in de novo samples, using the eight Crohn's disease substudies as independent training and test datasets. Availability an implementation: https://sites.google.com/site/covmodfdr/
CONTACT: : email@example.com
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
|PubMed Central ID||PMC4103587|
|Grant List||R01 GM104400 / GM / NIGMS NIH HHS / United States|