RT Journal Article SR Electronic T1 Identifying Causal Variants at Loci with Multiple Signals of Association JF Genetics JO Genetics FD Genetics Society of America SP 497 OP 508 DO 10.1534/genetics.114.167908 VO 198 IS 2 A1 Hormozdiari, Farhad A1 Kostem, Emrah A1 Kang, Eun Yong A1 Pasaniuc, Bogdan A1 Eskin, Eleazar YR 2014 UL http://www.genetics.org/content/198/2/497.abstract AB Although genome-wide association studies have successfully identified thousands of risk loci for complex traits, only a handful of the biologically causal variants, responsible for association at these loci, have been successfully identified. Current statistical methods for identifying causal variants at risk loci either use the strength of the association signal in an iterative conditioning framework or estimate probabilities for variants to be causal. A main drawback of existing methods is that they rely on the simplifying assumption of a single causal variant at each risk locus, which is typically invalid at many risk loci. In this work, we propose a new statistical framework that allows for the possibility of an arbitrary number of causal variants when estimating the posterior probability of a variant being causal. A direct benefit of our approach is that we predict a set of variants for each locus that under reasonable assumptions will contain all of the true causal variants with a high confidence level (e.g., 95%) even when the locus contains multiple causal variants. We use simulations to show that our approach provides 20–50% improvement in our ability to identify the causal variants compared to the existing methods at loci harboring multiple causal variants. We validate our approach using empirical data from an expression QTL study of CHI3L2 to identify new causal variants that affect gene expression at this locus. CAVIAR is publicly available online at http://genetics.cs.ucla.edu/caviar/.