Variance component model to account for sample structure in genome-wide association studies.
|Title||Variance component model to account for sample structure in genome-wide association studies.|
|Publication Type||Journal Article|
|Year of Publication||2010|
|Authors||Kang, HM, Sul JH, Service SK, Zaitlen NA, Kong S-Y, Freimer NB, Sabatti C, Eskin E|
|Date Published||2010 Apr|
|Keywords||Genome-Wide Association Study, Humans, Models, Genetic, Models, Statistical, Polymorphism, Single Nucleotide, Population Groups, Principal Component Analysis, Quantitative Trait Loci, Software|
Although genome-wide association studies (GWASs) have identified numerous loci associated with complex traits, imprecise modeling of the genetic relatedness within study samples may cause substantial inflation of test statistics and possibly spurious associations. Variance component approaches, such as efficient mixed-model association (EMMA), can correct for a wide range of sample structures by explicitly accounting for pairwise relatedness between individuals, using high-density markers to model the phenotype distribution; but such approaches are computationally impractical. We report here a variance component approach implemented in publicly available software, EMMA eXpedited (EMMAX), that reduces the computational time for analyzing large GWAS data sets from years to hours. We apply this method to two human GWAS data sets, performing association analysis for ten quantitative traits from the Northern Finland Birth Cohort and seven common diseases from the Wellcome Trust Case Control Consortium. We find that EMMAX outperforms both principal component analysis and genomic control in correcting for sample structure.
|Alternate Journal||Nat. Genet.|