Identifying causal variants by fine mapping across multiple studies

Collaborators:

Nathan LaPiere, Kodi Taraszka, Rosemary He, Farhad Hormozdiari

Principal Investigator: Dr. Eleazar Eskin

2019.6 - 2021.6

Abstract. Increasingly large Genome-Wide Association Studies (GWAS) have yielded numerous variants associated with many complex traits, motivating the development of ''fine mapping'' methods to identify which of the associated variants are causal. Additionally, GWAS of the same trait for different populations are increasingly available, raising the possibility of refining fine mapping results further by leveraging different linkage disequilibrium (LD) structures across studies. Here, we introduce multiple study causal variants identification in associated regions (MsCAVIAR), a method that extends the popular CAVIAR fine mapping framework to a multiple study setting using a random effects model. MsCAVIAR only requires summary statistics and LD as input, accounts for uncertainty in association statistics using a multivariate normal model, allows for multiple causal variants at a locus, and explicitly models the possibility of different SNP effect sizes in different populations. In a trans-ethnic, trans-biobank Type 2 Diabetes analysis, we show that MsCAVIAR returns causal set sizes that are over 20% smaller than those given by current state of the art methods for trans-ethnic fine-mapping.

Software is available at: https://github.com/nlapier2/MsCAVIAR

I was involved in developing the software, fixing the math, designing & running simulation studies, analyzing & comparing simulation results, and drafting part of the manuscript.

BIGSummer19_He_Huang.jpg