Efficient and flexible Integration of variant characteristics in rare variant association studies using integrated nested Laplace approximation
dc.contributor.author
dc.date.accessioned
2022-04-27T07:07:37Z
dc.date.available
2022-04-27T07:07:37Z
dc.date.issued
2021-02-19
dc.identifier.issn
1553-734X
dc.identifier.uri
dc.description.abstract
Rare variants are thought to play an important role in the etiology of complex diseases and may explain a significant fraction of the missing heritability in genetic disease studies. Next-generation sequencing facilitates the association of rare variants in coding or regulatory regions with complex diseases in large cohorts at genome-wide scale. However, rare variant association studies (RVAS) still lack power when cohorts are small to medium-sized and if genetic variation explains a small fraction of phenotypic variance. Here we present a novel Bayesian rare variant Association Test using Integrated Nested Laplace Approximation (BATI). Unlike existing RVAS tests, BATI allows integration of individual or variant-specific features as covariates, while efficiently performing inference based on full model estimation. We demonstrate that BATI outperforms established RVAS methods on realistic, semi-synthetic whole-exome sequencing cohorts, especially when using meaningful biological context, such as functional annotation. We show that BATI achieves power above 70% in scenarios in which competing tests fail to identify risk genes, e.g. when risk variants in sum explain less than 0.5% of phenotypic variance. We have integrated BATI, together with five existing RVAS tests in the ‘Rare Variant Genome Wide Association Study’ (rvGWAS) framework for data analyzed by whole-exome or whole genome sequencing. rvGWAS supports rare variant association for genes or any other biological unit such as promoters, while allowing the analysis of essential functionalities like quality control or filtering. Applying rvGWAS to a Chronic Lymphocytic Leukemia study we identified eight candidate predisposition genes, including EHMT2 and COPS7A
dc.description.sponsorship
SO, HS, XE, FM and GD received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 635290 (PanCanRisk). SO and GD received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 779257 (Solve-RD). RR received support from the Fundació La Marató 70/307:201726. HS, GD, RR, LD, MB, FM, XE, GE and SO received support of the Spanish Ministry of Economy and Competitiveness, ‘Centro de Excelencia Severo Ochoa 2013-2017, and the CERCA Programme / Generalitat de Catalunya
dc.format.mimetype
application/pdf
dc.language.iso
eng
dc.publisher
Public Library of Science (PLoS)
dc.relation.isformatof
Reproducció digital del document publicat a: https://doi.org/10.1371/journal.pcbi.1007784
dc.relation.ispartof
PLoS Computational Biology, 2021, vol. 17, núm. 2, p. e1007784
dc.relation.ispartofseries
Articles publicats (D-EC)
dc.rights
Attribution 4.0 International
dc.rights.uri
dc.subject
dc.title
Efficient and flexible Integration of variant characteristics in rare variant association studies using integrated nested Laplace approximation
dc.type
info:eu-repo/semantics/article
dc.rights.accessRights
info:eu-repo/semantics/openAccess
dc.type.version
info:eu-repo/semantics/publishedVersion
dc.identifier.doi
dc.identifier.idgrec
033590
dc.type.peerreviewed
peer-reviewed
dc.identifier.eissn
1553-7358