Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples
Bailey, Matthew H.; Meyerson, William U.; Dursi, L. Jonathan; Wang, Liang-Bo; Dong, Guanlan; Liang, Wen-Wei; Weerasinghe, Amila; Li, Shantao; Li, Yize; Kelso, Sean; Aure, Miriam Ragle; Børresen-Dale, Anne-Lise; Langerød, Anita; Baumhoer, Daniel; Bjerkehagen, Bodil; Garred, Øystein; Lingjærde, Ole Christian; Sauer, Torill; Zaikova, Olga; Myklebost, Ola; Knappskog, Stian; Vazquez, Miguel; Akbani, Rehan; Anur, Pavana; Buchanan, Alex; Chiotti, Kami; Covington, Kyle; Creason, Allison; Niu, Beifang; Bieg, Matthias; Boutros, Paul C.; Buchhalter, Ivo; Butler, Adam P.; Chen, Ken; Chong, Zechen; Drechsel, Oliver; Aaltonen, Lauri; Abascal, Federico; Abeshouse, Adam; Aburatani, H; Adams, David J.; Van Loo, Peter; Saksena, Gordon; Ellrott, Kyle; Wendl, Michael C.; Wheeler, David A.; Getz, Gad; Simpson, Jared T.; Gerstein, Mark B.; Ding, Li
Journal article, Peer reviewed
MetadataShow full item record
Original versionNature Communications. 2020, 11, 4748 10.1038/s41467-020-18151-y
The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts.