CaMutQC: An R Package for Integrative Quality Control of Cancer Somatic Mutations

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The quality control and filtration of cancer somatic mutations (CAMs), including the elimination of false positives resulting from technical bias and the selection of key mutation candidates, is crucial for downstream analysis in cancer genomics. Due to diverse needs and the absence of standardized filtering criteria, the filtering strategies employed vary from study to study, often leading to reduced efficiency, accuracy, and comparability across similar analyses. Here we present CaMutQC, a heuristic quality control and soft-filtering R/Bioconductor package for CAMs. With CaMutQC, the removal of false positives, selection of potential mutation candidates, and Tumor Mutation Burden estimation can be executed in a single line of code, using default or customized parameters. A filter report and a code log generated after the filtration process assist with recording and comparison. The application of CaMutQC on a Whole-exome Sequencing (WES) benchmark dataset demonstrated its impressive capability by eliminating 85.55% of false positive mutations while retaining 90.72% of true positive mutations. Additionally, an extra 11.56% of true positive mutations were recused by the union strategy embedded in CaMutQC. CaMutQC is now available through Bioconductor at https://bioconductor.org/packages/CaMutQC/ under the GPL v3 license, and it will be updated regularly to incorporate top filtration strategy and parameter sets shared within the community.

Article activity feed