Panel Informativity Optimizer (PIO): an R package to improve cancer NGS panel informativity

Mutation detection by next generation sequencing (NGS) is routinely used for cancer diagnosis. Selecting an optimal set of genes for a given cancer is not trivial as it has to optimize informativity (i.e. the number of patients with at least one mutation in the panel), while minimizing panel length in order to reduce sequencing costs and increase sensitivity. We propose herein Panel Informativity Optimizer (PIO), an open-source software developed as an R package with a user-friendly graphical interface to help optimize cancer NGS panel informativity. Using patient-level mutational data from either private datasets or preloaded dataset of 91 independent cohort from 31 different cancer type, PIO selects an optimal set of genomic intervals to maximize informativity and panel size in a given cancer type. Different options are offered such as the definition of genomic intervals at the gene or exon level, and the use of optimization strategy at the patient or patient per kilobase level. PIO can also propose an optimal set of genomic intervals to increase informativity of custom panels. A panel tester function is also available for panel benchmarking. Using public databases, as well as data from real-life settings, we demonstrate that PIO allows panel size reduction of up to 1000kb, and accurately predicts the performance of custom or commercial panels. PIO is available online at https://vincentalcazer.shinyapps.io/Panel_informativity_optimizer/ or can be set on a locale machine from https://github.com/VincentAlcazer/PIO.