Misannotation of multiple-nucleotide variants risks misdiagnosis
dc.contributor.author | Wakeling, MN | |
dc.contributor.author | Laver, TW | |
dc.contributor.author | Colclough, K | |
dc.contributor.author | Parish, A | |
dc.contributor.author | Ellard, S | |
dc.contributor.author | Baple, EL | |
dc.date.accessioned | 2020-01-14T09:56:51Z | |
dc.date.issued | 2020-01-09 | |
dc.description.abstract | Multiple Nucleotide Variants (MNVs) are miscalled by the most widely utilised next generation sequencing analysis (NGS) pipelines, presenting the potential for missing diagnoses. These variants, which should be treated as a single insertion-deletion mutation event, are commonly called as separate single nucleotide variants. This can result in misannotation, incorrect amino acid predictions and potentially false positive and false negative diagnostic results. Using simulated data and re-analysis of sequencing data from a diagnostic targeted gene panel, we demonstrate that the widely adopted pipeline, GATK best practices, results in miscalling of MNVs and that alternative tools can call these variants correctly. The adoption of calling methods that annotate MNVs correctly would present a solution for individual laboratories, however GATK best practices are the basis for important public resources such as the gnomAD database. We suggest integrating a solution into these guidelines would be the optimal approach. | en_GB |
dc.description.sponsorship | Wellcome Trust | en_GB |
dc.description.sponsorship | Newlife Foundation for Disabled Children | en_GB |
dc.identifier.citation | Vol. 4, article 145 | en_GB |
dc.identifier.doi | 10.12688/wellcomeopenres.15420.2 | |
dc.identifier.grantnumber | 098395 | en_GB |
dc.identifier.grantnumber | SG/16-17/02 | en_GB |
dc.identifier.uri | http://hdl.handle.net/10871/40401 | |
dc.language.iso | en | en_GB |
dc.publisher | F1000 Research Ltd | en_GB |
dc.rights | © 2020 Wakeling MN et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. | en_GB |
dc.subject | multi nucleotide variants | en_GB |
dc.subject | GnomAD | en_GB |
dc.subject | GATK | en_GB |
dc.subject | variant calling | en_GB |
dc.subject | next generation sequencing | en_GB |
dc.subject | genetic testing | en_GB |
dc.title | Misannotation of multiple-nucleotide variants risks misdiagnosis | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2020-01-14T09:56:51Z | |
dc.description | This is the published version [version 2; peer review: 2 approved]. Available on open access from F1000Research via the DOI in this record | en_GB |
dc.description | Data availability: Underlying data: Simulated MNV data is available at https://github.com/rdemolgen/MNV-test-data Archived simulated MNV data at time of publication: http://doi.org/10.5281/zenodo.3375579 License: GNU General Public License v3.0 The dataset of 1447 samples previously sequenced cannot be shared due to patient confidentiality issues, as the genotype data could be used to identify individuals and so cannot be made openly available. Requests for access to the anonymised data by researchers will be considered following an application to the Genetic Beta Cell Research Bank (https://www.diabetesgenes.org/current-research/genetic-beta-cell-research-bank/) with proposals reviewed by the Genetic Data Access Committee. | en_GB |
dc.identifier.eissn | 2398-502X | |
dc.identifier.journal | Wellcome Open Research | en_GB |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_GB |
exeter.funder | ::Wellcome Trust | en_GB |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2020-01-09 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2020-01-14T09:53:44Z | |
refterms.versionFCD | VoR | |
refterms.dateFOA | 2020-01-14T09:57:03Z | |
refterms.panel | A | en_GB |
refterms.depositException | publishedGoldOA |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's licence is described as © 2020 Wakeling MN et al. This is an open access article distributed under the terms of the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.