Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data
dc.contributor.author | Aitken, S | |
dc.contributor.author | Firth, HV | |
dc.contributor.author | McRae, J | |
dc.contributor.author | Halachev, M | |
dc.contributor.author | Kini, U | |
dc.contributor.author | Parker, MJ | |
dc.contributor.author | Lees, MM | |
dc.contributor.author | Lachlan, K | |
dc.contributor.author | Sarkar, A | |
dc.contributor.author | Joss, S | |
dc.contributor.author | Splitt, M | |
dc.contributor.author | McKee, S | |
dc.contributor.author | Németh, AH | |
dc.contributor.author | Scott, RH | |
dc.contributor.author | Wright, CF | |
dc.contributor.author | Marsh, JA | |
dc.contributor.author | Hurles, ME | |
dc.contributor.author | FitzPatrick, DR | |
dc.date.accessioned | 2019-10-28T16:35:02Z | |
dc.date.issued | 2019-10-10 | |
dc.description.abstract | Trio-based whole-exome sequence (WES) data have established confident genetic diagnoses in ∼40% of previously undiagnosed individuals recruited to the Deciphering Developmental Disorders (DDD) study. Here we aim to use the breadth of phenotypic information recorded in DDD to augment diagnosis and disease variant discovery in probands. Median Euclidean distances (mEuD) were employed as a simple measure of similarity of quantitative phenotypic data within sets of ≥10 individuals with plausibly causative de novo mutations (DNM) in 28 different developmental disorder genes. 13/28 (46.4%) showed significant similarity for growth or developmental milestone metrics, 10/28 (35.7%) showed similarity in HPO term usage, and 12/28 (43%) showed no phenotypic similarity. Pairwise comparisons of individuals with high-impact inherited variants to the 32 individuals with causative DNM in ANKRD11 using only growth z-scores highlighted 5 likely causative inherited variants and two unrecognized DNM resulting in an 18% diagnostic uplift for this gene. Using an independent approach, naive Bayes classification of growth and developmental data produced reasonably discriminative models for the 24 DNM genes with sufficiently complete data. An unsupervised naive Bayes classification of 6,993 probands with WES data and sufficient phenotypic information defined 23 in silico syndromes (ISSs) and was used to test a "phenotype first" approach to the discovery of causative genotypes using WES variants strictly filtered on allele frequency, mutation consequence, and evidence of constraint in humans. This highlighted heterozygous de novo nonsynonymous variants in SPTBN2 as causative in three DDD probands. | en_GB |
dc.description.sponsorship | Wellcome Trust | en_GB |
dc.description.sponsorship | Department of Health | en_GB |
dc.description.sponsorship | Wellcome Sanger Institute | en_GB |
dc.description.sponsorship | National Institute for Health Research (NIHR) | en_GB |
dc.description.sponsorship | Medical Research Council (MRC) | en_GB |
dc.identifier.citation | Published online 1 October 2019 | en_GB |
dc.identifier.doi | 10.1016/j.ajhg.2019.09.015 | |
dc.identifier.grantnumber | WT098051 | en_GB |
dc.identifier.grantnumber | 200990/Z/16/Z | en_GB |
dc.identifier.grantnumber | WT091310 | en_GB |
dc.identifier.grantnumber | MR/M02122X/1 | en_GB |
dc.identifier.uri | http://hdl.handle.net/10871/39356 | |
dc.language.iso | en | en_GB |
dc.publisher | Elsevier (Cell Press) | en_GB |
dc.relation.url | https://www.ncbi.nlm.nih.gov/pubmed/31607427 | en_GB |
dc.rights | © 2019 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). | en_GB |
dc.subject | developmental disease | en_GB |
dc.subject | genotype | en_GB |
dc.subject | naive Bayes | en_GB |
dc.subject | phenotype | en_GB |
dc.subject | tSNE | en_GB |
dc.title | Finding Diagnostically Useful Patterns in Quantitative Phenotypic Data | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2019-10-28T16:35:02Z | |
exeter.place-of-publication | United States | en_GB |
dc.description | This is the final version. Available on open access from Elsevier via the DOI in this record | en_GB |
dc.identifier.eissn | 1537-6605 | |
dc.identifier.journal | American Journal of Human Genetics | en_GB |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en_GB |
dcterms.dateAccepted | 2019-09-13 | |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2019-10-10 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2019-10-28T16:29:16Z | |
refterms.versionFCD | VoR | |
refterms.dateFOA | 2019-10-28T16:35:05Z | |
refterms.panel | A | en_GB |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's licence is described as © 2019 The Authors. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).