Comparison of Bayesian approaches for developing prediction models in rare disease: application to the identification of patients with Maturity-Onset Diabetes of the Young
dc.contributor.author | Cardoso, P | |
dc.contributor.author | McDonald, TJ | |
dc.contributor.author | Patel, KA | |
dc.contributor.author | Pearson, ER | |
dc.contributor.author | Hattersley, AT | |
dc.contributor.author | Shields, BM | |
dc.contributor.author | McKinley, TJ | |
dc.date.accessioned | 2024-06-04T14:11:59Z | |
dc.date.issued | 2024-06-04 | |
dc.date.updated | 2024-05-07T14:51:49Z | |
dc.description.abstract | Background Clinical prediction models can help identify high-risk patients and facilitate timely interventions. However, developing such models for rare diseases presents challenges due to the scarcity of affected patients for developing and calibrating models. Methods that pool information from multiple sources can help with these challenges. Methods We compared three approaches for developing clinical prediction models for population screening based on an example of discriminating a rare form of diabetes (Maturity-Onset Diabetes of the Young - MODY) in insulin-treated patients from the more common Type 1 diabetes (T1D). Two datasets were used: a case-control dataset (278 T1D, 177 MODY) and a population-representative dataset (1418 patients, 96 MODY tested with biomarker testing, 7 MODY positive). To build a population-level prediction model, we compared three methods for recalibrating models developed in case-control data. These were prevalence adjustment (“offset”), shrinkage recalibration in the population-level dataset (“recalibration”), and a refitting of the model to the population-level dataset (“re-estimation”). We then developed a Bayesian hierarchical mixture model combining shrinkage recalibration with additional informative biomarker information only available in the population-representative dataset. We developed a method for dealing with missing biomarker and outcome information using prior information from the literature and other data sources to ensure the clinical validity of predictions for certain biomarker combinations. Results The offset, re-estimation, and recalibration methods showed good calibration in the population-representative dataset. The offset and recalibration methods displayed the lowest predictive uncertainty due to borrowing information from the fitted case-control model. We demonstrate the potential of a mixture model for incorporating informative biomarkers, which significantly enhanced the model’s predictive accuracy, reduced uncertainty, and showed higher stability in all ranges of predictive outcome probabilities. Conclusion We have compared several approaches that could be used to develop prediction models for rare diseases. Our findings highlight the recalibration mixture model as the optimal strategy if a population-level dataset is available. This approach offers the flexibility to incorporate additional predictors and informed prior probabilities, contributing to enhanced prediction accuracy for rare diseases. It also allows predictions without these additional tests, providing additional information on whether a patient should undergo further biomarker testing before genetic testing. | en_GB |
dc.description.sponsorship | Research England | en_GB |
dc.description.sponsorship | Wellcome Trust | en_GB |
dc.description.sponsorship | Diabetes UK | en_GB |
dc.description.sponsorship | National Institute for Health and Care Research (NIHR) | en_GB |
dc.identifier.citation | Vol. 24, article 128 | en_GB |
dc.identifier.doi | 10.1186/s12874-024-02239-w | |
dc.identifier.grantnumber | 219606/Z/19/Z | en_GB |
dc.identifier.grantnumber | 21/0006328 | en_GB |
dc.identifier.uri | http://hdl.handle.net/10871/136123 | |
dc.identifier | ORCID: 0000-0002-9485-3236 (McKinley, Trevelyan) | |
dc.language.iso | en | en_GB |
dc.publisher | BMC | en_GB |
dc.relation.url | https://www.diabetesgenes.org/current-research/genetic-beta-cell-research-bank/ | en_GB |
dc.relation.url | https://exetercrfnihr.org/about/exeter-10000-prb/ | en_GB |
dc.rights | © The Author(s) 2024. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecom mons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data | en_GB |
dc.subject | MODY | en_GB |
dc.subject | Bayesian modelling | en_GB |
dc.subject | Rare diseases | en_GB |
dc.subject | Prior elicitation | en_GB |
dc.subject | Recalibration | en_GB |
dc.title | Comparison of Bayesian approaches for developing prediction models in rare disease: application to the identification of patients with Maturity-Onset Diabetes of the Young | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2024-06-04T14:11:59Z | |
dc.description | This is the final version. Available on open access from BMC via the DOI in this record | en_GB |
dc.description | Availability of data and materials: Data are available through application to the Genetic Beta Cell Research Bank https://www.diabetesgenes.org/current-research/genetic-beta-cell-research-bank/ and the Peninsula Research Bank https://exetercrfnihr.org/about/exeter-10000-prb/. Example R code for fitting the approaches used in this study is available on GitHub within the repository “Exeter-Diabetes/Pedro-MODY_recal_approaches”. | en_GB |
dc.identifier.eissn | 1471-2288 | |
dc.identifier.journal | BMC Medical Research Methodology | en_GB |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_GB |
dcterms.dateAccepted | 2024-05-06 | |
dcterms.dateSubmitted | 2024-01-12 | |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2024-05-06 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2024-05-07T14:51:51Z | |
refterms.versionFCD | AM | |
refterms.dateFOA | 2024-06-04T14:12:05Z | |
refterms.panel | A | en_GB |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's licence is described as © The Author(s) 2024. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which
permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the
original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or
other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line
to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory
regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this
licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecom mons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data