Perils and pitfalls of mixed-effects regression models in biology
dc.contributor.author | Silk, MJ | |
dc.contributor.author | Harrison, XA | |
dc.contributor.author | Hodgson, DJ | |
dc.date.accessioned | 2020-10-15T14:45:31Z | |
dc.date.issued | 2020-08-12 | |
dc.description.abstract | Biological systems, at all scales of organisation from nucleic acids to ecosystems, are inherently complex and variable. Biologists therefore use statistical analyses to detect signal among this systemic noise. Statistical models infer trends, find functional relationships and detect differences that exist among groups or are caused by experimental manipulations. They also use statistical relationships to help predict uncertain futures. All branches of the biological sciences now embrace the possibilities of mixed-effects modelling and its flexible toolkit for partitioning noise and signal. The mixed-effects model is not, however, a panacea for poor experimental design, and should be used with caution when inferring or deducing the importance of both fixed and random effects. Here we describe a selection of the perils and pitfalls that are widespread in the biological literature, but can be avoided by careful reflection, modelling and model-checking. We focus on situations where incautious modelling risks exposure to these pitfalls and the drawing of incorrect conclusions. Our stance is that statements of significance, information content or credibility all have their place in biological research, as long as these statements are cautious and well-informed by checks on the validity of assumptions. Our intention is to reveal potential perils and pitfalls in mixed model estimation so that researchers can use these powerful approaches with greater awareness and confidence. Our examples are ecological, but translate easily to all branches of biology. | en_GB |
dc.description.sponsorship | University of Exeter | en_GB |
dc.identifier.citation | Vol. 8, article e9522 | en_GB |
dc.identifier.doi | 10.7717/peerj.9522 | |
dc.identifier.uri | http://hdl.handle.net/10871/123254 | |
dc.language.iso | en | en_GB |
dc.publisher | PeerJ | en_GB |
dc.rights | © 2020 Silk et al. Open access. Distributed under Creative Commons CC BY 4.0: https://www.creativecommons.org/licenses/by/4.0/ | en_GB |
dc.subject | Data analysis | en_GB |
dc.subject | Random effect | en_GB |
dc.subject | Fixed effect | en_GB |
dc.subject | Autocorrelation | en_GB |
dc.subject | Hierarchical model | en_GB |
dc.subject | Informative cluster size | en_GB |
dc.subject | Confounding by cluster | en_GB |
dc.subject | GLMM | en_GB |
dc.title | Perils and pitfalls of mixed-effects regression models in biology | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2020-10-15T14:45:31Z | |
dc.description | This is the final version. Available on open access from PeerJ via the DOI in this record | en_GB |
dc.description | Data Availability: The following information was supplied regarding data availability: The R code used to conduct all simulations in the paper is available in the Supplemental Files. | en_GB |
dc.identifier.journal | PeerJ | en_GB |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_GB |
dcterms.dateAccepted | 2020-06-20 | |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2020-08-12 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2020-10-15T14:42:26Z | |
refterms.versionFCD | VoR | |
refterms.dateFOA | 2020-10-15T14:45:39Z | |
refterms.panel | A | en_GB |
refterms.depositException | publishedGoldOA |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's licence is described as © 2020 Silk et al. Open access. Distributed under Creative Commons CC BY 4.0: https://www.creativecommons.org/licenses/by/4.0/