University of Exeter
Browse

The power of noise and the art of prediction

Download (9.92 MB)
journal contribution
posted on 2025-08-01, 08:39 authored by Z Xiao, S Higgins
Data analysis usually aims to identify a particular signal, such as an intervention effect. Conventional analyses often assume a specific data generation process, which implies a theoretical model that best fits the data. Machine learning techniques do not make such an assumption. In fact, they encourage multiple models to compete on the same data. Applying logistic regression and machine learning algorithms to real and simulated datasets with different features of noise and signal, we demonstrate that no single model dominates others under all circumstances. By showing when different models shine or struggle, we argue that it is important to conduct predictive analyses using cross-validation for better evidence that informs decision making.

Funding

Education Endowment Foundation

History

Related Materials

Rights

© 2017. This version is made available under the CC-BY-NC-ND 4.0 license: https://creativecommons.org/licenses/by-nc-nd/4.0/

Journal

International Journal of Educational Research

Publisher

Elsevier

Version

  • Accepted Manuscript

Language

en

FCD date

2020-01-25T08:44:14Z

FOA date

2020-01-27T08:44:42Z

Citation

Vol. 87, pp. 36 - 46

Department

  • School of Education

Usage metrics

    University of Exeter

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC