Administrative social science data: The challenge of reproducible research
Playford, CJ; Gayle, V; Connelly, R; et al.Gray, AJG
Date: 1 December 2016
Article
Journal
Big Data and Society
Publisher
SAGE Publications
Publisher DOI
Abstract
Powerful new social science data resources are emerging. One particularly important source is administrative data, which
were originally collected for organisational purposes but often contain information that is suitable for social science
research. In this paper we outline the concept of reproducible research in relation to micro-level ...
Powerful new social science data resources are emerging. One particularly important source is administrative data, which
were originally collected for organisational purposes but often contain information that is suitable for social science
research. In this paper we outline the concept of reproducible research in relation to micro-level administrative social
science data. Our central claim is that a planned and organised workflow is essential for high quality research using microlevel
administrative social science data. We argue that it is essential for researchers to share research code, because code
sharing enables the elements of reproducible research. First, it enables results to be duplicated and therefore allows the
accuracy and validity of analyses to be evaluated. Second, it facilitates further tests of the robustness of the original piece
of research. Drawing on insights from computer science and other disciplines that have been engaged in e-Research we
discuss and advocate the use of Git repositories to provide a useable and effective solution to research code sharing and
rendering social science research using micro-level administrative data reproducible.
Social and Political Sciences, Philosophy, and Anthropology
Faculty of Humanities, Arts and Social Sciences
Item views 0
Full item downloads 0