Show simple item record

dc.contributor.authorBotta, F
dc.contributor.authorLovelace, R
dc.contributor.authorGilbert, L
dc.contributor.authorTurrell, A
dc.date.accessioned2024-09-27T10:13:47Z
dc.date.issued2024-07-24
dc.date.updated2024-09-27T08:44:51Z
dc.description.abstractThe effective and ethical use of data to inform decision-making offers huge value to the public sector, especially when delivered by transparent, reproducible, and robust data processing workflows. One way that governments are unlocking this value is through making their data publicly available, allowing more people and organisations to derive insights. However, open data is not enough in many cases: publicly available datasets need to be accessible in an analysis-ready form from popular data science tools, such as R and Python, for them to realise their full potential. This paper explores ways to maximise the impact of open data with reference to a case study of packaging code to facilitate reproducible analysis. We present the jtstats project, which consists of a main Python package, and a smaller R version, for importing, processing, and visualising large and complex datasets representing journey times, for many transport modes and trip purposes at multiple geographic levels, released by the UK Department for Transport (DfT). jtstats shows how domain specific packages can enable reproducible research within the public sector and beyond, saving duplicated effort and reducing the risks of errors from repeated analyses. We hope that the jtstats project inspires others, particularly those in the public sector, to add value to their data sets by making them more accessible.en_GB
dc.description.sponsorshipEconomic and Social Research Council (ESRC)en_GB
dc.description.sponsorshipAdministrative Data Research UK (ADR UK)en_GB
dc.identifier.citationPublished online 24 July 2024.en_GB
dc.identifier.doihttps://doi.org/10.1177/23998083241267331
dc.identifier.grantnumberES/W003937/1en_GB
dc.identifier.grantnumberES/W004305/1en_GB
dc.identifier.urihttp://hdl.handle.net/10871/137563
dc.identifierORCID: 0000-0002-5681-4535 (Botta, Federico)
dc.identifierScopusID: 56890901600 (Botta, Federico)
dc.identifierResearcherID: I-3688-2019 (Botta, Federico)
dc.language.isoenen_GB
dc.publisherSAGE Publicationsen_GB
dc.relation.urlhttps://www.gov.uk/government/statistics/journey-time-statistics-england-2019en_GB
dc.rights© The Author(s) 2024. This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).en_GB
dc.subjectData science for public gooden_GB
dc.subjectGovernment dataen_GB
dc.subjectOpen sourceen_GB
dc.titlePackaging code and data for reproducible research: A case study of journey time statisticsen_GB
dc.typeArticleen_GB
dc.date.available2024-09-27T10:13:47Z
dc.identifier.issn2399-8083
dc.descriptionThis is the final version. Available from SAGE publications via the DOI in this record. en_GB
dc.descriptionData availability statement. The data set is publicly available at https://www.gov.uk/government/statistics/journey-time-statistics-england-2019en_GB
dc.identifier.eissn2399-8091
dc.identifier.journalEnvironment and Planning B Urban Analytics and City Scienceen_GB
dc.relation.ispartofEnvironment and Planning B Urban Analytics and City Science
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_GB
rioxxterms.versionVoRen_GB
rioxxterms.licenseref.startdate2024-07-24
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2024-09-27T10:04:31Z
refterms.versionFCDVoR
refterms.dateFOA2024-09-27T10:13:56Z
refterms.panelBen_GB
refterms.dateFirstOnline2024-07-24


Files in this item

This item appears in the following Collection(s)

Show simple item record

© The Author(s) 2024. This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
Except where otherwise noted, this item's licence is described as © The Author(s) 2024. This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).