dc.contributor.author | Watson, J | |
dc.contributor.author | Zidek, JV | |
dc.contributor.author | Shaddick, G | |
dc.date.accessioned | 2019-09-30T08:45:03Z | |
dc.date.issued | 2019-11-28 | |
dc.description.abstract | This paper presents a general model framework for detecting the
preferential sampling of environmental monitors recording an environmental process across space and/or time. This is achieved by considering the joint distribution of an environmental process with a
site–selection process that considers where and when sites are placed
to measure the process. The environmental process may be spatial,
temporal or spatio–temporal in nature. By sharing random effects between the two processes, the joint model is able to establish whether
site placement was stochastically dependent of the environmental process under study. Furthermore, if stochastic dependence is identified
between the two processes, then inferences about the probability distribution of the spatio–temporal process will change, as will predictions made of the process across space and time. The embedding
into a spatio–temporal framework also allows for the modelling of
the dynamic site—selection process itself. Real–world factors affecting both the size and location of the network can be easily modelled
and quantified. Depending upon the choice of population of locations to consider for selection across space and time under the site–
selection process, different insights about the precise nature of preferential sampling can be obtained. The general framework developed
in the paper is designed to be easily and quickly fit using the R-INLA
package. We apply this framework to a case study involving particulate air pollution over the UK where a major reduction in the size
of a monitoring network through time occurred. It is demonstrated
that a significant response–biased reduction in the air quality monitoring network occurred, namely the relocation of monitoring sites to
locations with the highest pollution levels, and the routine removal
of sites at locations with the lowest. We also show that the network
was consistently unrepresentative of the levels of particulate matter
seen across much of GB throughout the operating life of the network.
Finally we show that this may have led to a severe over-reporting of
the population–average exposure levels experienced across GB. This
could have great impacts on estimates of the health effects of black
smoke levels. | en_GB |
dc.description.sponsorship | Natural Science and Engineering Research Council of Canada | en_GB |
dc.identifier.citation | Vol. 13 (4), pp. 2662-2700 | en_GB |
dc.identifier.doi | 10.1214/19-AOAS1288 | |
dc.identifier.uri | http://hdl.handle.net/10871/38960 | |
dc.language.iso | en | en_GB |
dc.publisher | Institute of Mathematical Statistics | en_GB |
dc.rights | © Institute of Mathematical Statistics, 2019 | |
dc.subject | preferential sampling | en_GB |
dc.subject | INLA | en_GB |
dc.subject | random fields | en_GB |
dc.subject | mobile monitors | en_GB |
dc.subject | health effects | en_GB |
dc.subject | air pollution | en_GB |
dc.subject | Big Data | en_GB |
dc.title | A general theory for preferential sampling in environmental networks | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2019-09-30T08:45:03Z | |
dc.identifier.issn | 1932-6157 | |
dc.description | This is the final version. Available from the Institute of Mathematical Statistics via the DOI in this record | en_GB |
dc.identifier.journal | Annals of Applied Statistics | en_GB |
dc.rights.uri | http://www.rioxx.net/licenses/all-rights-reserved | en_GB |
dcterms.dateAccepted | 2019-07-19 | |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2019-07-19 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2019-09-27T16:00:36Z | |
refterms.versionFCD | AM | |
refterms.dateFOA | 2020-02-12T15:42:08Z | |
refterms.panel | B | en_GB |