University of Exeter
Browse

Theory and Application of Highly Multivariate High-dimensional Spatial Stochastic Processes

Download (12.15 MB)
thesis
posted on 2025-08-02, 12:52 authored by X Chen
The global reanalysis data set produced by the Copernicus Atmosphere Monitoring Service (CAMS) comprises gridded concentration estimates of various pollutants. The complex inter-pollutant relationships across a large spatial domain characterise the data as highly multivariate and spatially high-dimensional (HMHD). Sparsity among variates p and spatial locations n is the key to addressing the HMHD spatial data problems. Without such sparsity, the joint var-covariance matrix Σnp×np and the precision matrix (Σnp×np)^{-1}, where both p and n are large, would be prohibitive to construct and intractable for inference. The thesis proposes a hybrid mixed spatial graphical model framework and novel concepts such as cross-Markov Random Field (cross-MRF) to comprehensively address all aspects of HMHD spatial data features. Specifically, the framework accommodates any customised conditional independence (CI) among any number of p variate fields at the first stage, alleviating the dynamic memory burden associated with Σnp×np construction. Meanwhile, it facilitates parallelled generation of covariance and precision matrix, with the latter's generation order only scaling linearly in p. In the second stage, the thesis demonstrates the multivariate Hammersley-Clifford theorem from a column-wise conditional perspective and unearths the existence of cross-MRF. The link of the mixed spatial graphical framework and the cross-MRF allows for a mixed conditional approach which achieves the sparsest possible representation of the precision matrix via accommodating the doubly CI among both p and n, resulting in the highest possible exact-zero-value percentage in the precision matrix, alongside its lowest possible generation order. The thesis also explores the possibility of the co-existence of geostatistical and MRF modelling approaches in one unified framework, imparting a potential solution to an open problem. The derived theories are illustrated with 1D and 2D spatial data.

Funding

The Alan Turing Institute

History

Thesis type

  • PhD Thesis

Supervisors

Shaddick, Gavin

Academic Department

Mathematics and Statistics

Degree Title

Doctor of Philosophy

Qualification Level

  • Doctoral

Publisher

University of Exeter

Language

en

Department

  • Doctoral Theses

Usage metrics

    University of Exeter

    Categories

    No categories selected

    Keywords

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC