University of Exeter
Browse

Federated ensemble model-based reinforcement learning in edge computing

Download (1.86 MB)
journal contribution
posted on 2025-08-01, 16:44 authored by J Wang, J Hu, J Mills, G Min, M Xia, N Georgalas
Federated learning (FL) is a privacy-preserving distributed machine learning paradigm that enables collaborative training among geographically distributed and heterogeneous devices without gathering their data. Extending FL beyond the supervised learning models, federated reinforcement learning (FRL) was proposed to handle sequential decision-making problems in edge computing systems. However, the existing FRL algorithms directly combine model-free RL with FL, thus often leading to high sample complexity and lacking theoretical guarantees. To address the challenges, we propose a novel FRL algorithm that effectively incorporates modelbased RL and ensemble knowledge distillation into FL for the first time. Specifically, we utilise FL and knowledge distillation to create an ensemble of dynamics models for clients, and then train the policy by solely using the ensemble model without interacting with the environment. Furthermore, we theoretically prove that the monotonic improvement of the proposed algorithm is guaranteed. The extensive experimental results demonstrate that our algorithm obtains much higher sample efficiency compared to classic model-free FRL algorithms in the challenging continuous control benchmark environments under edge computing settings. The results also highlight the significant impact of heterogeneous client data and local model update steps on the performance of FRL, validating the insights obtained from our theoretical analysis.

Funding

101008297

EP/X019160/1

EP/X038866/1

Engineering and Physical Sciences Research Council (EPSRC)

European Union Horizon 2020

IEC/NSFC/211460

Royal Society

UK Research and Innovation

History

Related Materials

Rights

© 2023, IEEE. This version is made available under the CC-BY 4.0 license: https://creativecommons.org/licenses/by/4.0/

Notes

This is the author accepted manuscript. The final version is available from the IEEE via the DOI in this record

Journal

IEEE Transactions on Parallel and Distributed Systems

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Version

  • Accepted Manuscript

Language

en

FCD date

2023-04-21T10:57:43Z

FOA date

2023-04-21T11:01:59Z

Citation

Vol. 34 (6), pp. 1848 - 1859

Department

  • Computer Science

Usage metrics

    University of Exeter

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC