Show simple item record

dc.contributor.authorWang, J
dc.contributor.authorHu, J
dc.contributor.authorMills, J
dc.contributor.authorMin, G
dc.contributor.authorXia, M
dc.contributor.authorGeorgalas, N
dc.date.accessioned2023-04-21T11:01:57Z
dc.date.issued2023-04-05
dc.date.updated2023-04-21T08:58:41Z
dc.description.abstractFederated learning (FL) is a privacy-preserving distributed machine learning paradigm that enables collaborative training among geographically distributed and heterogeneous devices without gathering their data. Extending FL beyond the supervised learning models, federated reinforcement learning (FRL) was proposed to handle sequential decision-making problems in edge computing systems. However, the existing FRL algorithms directly combine model-free RL with FL, thus often leading to high sample complexity and lacking theoretical guarantees. To address the challenges, we propose a novel FRL algorithm that effectively incorporates modelbased RL and ensemble knowledge distillation into FL for the first time. Specifically, we utilise FL and knowledge distillation to create an ensemble of dynamics models for clients, and then train the policy by solely using the ensemble model without interacting with the environment. Furthermore, we theoretically prove that the monotonic improvement of the proposed algorithm is guaranteed. The extensive experimental results demonstrate that our algorithm obtains much higher sample efficiency compared to classic model-free FRL algorithms in the challenging continuous control benchmark environments under edge computing settings. The results also highlight the significant impact of heterogeneous client data and local model update steps on the performance of FRL, validating the insights obtained from our theoretical analysis.en_GB
dc.description.sponsorshipEuropean Union Horizon 2020en_GB
dc.description.sponsorshipRoyal Societyen_GB
dc.description.sponsorshipEngineering and Physical Sciences Research Council (EPSRC)en_GB
dc.description.sponsorshipUK Research and Innovationen_GB
dc.identifier.citationVol. 34 (6), pp. 1848 - 1859en_GB
dc.identifier.doihttps://doi.org/10.1109/tpds.2023.3264480
dc.identifier.grantnumber101008297en_GB
dc.identifier.grantnumberIEC/NSFC/211460en_GB
dc.identifier.grantnumberEP/X019160/1en_GB
dc.identifier.grantnumberEP/X038866/1en_GB
dc.identifier.urihttp://hdl.handle.net/10871/132961
dc.identifierORCID: 0000-0001-5406-8420 (Hu, Jia)
dc.identifierORCID: 0000-0003-1395-7314 (Min, Geyong)
dc.language.isoenen_GB
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_GB
dc.rights© 2023, IEEE. This version is made available under the CC-BY 4.0 license: https://creativecommons.org/licenses/by/4.0/  en_GB
dc.subjectEdge Computingen_GB
dc.subjectdistributed machine learningen_GB
dc.subjectfederated learningen_GB
dc.subjectdeep reinforcement learningen_GB
dc.titleFederated ensemble model-based reinforcement learning in edge computingen_GB
dc.typeArticleen_GB
dc.date.available2023-04-21T11:01:57Z
dc.identifier.issn1045-9219
dc.descriptionThis is the author accepted manuscript. The final version is available from the IEEE via the DOI in this record en_GB
dc.identifier.eissn1558-2183
dc.identifier.journalIEEE Transactions on Parallel and Distributed Systemsen_GB
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/  en_GB
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2023-01-01
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2023-04-21T10:57:43Z
refterms.versionFCDAM
refterms.dateFOA2023-04-21T11:01:59Z
refterms.panelBen_GB
refterms.dateFirstOnline2023-04-05


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2023, IEEE. This version is made available under the CC-BY 4.0 license: https://creativecommons.org/licenses/by/4.0/  
Except where otherwise noted, this item's licence is described as © 2023, IEEE. This version is made available under the CC-BY 4.0 license: https://creativecommons.org/licenses/by/4.0/