University of Exeter
Browse

Reinforcement learning for PV-integrated battery storage energy management system: Case study of chemical manufacturing plant in Malaysia

Download (918.44 kB)
conference contribution
posted on 2025-09-17, 08:32 authored by Kai Yi Chow, Saptarshi DasSaptarshi Das
<p dir="ltr">The escalating global demand for energy, coupled with the depletion of fossil fuel reserves, necessitates a shift towards sustainable and alternative energy sources. In regions like Malaysia, abundant sunlight offers a promising avenue for solar energy adoption. However, the intermittent nature of renewable sources requires effective energy storage solution. This study focuses on optimizing an Energy Management System (EMS) for a Photovoltaic (PV)- integrated battery storage system in a chemical manufacturing plant in Malaysia. Traditional approaches to EMS optimization, such as linear programming, face challenges in handling the dynamic and uncertain nature of renewable energy generation. In response, the reinforcement learning (RL), particularly the n-step Q-learning algorithm, emerges as a viable solution. This machine learning technique enables an agent to make decisions in real-time without relying on forecasted data, crucial for unpredictable variables like load demand and energy generation. This paper investigates the economic benefits of implementing an RL-based EMS in the context of Malaysia’s tariff rate. It also explores how varying n-step values influence the performance and decisionmaking efficiency of the Q-learning-based EMS. The simulation framework utilizes historical data from a local chemical manufacturing factory, considering constraints of the battery storage system. Results demonstrate that learning the hyper-parameters significantly impact the agent’s performance, highlighting the importance of fine-tuning these hyper-parameters for efficient decision-making. Utilizing a larger n-step value in the algorithm enhances the agent’s decisionmaking in battery operations, considering cumulative rewards over multiple upcoming time intervals. The EMS, optimized through RL, shows robustness and adaptability, while reducing the cost of energy.</p>

History

Related Materials

  1. 1.
  2. 2.
  3. 3.
    ISBN - Is identical to 9789819623280 (urn:isbn:9789819623280)

Rights

© 2025 The author(s). For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) licence to any Author Accepted Manuscript version arising from this submission.

Rights Retention Status

  • No

Submission date

2024-06-04

Notes

This is the author accepted manuscript. The final version is available from Springer Nature via the DOI in this record

Volume

1234

Pagination

193-206

Publisher

Springer Nature

Name of conference

International Conference on Data Analytics and Insights

Location

Kolkata

Published proceedings

Lecture Notes in Networks and Systems

Version

  • Accepted Manuscript

Language

en

Department

  • Faculty of Environment, Science and Economy

Usage metrics

    University of Exeter

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC