Hyperparameter optimized classification pipeline for handling unbalanced urban and rural energy consumption patterns
dc.contributor.author | Panda, DK | |
dc.contributor.author | Das, S | |
dc.contributor.author | Townley, S | |
dc.date.accessioned | 2022-11-02T14:45:02Z | |
dc.date.issued | 2022-10-30 | |
dc.date.updated | 2022-11-02T13:30:38Z | |
dc.description.abstract | Energy consumer locations are required for framing effective energy policies. However, due to privacy concerns, it is becoming increasingly difficult to obtain the locational data of the consumers. Machine learning (ML) based classification strategies can be used to find the locational information of the consumers based on their historical energy consumption patterns. The ML methods in this paper are applied to the Residential Energy Consumption Survey 2009 dataset. In this dataset, the number of consumers in the urban area is higher than the rural area, thus making the classification problem unbalanced. The unbalanced classification problem has been solved in original and transformed or reduced feature space using Monte Carlo based under-sampling of the majority class datapoints. The hyperparameters for each classification algorithm family is represented as an optimized pipeline, obtained using the genetic programming (GP) optimizer. The classification performance metrics are then obtained for different algorithm families on the original and transformed feature spaces. Performance comparisons have been reported using univariate and bivariate distributions of the classification metrics viz. accuracy, geometric mean score (GMS), F1 score, precision, area under the curve (AUC) of receiver operator characteristics (ROC). The energy policy aspects for the urban and rural residential consumers based on the classification results have also been discussed. | en_GB |
dc.description.sponsorship | European Regional Development Fund (ERDF) | en_GB |
dc.identifier.citation | Vol. 214, article 119127 | en_GB |
dc.identifier.doi | https://doi.org/10.1016/j.eswa.2022.119127 | |
dc.identifier.grantnumber | OC05R18P 0782 | en_GB |
dc.identifier.uri | http://hdl.handle.net/10871/131569 | |
dc.identifier | ORCID: 0000-0002-8394-5303 (Das, Saptarshi) | |
dc.language.iso | en | en_GB |
dc.publisher | Elsevier | en_GB |
dc.rights | © 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). | en_GB |
dc.subject | residential energy consumption | en_GB |
dc.subject | unbalanced data classification | en_GB |
dc.subject | ROC curve | en_GB |
dc.subject | genetic programming | en_GB |
dc.title | Hyperparameter optimized classification pipeline for handling unbalanced urban and rural energy consumption patterns | en_GB |
dc.type | Article | en_GB |
dc.date.available | 2022-11-02T14:45:02Z | |
dc.identifier.issn | 0957-4174 | |
exeter.article-number | 119127 | |
dc.description | This is the final version. Available on open access from Elsevier via the DOI in this record | en_GB |
dc.description | Data availability: Data will be made available on request. | en_GB |
dc.identifier.journal | Expert Systems with Applications | en_GB |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_GB |
dcterms.dateAccepted | 2022-10-21 | |
rioxxterms.version | VoR | en_GB |
rioxxterms.licenseref.startdate | 2022-10-30 | |
rioxxterms.type | Journal Article/Review | en_GB |
refterms.dateFCD | 2022-11-02T14:24:31Z | |
refterms.versionFCD | AM | |
refterms.dateFOA | 2022-11-02T14:45:11Z | |
refterms.panel | B | en_GB |
refterms.dateFirstOnline | 2022-10-30 |
Files in this item
This item appears in the following Collection(s)
Except where otherwise noted, this item's licence is described as © 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).