Show simple item record

dc.contributor.authorPanda, DK
dc.contributor.authorDas, S
dc.contributor.authorTownley, S
dc.date.accessioned2022-11-02T14:45:02Z
dc.date.issued2022-10-30
dc.date.updated2022-11-02T13:30:38Z
dc.description.abstractEnergy consumer locations are required for framing effective energy policies. However, due to privacy concerns, it is becoming increasingly difficult to obtain the locational data of the consumers. Machine learning (ML) based classification strategies can be used to find the locational information of the consumers based on their historical energy consumption patterns. The ML methods in this paper are applied to the Residential Energy Consumption Survey 2009 dataset. In this dataset, the number of consumers in the urban area is higher than the rural area, thus making the classification problem unbalanced. The unbalanced classification problem has been solved in original and transformed or reduced feature space using Monte Carlo based under-sampling of the majority class datapoints. The hyperparameters for each classification algorithm family is represented as an optimized pipeline, obtained using the genetic programming (GP) optimizer. The classification performance metrics are then obtained for different algorithm families on the original and transformed feature spaces. Performance comparisons have been reported using univariate and bivariate distributions of the classification metrics viz. accuracy, geometric mean score (GMS), F1 score, precision, area under the curve (AUC) of receiver operator characteristics (ROC). The energy policy aspects for the urban and rural residential consumers based on the classification results have also been discussed.en_GB
dc.description.sponsorshipEuropean Regional Development Fund (ERDF)en_GB
dc.identifier.citationVol. 214, article 119127en_GB
dc.identifier.doihttps://doi.org/10.1016/j.eswa.2022.119127
dc.identifier.grantnumberOC05R18P 0782en_GB
dc.identifier.urihttp://hdl.handle.net/10871/131569
dc.identifierORCID: 0000-0002-8394-5303 (Das, Saptarshi)
dc.language.isoenen_GB
dc.publisherElsevieren_GB
dc.rights© 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).en_GB
dc.subjectresidential energy consumptionen_GB
dc.subjectunbalanced data classificationen_GB
dc.subjectROC curveen_GB
dc.subjectgenetic programmingen_GB
dc.titleHyperparameter optimized classification pipeline for handling unbalanced urban and rural energy consumption patternsen_GB
dc.typeArticleen_GB
dc.date.available2022-11-02T14:45:02Z
dc.identifier.issn0957-4174
exeter.article-number119127
dc.descriptionThis is the final version. Available on open access from Elsevier via the DOI in this record en_GB
dc.descriptionData availability: Data will be made available on request.en_GB
dc.identifier.journalExpert Systems with Applicationsen_GB
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_GB
dcterms.dateAccepted2022-10-21
rioxxterms.versionVoRen_GB
rioxxterms.licenseref.startdate2022-10-30
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2022-11-02T14:24:31Z
refterms.versionFCDAM
refterms.dateFOA2022-11-02T14:45:11Z
refterms.panelBen_GB
refterms.dateFirstOnline2022-10-30


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Except where otherwise noted, this item's licence is described as © 2022 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).