Show simple item record

dc.contributor.authorBeig Zali, R
dc.contributor.authorLatifi, M
dc.contributor.authorJavadi, AA
dc.contributor.authorFarmani, R
dc.date.accessioned2023-10-13T09:14:54Z
dc.date.issued2023-11-22
dc.date.updated2023-10-13T05:53:40Z
dc.description.abstractIn recent years, machine learning (ML) approaches have been used widely for water pipe condition assessment and failure prediction. These methods require a considerable amount of data from water distribution networks (WDNs). Imbalanced and missing data, whether asset or failure data, compromise a model’s prediction performance. In this research, using only 2 years of failure data in a real WDN, three ML methods—XGBoost, random forest and logistic regression—were used to prioritize asset rehabilitation. To address the issue of imbalanced data, a novel method of semisupervised clustering is proposed to leverage the domain knowledge in combination with unsupervised learning to divide the data set into homogeneous categories and enhance the classification accuracy. The introduced approach performed better than well-known data science class imbalance treatment techniques. Furthermore, analysis of the results indicated that classification evaluation metrics struggled to assess practically the effectiveness of various methods. To address this, an economic indicator is proposed to rank the pipes for rehabilitation based on their cost and likelihood of failure (LoF). Preventive maintenance using the results of an economic indicator reduces the number of failures with a small fraction of the total replacement cost. Moreover, another indicator was developed to consider the consequence of the failures and LoF simultaneously. This indicator mitigates in a cost-effective manner the flow capacity reductions in WDNs caused by failures. The results of this study provide asset managers with a powerful tool to prioritize assets for rehabilitation.en_GB
dc.description.sponsorshipDatatecnics Corporation Limiteden_GB
dc.description.sponsorshipUKRIen_GB
dc.description.sponsorshipInnovate UKen_GB
dc.identifier.citationVol. 150 (2), article 04023078en_GB
dc.identifier.doihttps://doi.org/10.1061/JWRMD5.WRENG-6263
dc.identifier.grantnumber12418en_GB
dc.identifier.urihttp://hdl.handle.net/10871/134233
dc.language.isoenen_GB
dc.publisherAmerican Society of Civil Engineers (ASCE)en_GB
dc.rights© 2023 American Society of Civil Engineers
dc.subjectwater distribution networksen_GB
dc.subjectpipe failure predictionen_GB
dc.subjectsemi-supervised clusteringen_GB
dc.subjectclass imbalanceen_GB
dc.subjectmachine learningen_GB
dc.titleSemisupervised Clustering Approach for Pipe Failure Prediction with Imbalanced Data Seten_GB
dc.typeArticleen_GB
dc.date.available2023-10-13T09:14:54Z
dc.identifier.issn1943-5452
dc.descriptionThis is the author accepted manuscript. The final version is available from the American Society of Civil Engineers via the DOI in this recorden_GB
dc.descriptionData Availability Statement: Some or all data, models, or code generated or used during the study are proprietary or confidential in nature and may only be provided with restrictions. All case study data is owned by the utility company and is subject to a non-disclosure agreement (NDA), thereby limiting its availability for public dissemination. Requests for non-commercial usage of the scripts will be evaluated on a case-by-case basis.en_GB
dc.identifier.journalJournal of Water Resources Planning and Managementen_GB
dc.rights.urihttp://www.rioxx.net/licenses/all-rights-reserveden_GB
dcterms.dateAccepted2023-09-21
dcterms.dateSubmitted2023-05-26
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2023-09-21
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2023-10-13T05:53:42Z
refterms.versionFCDAM
refterms.dateFOA2024-01-31T13:40:46Z
refterms.panelBen_GB


Files in this item

This item appears in the following Collection(s)

Show simple item record