Show simple item record

dc.contributor.authorArciniegas-Alarcón, S
dc.contributor.authorGarcía-Peña, M
dc.contributor.authorKrzanowski, WJ
dc.contributor.authorRengifo, C
dc.date.accessioned2024-08-23T12:17:46Z
dc.date.issued2023-07-26
dc.date.updated2024-08-23T11:30:59Z
dc.description.abstractSome statistical analysis techniques may require complete data matrices, but a frequent problem in the construction of databases is the incomplete collection of information for different reasons. One option to tackle the problem is to estimate and impute the missing data. This paper describes a form of imputation that mixes regression with lower rank approximations. To improve the quality of the imputations, a generalisation is proposed that replaces the singular value decomposition (SVD) of the matrix with a regularised SVD in which the regularisation parameter is estimated by cross-validation. To evaluate the performance of the proposal, ten sets of real data from multienvironment trials were used. Missing values were created in each set at four percentages of missing not at random, and three criteria were then considered to investigate the effectiveness of the proposal. The results show that the regularised method proves very competitive when compared to the original method, beating it in several of the considered scenarios. As it is a very general system, its application can be extended to all multivariate data matrices. •The imputation method is modified through the inclusion of a stable and efficient computational algorithm that replaces the classical SVD least squares criterion by a penalised criterion. This penalty produces smoothed eigenvectors and eigenvalues that avoid overfitting problems, improving the performance of the method when the penalty is necessary. The size of the penalty can be determined by minimising one of the following criteria: the prediction errors, the Procrustes similarity statistic or the critical angles between subspaces of principal components.en_GB
dc.description.sponsorshipPontificia Universidad Javerianaen_GB
dc.description.sponsorshipUniversidad de La Sabanaen_GB
dc.format.extent102289-
dc.format.mediumElectronic-eCollection
dc.identifier.citationVol. 11, article 102289en_GB
dc.identifier.doihttps://doi.org/10.1016/j.mex.2023.102289
dc.identifier.grantnumber10756en_GB
dc.identifier.grantnumberING-309-2023en_GB
dc.identifier.urihttp://hdl.handle.net/10871/137247
dc.identifierORCID: 0000-0003-2772-8838 (Krzanowski, Wojtek J)
dc.identifierScopusID: 57198112101 | 7003599299 (Krzanowski, Wojtek J)
dc.language.isoenen_GB
dc.publisherElsevieren_GB
dc.relation.urlhttps://www.ncbi.nlm.nih.gov/pubmed/37560402en_GB
dc.rights© 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)en_GB
dc.subjectCross-validationen_GB
dc.subjectEigenvaluesen_GB
dc.subjectEigenvectorsen_GB
dc.subjectGabrielEigen imputation systemen_GB
dc.subjectGenotype-by-environment interactionen_GB
dc.subjectIterative computational schemeen_GB
dc.subjectOverfittingen_GB
dc.titleMissing value imputation in a data matrix using the regularised singular value decomposition.en_GB
dc.typeArticleen_GB
dc.date.available2024-08-23T12:17:46Z
exeter.article-number102289
exeter.place-of-publicationNetherlands
dc.descriptionThis is the final version. Available from Elsevier via the DOI in this record. en_GB
dc.descriptionData availability: Data will be made available on request.en_GB
dc.identifier.eissn2215-0161
dc.identifier.journalMethodsXen_GB
dc.relation.ispartofMethodsX, 11
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/ en_GB
dcterms.dateAccepted2023-07-16
dc.rights.licenseCC BY-NC-ND
rioxxterms.versionVoRen_GB
rioxxterms.licenseref.startdate2023-07-26
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2024-08-23T12:14:26Z
refterms.versionFCDVoR
refterms.dateFOA2024-08-23T12:17:55Z
refterms.panelBen_GB
refterms.dateFirstOnline2023-07-26


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/)
Except where otherwise noted, this item's licence is described as © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)