Show simple item record

dc.contributor.authorBessudnov, A
dc.contributor.authorTarasov, D
dc.contributor.authorPanasovets, V
dc.contributor.authorKostenko, V
dc.contributor.authorSmirnov, I
dc.contributor.authorUspenskiy, V
dc.date.accessioned2023-04-04T13:26:51Z
dc.date.issued2023-04-04
dc.date.updated2023-04-04T12:28:20Z
dc.description.abstractIn this paper, we develop a machine learning classifier that predicts perceived ethnicity from data on personal names for major ethnic groups populating Russia. We collect data from VK, the largest Russian social media website. Ethnicity was coded from languages spoken by users and their geographical location, with the data manually cleaned by crowd workers. The classifier shows the accuracy of 0.82 for a scheme with 24 ethnic groups and 0.92 for 15 aggregated ethnic groups. It can be used for research on ethnicity and ethnic relations in Russia, with the data sets that have personal names but not ethnicity.en_GB
dc.identifier.citationPublished online 4 April 2023en_GB
dc.identifier.doihttps://doi.org/10.1007/s42001-023-00205-y
dc.identifier.urihttp://hdl.handle.net/10871/132840
dc.identifierORCID: 0000-0002-2541-9794 (Bessudnov, Aleksei)
dc.language.isoenen_GB
dc.publisherSpringeren_GB
dc.relation.urlhttps://github.com/abessudnov/ruEthnicNamesPublicen_GB
dc.rights© The Author(s) 2023. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/en_GB
dc.subjectethnicityen_GB
dc.subjectRussiaen_GB
dc.subjectmachine learningen_GB
dc.subjectpredictionen_GB
dc.subjectpersonal namesen_GB
dc.titlePredicting perceived ethnicity with data on personal names in Russiaen_GB
dc.typeArticleen_GB
dc.date.available2023-04-04T13:26:51Z
dc.descriptionThis is the final version. Available on open access from Springer via the DOI in this recorden_GB
dc.descriptionData availability statement: The research data supporting this publication and the Python code are openly available from Github at: https://github.com/abessudnov/ruEthnicNamesPublicen_GB
dc.identifier.eissn2432-2725
dc.identifier.journalJournal of Computational Social Scienceen_GB
dc.relation.ispartofJournal of Computational Social Science
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_GB
dcterms.dateAccepted2023-03-13
dcterms.dateSubmitted2022-12-20
rioxxterms.versionVoRen_GB
rioxxterms.licenseref.startdate2023-04-04
rioxxterms.typeJournal Article/Reviewen_GB
refterms.dateFCD2023-04-04T12:28:22Z
refterms.versionFCDAM
refterms.dateFOA2023-04-04T13:26:52Z
refterms.panelCen_GB


Files in this item

This item appears in the following Collection(s)

Show simple item record

© The Author(s) 2023. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/
Except where otherwise noted, this item's licence is described as © The Author(s) 2023. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/