Show simple item record

dc.contributor.authorDe Peuter, S
dc.contributor.authorZhu, S
dc.contributor.authorGuo, Y
dc.contributor.authorHowes, A
dc.contributor.authorKaski, S
dc.date.accessioned2024-12-16T14:20:14Z
dc.date.issued2024-12-11
dc.date.updated2024-12-16T13:42:59Z
dc.description.abstractPreference learning methods make use of models of human choice in order to infer the latent utilities that underlie human behaviour. However, accurate modeling of human choice behavior is challenging due to a range of context effects that arise from how humans contrast and evaluate options. Cognitive science has proposed several models that capture these intricacies but, due to their intractable nature, work on preference learning has, in practice, had to rely on tractable but simplified variants of the well-known Bradley-Terry model. In this paper, we take one state-of-the-art intractable cognitive model and propose a tractable surrogate that is suitable for deployment in preference learning. We then introduce a mechanism for fitting the surrogate to human data and it extend it to account for data that cannot be explained by the original cognitive model. We demonstrate on large-scale human data that this model produces significantly better inferences on static and actively elicited data than existing Bradley-Terry variants. We further show in simulation that when using this model for preference learning, we can significantly improve a utility in a range of real-world tasks.en_GB
dc.description.sponsorshipResearch Council of Finlanden_GB
dc.description.sponsorshipFinnish Center for Artificial Intelligence (FCAI)en_GB
dc.description.sponsorshipUKRI Turing AI World-Leading Researcher Fellowshipen_GB
dc.identifier.citationNeurIPS 2024 - The Thirty-Eighth Annual Conference on Neural Information Processing Systems, 10 - 15 December 2024, Vancouver, Canadaen_GB
dc.identifier.grantnumber345604en_GB
dc.identifier.grantnumber341763en_GB
dc.identifier.grantnumber359207en_GB
dc.identifier.grantnumberEP/W002973/1en_GB
dc.identifier.urihttp://hdl.handle.net/10871/139382
dc.language.isoenen_GB
dc.publisherNeurIPSen_GB
dc.relation.urlhttps://neurips.cc/virtual/2024/poster/93675en_GB
dc.rights© 2024 The author(s)en_GB
dc.titlePreference learning of latent decision utilities with a human-like model of preferential choiceen_GB
dc.typeConference paperen_GB
dc.date.available2024-12-16T14:20:14Z
exeter.locationVancouver, Canada
dc.contributorHowes, A
dc.descriptionThis is the author accepted manuscript. The final version is available from NeurIPS via the link in this recorden_GB
dc.rights.urihttp://www.rioxx.net/licenses/all-rights-reserveden_GB
dcterms.dateSubmitted2024-08-01
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2024-12-11
rioxxterms.typeConference Paper/Proceeding/Abstracten_GB
refterms.dateFCD2024-12-16T13:43:11Z
refterms.versionFCDAM
refterms.dateFOA2024-12-16T14:20:15Z
refterms.panelBen_GB
pubs.name-of-conferenceAdvances in Neural Information Processing Systems
exeter.rights-retention-statementNo


Files in this item

This item appears in the following Collection(s)

Show simple item record