Show simple item record

dc.contributor.authorMesnage, C
dc.contributor.authorWang, X
dc.contributor.authorDong, H
dc.contributor.authorAishwaryaprajna
dc.date.accessioned2024-09-12T12:39:15Z
dc.date.issued2024-10-20
dc.date.updated2024-09-12T11:24:46Z
dc.description.abstractWe present an initial automated test to evaluate the LLMs’ capacity to perform inductive reasoning tasks. We use the GPT-3.5/4 models to create a system which generates Python code as hypotheses for inductive reasoning to transform sequences of the One Dimensional Abstract Reasoning Corpus (1D-ARC) challenge. We experiment with 3 prompting techniques, namely standard prompting, Chain of Thought (CoT) and direct feedback. We provide results and an analysis of cost to success rate and benefit-cost ratio. Our best result is an overall 25% success rate with our CoT prompting on GPT-4, significantly surpass- ing the standard prompting approach. We discuss potential avenues to improve our experiments and test other strategies.en_GB
dc.identifier.citationHYDRA 2024: 3rd International Workshop on HYbrid Models for Coupling Deductive and Inductive ReAsoning at ECAI 2024, Santiago de Compostela, Spain, 20 October 2024en_GB
dc.identifier.urihttp://hdl.handle.net/10871/137421
dc.identifierORCID: 0000-0002-2004-6378 (Mesnage, Cedric)
dc.language.isoenen_GB
dc.publisherInternational Workshop on HYbrid models for coupling Deductive and inductive ReAsoning (HYDRA)en_GB
dc.relation.urlhttps://sites.google.com/unical.it/hydra-2024/en_GB
dc.rights© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
dc.titleEvaluating Inductive Reasoning Capabilities of Large Language Models With The One Dimensional Abstract Reasoning Corpusen_GB
dc.typeConference paperen_GB
dc.date.available2024-09-12T12:39:15Z
exeter.locationSantiago de Compostella, colocated with ECAI
dc.descriptionThis is the author accepted manuscript.en_GB
dc.descriptionThe workshop is co-located with the 27th European Conference on Artificial Intelligence (ECAI 2024)en_GB
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_GB
dcterms.dateAccepted2024-08-08
dcterms.dateSubmitted2024-06-22
rioxxterms.versionAMen_GB
rioxxterms.licenseref.startdate2024-08-08
rioxxterms.typeConference Paper/Proceeding/Abstracten_GB
refterms.dateFCD2024-09-12T12:36:48Z
refterms.versionFCDAM
refterms.dateFOA2024-10-22T15:32:54Z
refterms.panelBen_GB
pubs.name-of-conferenceHYDRA 2024: 3rd International Workshop on HYbrid Models for Coupling Deductive and Inductive ReAsoning
exeter.rights-retention-statementNo


Files in this item

This item appears in the following Collection(s)

Show simple item record

© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
Except where otherwise noted, this item's licence is described as © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).