Evaluating Inductive Reasoning Capabilities of Large Language Models With The One Dimensional Abstract Reasoning Corpus
dc.contributor.author | Mesnage, C | |
dc.contributor.author | Wang, X | |
dc.contributor.author | Dong, H | |
dc.contributor.author | Aishwaryaprajna | |
dc.date.accessioned | 2024-09-12T12:39:15Z | |
dc.date.issued | 2024-10-20 | |
dc.date.updated | 2024-09-12T11:24:46Z | |
dc.description.abstract | We present an initial automated test to evaluate the LLMs’ capacity to perform inductive reasoning tasks. We use the GPT-3.5/4 models to create a system which generates Python code as hypotheses for inductive reasoning to transform sequences of the One Dimensional Abstract Reasoning Corpus (1D-ARC) challenge. We experiment with 3 prompting techniques, namely standard prompting, Chain of Thought (CoT) and direct feedback. We provide results and an analysis of cost to success rate and benefit-cost ratio. Our best result is an overall 25% success rate with our CoT prompting on GPT-4, significantly surpassing the standard prompting approach. We discuss potential avenues to improve our experiments and test other strategies. | en_GB |
dc.identifier.citation | HYDRA 2024: 3rd International Workshop on HYbrid Models for Coupling Deductive and Inductive ReAsoning at ECAI 2024, Santiago de Compostela, Spain, 20 October 2024 | en_GB |
dc.identifier.uri | http://hdl.handle.net/10871/137421 | |
dc.identifier | ORCID: 0000-0002-2004-6378 (Mesnage, Cedric) | |
dc.language.iso | en | en_GB |
dc.publisher | International Workshop on HYbrid models for coupling Deductive and inductive ReAsoning (HYDRA) | en_GB |
dc.relation.url | https://sites.google.com/unical.it/hydra-2024/ | en_GB |
dc.rights | © 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0). | |
dc.title | Evaluating Inductive Reasoning Capabilities of Large Language Models With The One Dimensional Abstract Reasoning Corpus | en_GB |
dc.type | Conference paper | en_GB |
dc.date.available | 2024-09-12T12:39:15Z | |
exeter.location | Santiago de Compostela, co-located with ECAI | |
dc.description | This is the author accepted manuscript. | en_GB |
dc.description | The workshop is co-located with the 27th European Conference on Artificial Intelligence (ECAI 2024) | en_GB |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_GB |
dcterms.dateAccepted | 2024-08-08 | |
dcterms.dateSubmitted | 2024-06-22 | |
rioxxterms.version | AM | en_GB |
rioxxterms.licenseref.startdate | 2024-08-08 | |
rioxxterms.type | Conference Paper/Proceeding/Abstract | en_GB |
refterms.dateFCD | 2024-09-12T12:36:48Z | |
refterms.versionFCD | AM | |
refterms.dateFOA | 2024-10-22T15:32:54Z | |
refterms.panel | B | en_GB |
pubs.name-of-conference | HYDRA 2024: 3rd International Workshop on HYbrid Models for Coupling Deductive and Inductive ReAsoning | |
exeter.rights-retention-statement | No |