dc.contributor.author | Mesnage, CS | |
dc.date.accessioned | 2024-07-23T14:59:59Z | |
dc.date.issued | 2024-07-17 | |
dc.date.updated | 2024-07-23T13:19:16Z | |
dc.description.abstract | We propose a novel architecture to build an Artificial General Intelligence (AGI) in a virtual environment. To experiment with curiosity, we use as the reward in a reinforcement learning (RL) algorithm the cosine similarity between recent thoughts and past thoughts, expressed as sentences given by a large language model (LLM). Using the Bellman equation, the agent decides how to act as a standard agent does, choosing among moving, jumping, performing a task, observing and thinking. Observing and thinking is the process of modifying its inner dialogue by giving a representation of the environment to an LLM and reflecting on its past thoughts, which consequently changes its predicted Q values and decision making. We have developed an experimental intelligent agent which interacts with the open source Minetest video game as a virtual environment. | en_GB |
dc.format.extent | 130-133 | |
dc.identifier.citation | In: Artificial General Intelligence: 17th International Conference (AGI 2024), 13–16 August 2024, Seattle, USA, pp. 130–133. Lecture Notes in Computer Science, volume 14951 | en_GB |
dc.identifier.doi | https://doi.org/10.1007/978-3-031-65572-2_14 | |
dc.identifier.uri | http://hdl.handle.net/10871/136845 | |
dc.language.iso | en | en_GB |
dc.publisher | Springer | en_GB |
dc.rights.embargoreason | Under embargo until 17 July 2025 in compliance with publisher policy | en_GB |
dc.rights | © 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG | en_GB |
dc.subject | AGI architecture | en_GB |
dc.subject | RL | en_GB |
dc.subject | Virtual Environment | en_GB |
dc.subject | LLM | en_GB |
dc.title | Thinking as an Action | en_GB |
dc.type | Conference paper | en_GB |
dc.date.available | 2024-07-23T14:59:59Z | |
dc.identifier.isbn | 9783031655715 | |
exeter.location | Seattle | |
dc.description | This is the author accepted manuscript. The final version is available from Springer via the DOI in this record | en_GB |
dc.identifier.eissn | 1611-3349 | |
dc.rights.uri | http://www.rioxx.net/licenses/all-rights-reserved | en_GB |
dcterms.dateSubmitted | 2024-04-26 | |
rioxxterms.version | AM | en_GB |
rioxxterms.licenseref.startdate | 2024-06-17 | |
rioxxterms.type | Conference Paper/Proceeding/Abstract | en_GB |
refterms.dateFCD | 2024-07-23T13:19:24Z | |
refterms.versionFCD | VoR | |
refterms.panel | B | en_GB |
refterms.dateFirstOnline | 2024-07-17 | |
pubs.name-of-conference | International Conference on Artificial General Intelligence | |
exeter.rights-retention-statement | No | |