dc.description.abstract | Recurrent neural networks (RNNs) are computational models inspired by the brain. Although RNNs stand out as state-of-the-art machine learning models for challenging tasks such as speech recognition, handwriting recognition, and language translation, they are plagued by the so-called vanishing/exploding gradient problem, which prevents them from learning long-term dependencies in sequential data. Moreover, these models suffer from a problem of interpretability, known as the "black-box issue" of RNNs. We attempt to open the black box by developing a mechanistic interpretation of errors occurring during computation. We do this from a dynamical systems theory perspective, specifically building on the notion of Excitable Network Attractors. Our methodology is effective at least for those tasks in which a number of attractors, and a switching pattern between them, must be learned. RNNs can be seen as high-dimensional nonlinear dynamical systems driven by external inputs. When RNNs are investigated analytically, the literature often neglects the input-driven property, or drops it in favour of tight constraints on the input driving the dynamics which do not match the reality of RNN applications. To bridge this gap, we frame RNN dynamics driven by generic input sequences in the context of nonautonomous dynamical systems theory. This leads us to enquire deeply into a fundamental principle established for RNNs, known as the echo state property (ESP). In particular, we argue that input-driven RNNs can be reliable computational models even without satisfying the classical ESP formulation. We prove a form of input-driven fixed point theorem and exploit it to (i) demonstrate the existence and uniqueness of a globally attracting solution for RNNs driven by inputs of sufficiently large amplitude, (ii) deduce the existence of multiple responses to certain input signals, which can be reliably exploited for computational purposes, and (iii) study the stability of attracting solutions with respect to input sequences. Finally, we highlight the active role of the input in determining qualitative changes in RNN dynamics, e.g. the number of stable responses, in contrast to the commonly studied qualitative changes due to variations of model parameters. | en_GB |