| 
View
 

Alergic talks Wed 25 June 2008

Page history last edited by PBworks 16 years, 7 months ago

Alergic Wed 25 June 2008 4:30pm

In ARUN-1B -- NB different from usual room

Unique chance to preview two talks due to be presented at SAB 2008 in Japan

========================

Paul Chorley presents:

 

Closing the Sensory-Motor Loop on Dopamine Signalled Reinforcement Learning

 

Paul Chorley and Anil K Seth

 

It has been shown recently that dopamine signalled modulation of spike timing-dependent synaptic plasticity (DA-STDP) can enable reinforcement learning of delayed stimulus-reward associations when both stimulus and reward are delivered at precisely timed intervals. Here, we test whether a similar model can support learning in an embodied context, in which timing of both sensory input and delivery of reward depend on the agent’s behaviour. We show that effective reinforcement learning is indeed possible, but only when stimuli are gated so as to occur as near-synchronous patterns of neural activity and when neuroanatomical constraints are imposed which predispose agents to exploratative behaviours. Extinction of learned responses in this model is subsequently shown to result from agent-environment interactions and not directly from any specific neural mechanism.

 

========================

Peter Fine presents:

 

Monostable controllers for adaptive behaviour

 

Christopher L. Buckley, Peter Fine, Seth Bullock and Ezequiel Di Paolo

 

Recent artificial neural networks for machine learning have exploited transient dynamics around globally stable attractors (i.e. networks possesing not more than one attractor), inspired by the properties of cortical microcolumns. Here we explore whether similarly constrained neural network controllers can be exploited for embodied, situated adaptive behaviour. We demonstrate that it is possible to evolve globally stable solutions that exhibit multiple modes of behaviour by exploiting interaction between environmental input and transient dynamics in a single basin of attraction. We present results that suggest that this globally stable regime may constitute an evolvable and dynamically rich subset of recurrent neural network configurations, especially in larger networks. We discuss the issue of scalability and the possibility that there may be alternative adaptive behaviour tasks that are more ‘attractor hungry’.

Comments (0)

You don't have permission to comment on this page.