Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning

Tamosiunaite, Minija; Ainge, James A; Kulvicius, Tomas; Porr, Bernd; Dudchenko, Paul; Worgotter, Florentin

doi:10.1007/s10827-008-0094-6

Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/2807

Full metadata record

DC Field	Value	Language
dc.contributor.author	Tamosiunaite, Minija	en_UK
dc.contributor.author	Ainge, James A	en_UK
dc.contributor.author	Kulvicius, Tomas	en_UK
dc.contributor.author	Porr, Bernd	en_UK
dc.contributor.author	Dudchenko, Paul	en_UK
dc.contributor.author	Worgotter, Florentin	en_UK
dc.date.accessioned	2016-12-01T00:40:29Z	-
dc.date.available	2016-12-01T00:40:29Z	en_UK
dc.date.issued	2008-12	en_UK
dc.identifier.uri	http://hdl.handle.net/1893/2807	-
dc.description.abstract	A large body of experimental evidence suggests that the hippocampal place field system is involved in reward-based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm with the place-field map in a rat. The convergence properties of RL-algorithms are affected by the exploration patterns of the learner. Therefore, we first analyzed the path characteristics of freely exploring rats in a test arena. We found that straight path segments, with a mean length of 23 cm and a maximal length of up to 80 cm, take up a significant proportion of the total paths. Thus, rat paths are biased compared to random exploration. Next, we designed an RL system that reproduces these specific path characteristics. Our model arena is covered by overlapping, probabilistically firing place fields (PF) of realistic size and coverage. Because convergence of RL-algorithms is also influenced by the state space characteristics, different PF-sizes and densities, leading to different degrees of overlap, were also investigated. The model rat learns to find a reward opposite to its starting point. We observed that the combination of biased straight exploration, overlapping coverage and probabilistic firing strongly impairs the convergence of learning. When the degree of randomness in the exploration is increased, convergence improves, but the distribution of straight path segments becomes unrealistic and paths become ‘wiggly’. To mend this situation without affecting the path characteristics, two additional mechanisms are implemented: a gradual drop of the learned weights (weight decay) and path length limitation, which prevents learning if the reward is not found after some expected time. Both mechanisms limit the memory of the system and thereby counteract the effects of getting trapped on a wrong path.
When these strategies are used individually, the number of divergent cases is substantially reduced, and for some parameter settings no divergence was found at all. When weight decay and path length limitation are used at the same time, convergence is not much improved; instead, time to convergence increases as the memory-limiting effect becomes too strong. The degree of improvement also depends on the size and degree of overlap (coverage density) of the place field system. The chosen combination of these two parameters leads to a trade-off between convergence and speed of convergence. Thus, this study suggests that the role of the PF-system in navigation learning cannot be considered independently of the animals’ exploration pattern.	en_UK
dc.language.iso	en	en_UK
dc.publisher	Springer	en_UK
dc.relation	Tamosiunaite M, Ainge JA, Kulvicius T, Porr B, Dudchenko P & Worgotter F (2008) Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning. Journal of Computational Neuroscience, 25 (3), pp. 562-582. https://doi.org/10.1007/s10827-008-0094-6	en_UK
dc.rights	The publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author; you can only request a copy if you wish to use this work for your own research or private study.	en_UK
dc.rights.uri	http://www.rioxx.net/licenses/under-embargo-all-rights-reserved	en_UK
dc.subject	Reinforcement learning	en_UK
dc.subject	SARSA	en_UK
dc.subject	Place field system	en_UK
dc.subject	Function approximation	en_UK
dc.subject	Weight decay	en_UK
dc.subject	Animal navigation	en_UK
dc.subject	Hippocampus (Brain)	en_UK
dc.title	Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning	en_UK
dc.type	Journal Article	en_UK
dc.rights.embargodate	3000-01-01	en_UK
dc.identifier.doi	10.1007/s10827-008-0094-6	en_UK
dc.citation.jtitle	Journal of Computational Neuroscience	en_UK
dc.citation.issn	1573-6873	en_UK
dc.citation.issn	0929-5313	en_UK
dc.citation.volume	25	en_UK
dc.citation.issue	3	en_UK
dc.citation.spage	562	en_UK
dc.citation.epage	582	en_UK
dc.citation.publicationstatus	Published	en_UK
dc.citation.peerreviewed	Refereed	en_UK
dc.type.status	VoR - Version of Record	en_UK
dc.author.email	p.a.dudchenko@stir.ac.uk	en_UK
dc.contributor.affiliation	University of Stirling	en_UK
dc.contributor.affiliation	University of Stirling	en_UK
dc.contributor.affiliation	Vytautas Magnus University	en_UK
dc.contributor.affiliation	University of Glasgow	en_UK
dc.contributor.affiliation	Psychology	en_UK
dc.contributor.affiliation	University of Stirling	en_UK
dc.identifier.isi	WOS:000259438100009	en_UK
dc.identifier.scopusid	2-s2.0-53149129441	en_UK
dc.identifier.wtid	811177	en_UK
dc.contributor.orcid	0000-0002-1531-5713	en_UK
dcterms.dateAccepted	2008-12-31	en_UK
dc.date.filedepositdate	2011-03-16	en_UK
rioxxterms.type	Journal Article/Review	en_UK
rioxxterms.version	VoR	en_UK
local.rioxx.author	Tamosiunaite, Minija|	en_UK
local.rioxx.author	Ainge, James A|	en_UK
local.rioxx.author	Kulvicius, Tomas|	en_UK
local.rioxx.author	Porr, Bernd|	en_UK
local.rioxx.author	Dudchenko, Paul|0000-0002-1531-5713	en_UK
local.rioxx.author	Worgotter, Florentin|	en_UK
local.rioxx.project	Internal Project|University of Stirling|https://isni.org/isni/0000000122484331	en_UK
local.rioxx.freetoreaddate	3000-01-01	en_UK
local.rioxx.licence	http://www.rioxx.net/licenses/under-embargo-all-rights-reserved||	en_UK
local.rioxx.filename	dudchenko_2008.pdf	en_UK
local.rioxx.filecount	1	en_UK
local.rioxx.source	0929-5313	en_UK
Appears in Collections:	Psychology Journal Articles
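
The abstract in the record above names two memory-limiting mechanisms, weight decay and path length limitation, applied to SARSA with function approximation over place fields. The Python sketch below illustrates one way such mechanisms could look; it is not the authors' implementation. All function names, parameters and values are hypothetical, the place fields are deterministic Gaussians (the paper's fields fire probabilistically), and "prevents learning" is assumed to mean that the weight updates of an over-long trial are skipped.

```python
import math


def place_field_activity(pos, centers, width):
    """Gaussian activity of each place field at position pos = (x, y).

    Assumption: deterministic Gaussian tuning, a simplification of the
    probabilistically firing place fields described in the abstract.
    """
    return [
        math.exp(-((pos[0] - cx) ** 2 + (pos[1] - cy) ** 2) / (2.0 * width ** 2))
        for cx, cy in centers
    ]


def q_value(weights, features, action):
    """Linear action value: weighted sum of place-field activities."""
    return sum(w * f for w, f in zip(weights[action], features))


def learn_from_trial(weights, transitions, alpha, gamma, max_steps):
    """Apply SARSA updates for one trial, unless the path was too long.

    transitions: list of (features, action, reward, next_features, next_action);
    next_features is None on the terminal (rewarded) step.
    Returns True if learning took place, False if the trial was skipped.
    """
    # Path length limitation (assumed form): no learning from over-long paths.
    if len(transitions) > max_steps:
        return False
    for feats, action, reward, next_feats, next_action in transitions:
        target = reward
        if next_feats is not None:
            target += gamma * q_value(weights, next_feats, next_action)
        delta = target - q_value(weights, feats, action)  # SARSA TD error
        for i, f in enumerate(feats):
            weights[action][i] += alpha * delta * f
    return True


def decay_weights(weights, decay):
    """Weight decay: shrink every learned weight by the fraction `decay`."""
    for row in weights:
        for i in range(len(row)):
            row[i] *= 1.0 - decay


# Usage: two place fields, two actions, one rewarded single-step trial.
weights = [[0.0, 0.0], [0.0, 0.0]]
trial = [([1.0, 0.0], 0, 1.0, None, None)]
learned = learn_from_trial(weights, trial, alpha=0.5, gamma=0.9, max_steps=5)
decay_weights(weights, decay=0.1)
```

Keeping the two mechanisms in separate functions lets either be switched on alone, which mirrors the abstract's comparison of using the strategies individually and together.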

Files in This Item:

File	Description	Size	Format	Availability
dudchenko_2008.pdf	Fulltext - Published Version	873.19 kB	Adobe PDF	Under Embargo until 3000-01-01 Request a copy

This item is protected by original copyright

Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.

STORRE: Stirling Online Research Repository