Skip to main content
Log in

Real-time sensory–motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization

  • Original Article
  • Published:
Biological Cybernetics Aims and scope Submit manuscript

Abstract

An open problem in the cognitive dimensions of navigation concerns how previous exploratory experience is reorganized in order to allow the creation of novel efficient navigation trajectories. This behavior is revealed in the “traveling salesrat problem” (TSP) when rats discover the shortest path linking baited food wells after a few exploratory traversals. We have recently published a model of navigation sequence learning, where sharp wave ripple replay of hippocampal place cells transmit “snippets” of the recent trajectories that the animal has explored to the prefrontal cortex (PFC) (Cazin et al. in PLoS Comput Biol 15:e1006624, 2019). PFC is modeled as a recurrent reservoir network that is able to assemble these snippets into the efficient sequence (trajectory of spatial locations coded by place cell activation). The model of hippocampal replay generates a distribution of snippets as a function of their proximity to a reward, thus implementing a form of spatial credit assignment that solves the TSP task. The integrative PFC reservoir reconstructs the efficient TSP sequence based on exposure to this distribution of snippets that favors paths that are most proximal to rewards. While this demonstrates the theoretical feasibility of the PFC–HIPP interaction, the integration of such a dynamic system into a real-time sensory–motor system remains a challenge. In the current research, we test the hypothesis that the PFC reservoir model can operate in a real-time sensory–motor loop. Thus, the main goal of the paper is to validate the model in simulated and real robot scenarios. Place cell activation encoding the current position of the simulated and physical rat robot feeds the PFC reservoir which generates the successor place cell activation that represents the next step in the reproduced sequence in the readout. This is input to the robot, which advances to the coded location and then generates de novo the current place cell activation. This allows demonstration of the crucial role of embodiment. If the spatial code readout from PFC is played back directly into PFC, error can accumulate, and the system can diverge from desired trajectories. This required a spatial filter to decode the PFC code to a location and then recode a new place cell code for that location. In the robot, the place cell vector output of PFC is used to physically displace the robot and then generate a new place cell coded input to the PFC, replacing part of the software recoding procedure that was required otherwise. We demonstrate how this integrated sensory–motor system can learn simple navigation sequences and then, importantly, how it can synthesize novel efficient sequences based on prior experience, as previously demonstrated (Cazin et al. 2019). This contributes to the understanding of hippocampal replay in novel navigation sequence formation and the important role of embodiment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

(From Cazin et al. 2019 with permission)

Fig. 2
Fig. 3

From Cazin et al. (2019) with permission

Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Notes

  1. https://medium.com/@rosbots/ready-to-use-image-raspbian-stretch-ros-opencv-324d6f8dcd96.

  2. https://github.com/RoboCup-SSL/ssl-vision/wiki.

References

  • Ambrose RE, Pfeiffer BE, Foster DJ (2016) Reverse replay of hippocampal place cells is uniquely modulated by changing reward. Neuron 91:1124–1136

    CAS  PubMed  PubMed Central  Google Scholar 

  • Andrychowicz M, Wolski F, Ray A, Schneider J, Fong R, et al (2017) Advances in neural information processing systems, pp 5048–5058

  • Arleo A, Gerstner W (2000) Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity. Biol Cybern 83:287–299

    CAS  PubMed  Google Scholar 

  • Barrera A, Weitzenfeld A (2008) Biologically-inspired robot spatial cognition based on rat neurophysiological studies. Auton Robots 25:147–169

    Google Scholar 

  • Barrera A, Cáceres A, Weitzenfeld A, Ramirez-Amaya V (2011) Comparative experimental studies on spatial memory and learning in rats and robots. J Intell Rob Syst 63:361–397

    Google Scholar 

  • Barrera A, Tejera G, Llofriu M, Weitzenfeld A (2015) Learning spatial localization: from rat studies to computational models of the Hippocampus. Spat Cognit Comput 15:27–59

    Google Scholar 

  • Bendor D, Wilson MA (2012) Biasing the content of hippocampal replay during sleep. Nat Neurosci 15:1439–1444

    CAS  PubMed  PubMed Central  Google Scholar 

  • Brown MA, Sharp PE (1995) Simulation of spatial learning in the Morris water maze by a neural network model of the hippocampal formation and nucleus accumbens. Hippocampus 5:171–188

    CAS  PubMed  Google Scholar 

  • Burgess N, Recce M, O’Keefe J (1994) A model of hippocampal function. Neural Netw 7:1065–1081

    Google Scholar 

  • Burgess N, Donnett JG, Jeffery KJ, O–keefe J (1997) Robotic and neuronal simulation of the hippocampus and rat navigation. Philos Trans R Soc Lond Ser B Biol Sci 352:1535–1543

    CAS  Google Scholar 

  • Buzsáki G (1989) Two-stage model of memory trace formation: a role for “noisy” brain states. Neuroscience 31:551–570

    PubMed  Google Scholar 

  • Caluwaerts K, Staffa M, N’Guyen S, Grand C, Dollé L et al (2012) A biologically inspired meta-control navigation system for the psikharpax rat robot. Bioinspir Biomim 7:025009

    CAS  PubMed  Google Scholar 

  • Carr MF, Jadhav SP, Frank LM (2011) Hippocampal replay in the awake state: a potential substrate for memory consolidation and retrieval. Nat Neurosci 14:147–153

    CAS  PubMed  PubMed Central  Google Scholar 

  • Cazé R, Khamassi M, Aubin L, Girard B (2018) Hippocampal replays under the scrutiny of reinforcement learning models. J Neurophysiol 120:2877–2896

    PubMed  Google Scholar 

  • Cazin N, Llofriu Alonso M, Scleidorovich Chiodi P, Pelc T, Harland B et al (2019) Reservoir computing model of prefrontal cortex creates novel combinations of previous navigation sequences from hippocampal place-cell replay with spatial reward propagation. PLoS Comput Biol 15:e1006624

    CAS  PubMed  PubMed Central  Google Scholar 

  • Davidson TJ, Kloosterman F, Wilson MA (2009) Hippocampal replay of extended experience. Neuron 63:497–507

    CAS  PubMed  PubMed Central  Google Scholar 

  • de Jong LW, Gereke B, Martin GM, Fellous J-M (2011) The traveling salesrat: insights into the dynamics of efficient spatial navigation in the rodent. J Neural Eng 8:065010

    PubMed  Google Scholar 

  • De Lavilléon G, Lacroix MM, Rondi-Reig L, Benchenane K (2015) Explicit memory creation during sleep demonstrates a causal role of place cells in navigation. Nat Neurosci 18:493

    PubMed  Google Scholar 

  • Diba K, Buzsaki G (2007) Forward and reverse hippocampal place-cell sequences during ripples. Nat Neurosci 10:1241

    CAS  PubMed  PubMed Central  Google Scholar 

  • Dollé L, Sheynikhovich D, Girard B, Chavarriaga R, Guillot A (2010) Path planning versus cue responding: a bio-inspired model of switching between navigation strategies. Biol Cybern 103:299–317

    PubMed  Google Scholar 

  • Dominey PF (1995) Complex sensory–motor sequence learning based on recurrent state representation and reinforcement learning. Biol Cybern 73:265–274

    CAS  PubMed  Google Scholar 

  • Dominey PF (1998a) Influences of temporal organization on sequence learning and transfer: comments on Stadler (1995) and Curran and Keele (1993). J Exp Psychol Learn Mem Cogn 24:14

    Google Scholar 

  • Dominey PF (1998b) A shared system for learning serial and temporal structure of sensori-motor sequences? Evidence from simulation and human experiments. Brain Res Cogn Brain Res 6:163–172

    CAS  PubMed  Google Scholar 

  • Dominey PF, Ramus F (2000) Neural network processing of natural language: I. Sensitivity to serial, temporal and abstract structure of language in the infant. Lang Cognit Process 15:40

    Google Scholar 

  • Dominey PF, Arbib MA, Joseph JP (1995) A model of corticostriatal plasticity for learning oculomotor associations and sequences. J Cogn Neurosci 7:25

    Google Scholar 

  • Dominey PF, Inui T, Hoen M (2009) Neural network processing of natural language: II. Towards a unified model of corticostriatal function in learning sentence comprehension and non-linguistic sequencing. Brain Lang 109:80–92

    PubMed  Google Scholar 

  • Euston DR, Tatsuno M, McNaughton BL (2007) Fast-forward playback of recent memory sequences in prefrontal cortex during sleep. Science 318:1147–1150

    CAS  PubMed  Google Scholar 

  • Foster DJ, Wilson MA (2006) Reverse replay of behavioural sequences in hippocampal place cells during the awake state. Nature 440:680–683

    CAS  PubMed  Google Scholar 

  • Gaussier P, Revel A, Banquet J-P, Babeau V (2002) From view cells and place cells to cognitive map learning: processing stages of the hippocampal system. Biol Cybern 86:15–28

    CAS  PubMed  Google Scholar 

  • Gaussier P, Banquet J, Sargolini F, Giovannangeli C, Save E, Poucet B (2007) A model of grid cells involving extra hippocampal path integration, and the hippocampal loop. J Integr Neurosci 6:447–476

    CAS  PubMed  Google Scholar 

  • Guazzelli A, Bota M, Corbacho FJ, Arbib MA (1998) Affordances. Motivations, and the world graph theory. Adapt Behav 6:435–471

    Google Scholar 

  • Gupta AS, van der Meer MA, Touretzky DS, Redish AD (2010) Hippocampal replay is not a simple function of experience. Neuron 65:695–705

    CAS  PubMed  PubMed Central  Google Scholar 

  • Hasselmo ME (2008) Temporally structured replay of neural activity in a model of entorhinal cortex, hippocampus and postsubiculum. Eur J Neurosci 28:1301–1315

    PubMed  PubMed Central  Google Scholar 

  • Hinaut X, Dominey PF (2013) Real-time parallel processing of grammatical structure in the fronto-striatal system: a recurrent network simulation study using reservoir computing. PLoS ONE 8:1–18

    Google Scholar 

  • Hoffman KL, McNaughton BL (2002) Coordinated reactivation of distributed memory traces in primate neocortex. Science 297:2070–2073

    CAS  PubMed  Google Scholar 

  • Jaeger H, Haas H (2004) Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Science 304:78–80

    CAS  PubMed  Google Scholar 

  • Jaeger H, Lukosevicius M, Popovici D, Siewert U (2007) Optimization and applications of echo state networks with leaky-integrator neurons. Neural Netw 20:335–352

    PubMed  Google Scholar 

  • Ji D, Wilson MA (2007) Coordinated memory replay in the visual cortex and hippocampus during sleep. Nat Neurosci 10:100–107

    CAS  PubMed  Google Scholar 

  • Johnson A, Redish AD (2005) Hippocampal replay contributes to within session learning in a temporal difference reinforcement learning model. Neural Netw 18:1163–1171

    PubMed  Google Scholar 

  • Lansink CS, Goltstein PM, Lankelma JV, Joosten RN, McNaughton BL, Pennartz CM (2008) Preferential reactivation of motivationally relevant information in the ventral striatum. J Neurosci 28:6372–6382

    CAS  PubMed  PubMed Central  Google Scholar 

  • Llofriu M, Tejera G, Contreras M, Pelc T, Fellous J-M, Weitzenfeld A (2015) Goal-oriented robot navigation learning using a multi-scale space representation. Neural Netw 72:62–74

    CAS  PubMed  Google Scholar 

  • Lukosevicius M (2012) A practical guide to applying echo state networks. In: Montavon G, Orr GB, Müller KR (eds) Neural networks: tricks of the trade. Springer, Berlin, pp 659–686

    Google Scholar 

  • Lukosevicius M, Jaeger H (2009) Reservoir computing approaches to recurrent neural network training. Comput Sci Rev 3:22

    Google Scholar 

  • Maass W, Natschlager T, Markram H (2002) Real-time computing without stable states: a new framework for neural computation based on perturbations. Neural Comput 14:2531–2560

    PubMed  Google Scholar 

  • McClelland JL, McNaughton BL, O’Reilly RC (1995) Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. Psychol Rev 102:419–457

    PubMed  Google Scholar 

  • Moser EI, Moser M-B, McNaughton BL (2017) Spatial representation in the hippocampal formation: a history. Nat Neurosci 20:1448

    CAS  PubMed  Google Scholar 

  • Nadel L, Moscovitch M (1997) Memory consolidation, retrograde amnesia and the hippocampal complex. Curr Opin Neurobiol 7:217–227

    CAS  PubMed  Google Scholar 

  • Nikolic D, Hausler S, Singer W, Maass W (2009) Distributed fading memory for stimulus properties in the primary visual cortex. PLoS Biol 7:e1000260

    PubMed  PubMed Central  Google Scholar 

  • Peyrache A, Khamassi M, Benchenane K, Wiener SI, Battaglia FP (2009) Replay of rule-learning related neural patterns in the prefrontal cortex during sleep. Nat Neurosci 12:919–926

    CAS  PubMed  Google Scholar 

  • Pfeifer R, Lungarella M, Iida F (2007) Self-organization, embodiment, and biologically inspired robotics. Science 318:1088–1093

    CAS  PubMed  Google Scholar 

  • Pfeiffer BE, Foster DJ (2013) Hippocampal place-cell sequences depict future paths to remembered goals. Nature 497:74

    CAS  PubMed  PubMed Central  Google Scholar 

  • Redish AD, Touretzky DS (1997) Cognitive maps beyond the hippocampus. Hippocampus 7:15–35

    CAS  PubMed  Google Scholar 

  • Rigotti M, Barak O, Warden MR, Wang X-J, Daw ND et al (2013) The importance of mixed selectivity in complex cognitive tasks. Nature 497:585–590

    CAS  PubMed  PubMed Central  Google Scholar 

  • Ruder S. 2016. An overview of gradient descent optimization algorithms. arXiv preprint arXiv:1609.04747

  • Singer AC, Frank LM (2009) Rewarded outcomes enhance reactivation of experience in the Hippocampus. Neuron 64:910–921

    CAS  PubMed  PubMed Central  Google Scholar 

  • Tejera G, Llofriu M, Barrera A, Weitzenfeld A (2018) Bio-inspired robotics: a spatial cognition model integrating place cells, grid cells and head direction cells. J Intell Rob Syst 91:85–99

    Google Scholar 

  • Widrow B, Hoff ME (1960) Adaptive switching circuits. Stanford University, CA Stanford Electronics Labs

  • Wilson MA, McNaughton BL (1994) Reactivation of hippocampal ensemble memories during sleep. Science 265:676–679

    CAS  PubMed  Google Scholar 

  • Wylie TR (2013) The discrete Fréchet distance with applications. College of Engineering, Montana State University-Bozeman, Bozeman

    Google Scholar 

Download references

Funding

Funding was provided by NFS-ANR CRCNS (Grant No. #1429929, SPAQUENCE).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Peter Ford Dominey.

Additional information

Communicated by Jean-Marc Fellous.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Special Issue entitled ‘Complex Spatial Navigation in Animals, Computational Models and Neuro-inspired Robots’.

Appendix

Appendix

Code for the HIPP–PFC model simulator is available at: https://github.com/NicolasCAZIN/TRN. Code for the animat SCS simulator integrated with the model is available at: https://github.com/biorobaw/scs.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cazin, N., Scleidorovich, P., Weitzenfeld, A. et al. Real-time sensory–motor integration of hippocampal place cell replay and prefrontal sequence learning in simulated and physical rat robots for novel path optimization. Biol Cybern 114, 249–268 (2020). https://doi.org/10.1007/s00422-020-00820-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00422-020-00820-2

Keywords

Navigation