Martin Pearson Martin.Pearson@uwe.ac.uk
Senior Lecturer
Multimodal Representation Learning for Place Recognition Using Deep Hebbian Predictive Coding
Pearson, Martin J.; Dora, Shirin; Struckmeier, Oliver; Knowles, Thomas C.; Mitchinson, Ben; Tiwari, Kshitij; Kyrki, Ville; Bohte, Sander; Pennartz, Cyriel M.A.
Authors
Shirin Dora
Oliver Struckmeier
Thomas C. Knowles
Ben Mitchinson
Kshitij Tiwari
Ville Kyrki
Sander Bohte
Cyriel M.A. Pennartz
Abstract
Recognising familiar places is a competence required in many engineering applications that interact with the real world such as robot navigation. Combining information from different sensory sources promotes robustness and accuracy of place recognition. However, mismatch in data registration, dimensionality, and timing between modalities remain challenging problems in multisensory place recognition. Spurious data generated by sensor drop-out in multisensory environments is particularly problematic and often resolved through adhoc and brittle solutions. An effective approach to these problems is demonstrated by animals as they gracefully move through the world. Therefore, we take a neuro-ethological approach by adopting self-supervised representation learning based on a neuroscientific model of visual cortex known as predictive coding. We demonstrate how this parsimonious network algorithm which is trained using a local learning rule can be extended to combine visual and tactile sensory cues from a biomimetic robot as it naturally explores a visually aliased environment. The place recognition performance obtained using joint latent representations generated by the network is significantly better than contemporary representation learning techniques. Further, we see evidence of improved robustness at place recognition in face of unimodal sensor drop-out. The proposed multimodal deep predictive coding algorithm presented is also linearly extensible to accommodate more than two sensory modalities, thereby providing an intriguing example of the value of neuro-biologically plausible representation learning for multimodal navigation.
Journal Article Type | Article |
---|---|
Acceptance Date | Nov 19, 2021 |
Online Publication Date | Dec 13, 2021 |
Publication Date | Dec 13, 2021 |
Deposit Date | Dec 13, 2021 |
Publicly Available Date | Dec 14, 2021 |
Journal | Frontiers in Robotics and AI |
Electronic ISSN | 2296-9144 |
Publisher | Frontiers Media |
Peer Reviewed | Peer Reviewed |
Volume | 8 |
Article Number | 732023 |
DOI | https://doi.org/10.3389/frobt.2021.732023 |
Keywords | Artificial Intelligence; Computer Science Applications |
Public URL | https://uwe-repository.worktribe.com/output/8260284 |
Files
Multimodal Representation Learning for Place Recognition Using Deep Hebbian Predictive Coding
(6.9 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
This is the author's accepted manuscript.
The published version is available: https://doi.org/10.3389/frobt.2021.732023
You might also like
Joint conferences - TAROS 2012 and FIRA 2012
(2013)
Journal Article
Self-adaptive context aware audio localization
(2017)
Book Chapter
Downloadable Citations
About UWE Bristol Research Repository
Administrator e-mail: repository@uwe.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search