Biologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision

Please use this identifier to cite or link to this item:
https://doi.org/10.48693/63
Open Access logo originally created by the Public Library of Science (PLoS)
Title: Biologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision
Authors: Lukanov, Hristofor
König, Peter
Pipa, Gordon
ORCID of the author: https://orcid.org/0000-0003-3654-5267
https://orcid.org/0000-0002-3416-2652
Abstract: While abundant in biology, foveated vision is nearly absent from computational models and especially deep learning architectures. Despite considerable hardware improvements, training deep neural networks still presents a challenge and constraints complexity of models. Here we propose an end-to-end neural model for foveal-peripheral vision, inspired by retino-cortical mapping in primates and humans. Our model has an efficient sampling technique for compressing the visual signal such that a small portion of the scene is perceived in high resolution while a large field of view is maintained in low resolution. An attention mechanism for performing “eye-movements” assists the agent in collecting detailed information incrementally from the observed scene. Our model achieves comparable results to a similar neural architecture trained on full-resolution data for image classification and outperforms it at video classification tasks. At the same time, because of the smaller size of its input, it can reduce computational effort tenfold and uses several times less memory. Moreover, we present an easy to implement bottom-up and top-down attention mechanism which relies on task-relevant features and is therefore a convenient byproduct of the main architecture. Apart from its computational efficiency, the presented work provides means for exploring active vision for agent training in simulated environments and anthropomorphic robotics.
Citations: Lukanov H, König P and Pipa G (2021): Biologically Inspired Deep Learning Model for Efficient Foveal-Peripheral Vision. Front. Comput. Neurosci. 15:746204.
URL: https://doi.org/10.48693/63
https://osnadocs.ub.uni-osnabrueck.de/handle/ds-202202106316
Subject Keywords: space-variant vision; active vision; foveal vision; peripheral vision; deep learning-artificial neural network (DL-ANN); bottom-up attention; top-down attention
Issue Date: 22-Nov-2021
License name: Attribution 4.0 International
License url: http://creativecommons.org/licenses/by/4.0/
Type of publication: Einzelbeitrag in einer wissenschaftlichen Zeitschrift [article]
Appears in Collections:FB08 - Hochschulschriften
Open-Access-Publikationsfonds

Files in This Item:
File Description SizeFormat 
fncom_Lukanov_etal_2021.pdfArticle2,42 MBAdobe PDF
fncom_Lukanov_etal_2021.pdf
Thumbnail
View/Open


This item is licensed under a Creative Commons License Creative Commons