Loop closure detection based on feature pyramids and NetVLAD - Details

Author：

Ren, Mingrong (Ren, Mingrong.) | Gao, Bo (Gao, Bo.)

Indexed by：

SCIE

Abstract：

Most　traditional　loop　closure　detection　(LCD)　methods　rely　on　manual　feature　design,　which　is　sensitive　to　environmental　conditions.　Convolutional　neural　networks　(CNNs)　cope　better　with　illumination　changes　by　extracting　hierarchical　features　and　ignoring　the　local　spatial　characteristics　of　images.　We　propose　an　LCD　algorithm　that　combines　VGG16,　NetVLAD,　and　image　pyramids　to　enhance　its　accuracy　and　robustness.　In　particular,　a　three-level　image　pyramid　was　constructed　via　downsampling,　and　then　a　feature　pyramid　(FP)　layer　was　obtained　by　extracting　features　through　VGG16　network　on　different　image　resolutions.　The　obtained　FPs　were　then　passed　into　the　VLAD　model,　and　this　model　outputted　VLAD　vectors　by　performing　residual　summation　with　L2　normalization.　Finally,　a　triplet　loss　function　was　employed　for　training.　Experimental　results　on　two　benchmark　datasets　and　a　real　scenario　dataset　demonstrated　that　this　algorithm　outperforms　the　NetVLAD　baseline　and　the　VGG16　network,　exhibiting　superior　feature-learning　capabilities　and　achieving　a　higher　LCD　accuracy.　Further,　it　maintained　real-time　performance　with　only　a　2%　increase　in　processing　time.　The　results　indicate　that　the　proposed　method　detects　loop　closures　even　in　complex　environments　with　varying　conditions　and　perspectives.　Hence,　the　approach　can　be　used　for　large-scale　visual　simultaneous　localization　and　mapping　applications,　such　as　autonomous　driving,　where　LCD　plays　a　crucial　role　in　mapping.

Keyword：

feature pyramid VGG16 loop closure detection convolutional neural network NetVLAD

Author Community：

[ 1 ] [Ren, Mingrong]Beijing Univ Technol, Coll Automat, Fac Informat & Technol, Beijing, Peoples R China
[ 2 ] [Gao, Bo]Imperial Coll London, Fac Engn, Dept Elect & Elect Engn, London, England
[ 3 ] [Gao, Bo]Beijing Univ Technol, Beijing Dublin Int Coll, Dept Elect Informat Engn, Beijing, Peoples R China

Reprint Author's Address：

Email：

renmingrong@bjut.edu.cn |
bg623@ic.ac.uk

Show more details

Related Keywords：

The loop closure detection algorithm based on bag of semantic word for robot navigation
2020，2020 IEEE International Conference on Information Technology, Big Data and Artificial Intelligence, ICIBA 2020
Fast and Robust Visual Loop Closure Detection based on MobileNetV3 and NetVLAD
2024，3rd International Symposium on Control Engineering and Robotics, ISCER 2024
Foreground Capture Feature Pyramid Network-Oriented Object Detection in Complex Backgrounds
2024，IEEE Transactions on Neural Networks and Learning Systems
Feature Pyramid Based Scene Text Detector
2017，14th IAPR International Conference on Document Analysis and Recognition (ICDAR)

Source ：

JOURNAL OF ELECTRONIC IMAGING

ISSN： 1017-9909

Year： 2023

Issue： 6

Volume： 32

1 . 1 0 0

JCR@2022

Cited Count：

WoS CC Cited Count： 1

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 7

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to