TransFG: A Cross-View Geo-Localization of Satellite and UAVs Imagery Pipeline Using Transformer-Based Feature Aggregation and Gradient Guidance - Details

Author：

Zhao, H. (Zhao, H..) | Ren, K. (Ren, K..) | Yue, T. (Yue, T..) | Zhang, C. (Zhang, C..) | Yuan, S. (Yuan, S..)

Indexed by：

EI Scopus SCIE

Abstract：

Cross-view　geo-localization　of　satellite　and　unmanned　aerial　vehicles　(UAVs)　imagery　has　attracted　extensive　attention　due　to　its　tremendous　potential　for　global　navigation　satellite　system　(GNSS)　denied　navigation.　However,　inadequate　feature　representation　across　different　views　coupled　with　positional　shifts　and　distance-scale　uncertainty　are　key　challenges.　Most　of　the　existing　research　mainly　focused　on　extracting　comprehensive　and　fine-grained　information,　yet　effective　feature　representation　and　alignment　should　be　imposed　equal　importance.　In　this　article,　we　propose　an　innovative　transformer-based　pipeline　TransFG　for　robust　cross-view　image　matching,　which　incorporates　feature　aggregation　(FA)　and　gradient　guidance　(GG)　module.　TransFG　synergically　takes　advantage　of　FA　and　GG,　achieving　an　effective　balance　in　feature　representation　and　alignment.　Specifically,　the　proposed　FA　module　implicitly　learns　salient　features　and　dynamically　aggregates　contextual　features　from　the　vision　transformer　(ViT).　The　proposed　GG　module　uses　the　gradient　information　of　local　features　to　further　enhance　the　cross-view　feature　representation　and　aligns　specific　instances　across　different　views.　Extensive　experiments　demonstrate　that　our　pipeline　outperforms　existing　methods　in　cross-view　geo-localization.　It　achieves　an　impressive　improvement　in　R@1　and　AP　than　the　state-of-the-art　(SOTA)　methods.　The　code　has　been　released　at　https://github.com/happyboy1234/TransFG.　　©　1980-2012　IEEE.

Keyword：

transformer feature aggregation (FA) unmanned aerial vehicles (UAVs) Cross-view geo-localization image matching

Author Community：

[ 1 ] [Zhao H.]Beijing University of Technology, Laboratory of Low-Altitude Intelligent Perception, Beijing, 100124, China
[ 2 ] [Ren K.]Beijing University of Technology, Laboratory of Low-Altitude Intelligent Perception, Beijing, 100124, China
[ 3 ] [Yue T.]Beijing University of Technology, Laboratory of Low-Altitude Intelligent Perception, Beijing, 100124, China
[ 4 ] [Zhang C.]Beijing University of Technology, Laboratory of Low-Altitude Intelligent Perception, Beijing, 100124, China
[ 5 ] [Yuan S.]Beijing University of Technology, Laboratory of Low-Altitude Intelligent Perception, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Transformer-Based Discriminative and Strong Representation Deep Hashing for Cross-Modal Retrieval
2023，IEEE ACCESS
Maritime greenhouse gas emission estimation and forecasting through AIS data analytics: a case study of Tianjin port in the context of sustainable development
2023，FRONTIERS IN MARINE SCIENCE
Semi-supervised hierarchical Transformer for hyperspectral Image classification
2024，International Journal of Remote Sensing
Automatic Roadside Camera Calibration with Transformers
2023，SENSORS

Source ：

IEEE Transactions on Geoscience and Remote Sensing

ISSN： 0196-2892

Year： 2024

Volume： 62

Page： 1-12

8 . 2 0 0

JCR@2022

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 19

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 34

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to