• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Fu, Yingkai (Fu, Yingkai.) | Li, Meng (Li, Meng.) | Liu, Wenxi (Liu, Wenxi.) | Wang, Yuanchen (Wang, Yuanchen.) | Zhang, Jiqing (Zhang, Jiqing.) | Yin, Baocai (Yin, Baocai.) | Wei, Xiaopeng (Wei, Xiaopeng.) | Yang, Xin (Yang, Xin.)

Indexed by:

EI SCIE

Abstract:

Event cameras, or dynamic vision sensors, have recently achieved success from fundamental vision tasks to high-level vision researches. Due to its ability to asynchronously capture light intensity changes, event camera has an inherent advantage to capture moving objects in challenging scenarios including objects under low light, high dynamic range, or fast moving objects. Thus event camera are natural for visual object tracking. However, the current event-based trackers derived from RGB trackers simply modify the input images to event frames and still follow conventional tracking pipeline that mainly focus on object texture for target distinction. As a result, the trackers may not be robust dealing with challenging scenarios such as moving cameras and cluttered foreground. In this paper, we propose a distractor-aware event-based tracker that introduces transformer modules into Siamese network architecture (named DANet). Specifically, our model is mainly composed of a motion-aware network and a target-aware network, which simultaneously exploits both motion cues and object contours from event data, so as to discover motion objects and identify the target object by removing dynamic distractors. Our DANet can be trained in an end-to-end manner without any post-processing and can run at over 80 FPS on a single V100. We conduct comprehensive experiments on two large event tracking datasets to validate the proposed model. We demonstrate that our tracker has superior performance against the state-of-the-art trackers in terms of both accuracy and efficiency. © 2023 IEEE.

Keyword:

Object recognition Target tracking Textures Cameras Deep neural networks Computer vision Image segmentation Large dataset Network architecture

Author Community:

  • [ 1 ] [Fu, Yingkai]Key Laboratory of Social Computing and Cognitive Intelligence,Ministry of Education, Dalian University of Technology, Dalian; 116024, China
  • [ 2 ] [Li, Meng]HiSilicon (Shanghai) Technologies Company Ltd., Shanghai; 201799, China
  • [ 3 ] [Liu, Wenxi]College of Computer and Data Science, Fuzhou University, Fuzhou; 350108, China
  • [ 4 ] [Wang, Yuanchen]Key Laboratory of Social Computing and Cognitive Intelligence,Ministry of Education, Dalian University of Technology, Dalian; 116024, China
  • [ 5 ] [Zhang, Jiqing]Key Laboratory of Social Computing and Cognitive Intelligence,Ministry of Education, Dalian University of Technology, Dalian; 116024, China
  • [ 6 ] [Yin, Baocai]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 7 ] [Wei, Xiaopeng]Key Laboratory of Social Computing and Cognitive Intelligence,Ministry of Education, Dalian University of Technology, Dalian; 116024, China
  • [ 8 ] [Yang, Xin]Key Laboratory of Social Computing and Cognitive Intelligence,Ministry of Education, Dalian University of Technology, Dalian; 116024, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

IEEE Transactions on Image Processing

ISSN: 1057-7149

Year: 2023

Volume: 32

Page: 6129-6141

1 0 . 6 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Affiliated Colleges:

Online/Total:742/10582484
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.