• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhang, Wenbo (Zhang, Wenbo.) | Wu, Chaoyi (Wu, Chaoyi.) | Bao, Zhenshan (Bao, Zhenshan.)

Indexed by:

EI Scopus SCIE

Abstract:

The sustainable development of marine fisheries depends on the accurate measurement of data on fish stocks. Semantic segmentation methods based on deep learning can be applied to automatically obtain segmentation masks of fish in images to obtain measurement data. However, general semantic segmentation methods cannot accurately segment fish objects in underwater images. In this study, a Dual Pooling-aggregated Attention Network (DPANet) to adaptively capture long-range dependencies through an efficient and computing-friendly manner to enhance feature representation and improve segmentation performance is proposed. Specifically, a novel pooling-aggregate position attention module and a pooling-aggregate channel attention module are designed to aggregate contexts in the spatial dimension and channel dimension, respectively. These two modules adopt pooling operations along the channel dimension and along the spatial dimension to aggregate information, respectively, thus reducing computational costs. In these modules, attention maps are generated by four different paths and are aggregated into one. The authors conduct extensive experiments to validate the effectiveness of the DPANet and achieve new state-of-the-art segmentation performance on the well-known fish image dataset DeepFish as well as on the underwater image dataset SUIM, achieving a Mean IoU score of 91.08% and 85.39% respectively, while significantly reducing FLOPs of attention modules by about 93%.

Keyword:

image segmentation learning (artificial intelligence) computer vision convolutional neural nets

Author Community:

  • [ 1 ] [Zhang, Wenbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Wu, Chaoyi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Zhenshan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Zhang, Wenbo]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Source :

IET COMPUTER VISION

ISSN: 1751-9632

Year: 2021

Issue: 1

Volume: 16

Page: 67-82

1 . 7 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:87

JCR Journal Grade:4

Cited Count:

WoS CC Cited Count: 24

SCOPUS Cited Count: 27

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 11

Online/Total:499/10584189
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.