• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

He, Dongzhi (He, Dongzhi.) | Li, Chenxi (Li, Chenxi.) | Ma, Zeyuan (Ma, Zeyuan.) | Li, Yunqi (Li, Yunqi.)

Indexed by:

EI Scopus SCIE

Abstract:

The precise and reliable segmentation of endoscopic images plays a pivotal role in the early diagnosis of gastrointestinal cancer lesions like gastric cancer and colon polyp. Due to the variations in appearance and blurred boundaries of cancer lesions, achieving precise automatic segmentation remains challenging. Therefore, this paper proposes a novel CNN-Transformer hybrid network named MSNet, which comprehensively utilizes multi-scale features. It aims to improve the segmentation accuracy of gastrointestinal cancer lesions by enhancing the ability to capture multi-scale features and distinguish boundaries. Firstly, a multi-scale perception module (MSPM) is designed specifically to capture and enhance multi-scale features, employing multiple pathways with stripe convolutions of various kernel sizes. Secondly, to enhance the model's ability to identify target boundaries, an improved dual feature pyramid (IDFP) strategy is proposed to obtain multi-scale prediction maps from various pathways and stages, thereby aggregating multi-scale features. Thirdly, to refine the network's recognition of target boundaries, a Boundary Enhancement Module (BEM) is devised, which integrates the multi-scale prediction maps obtained from IDFP with the final output of the decoder. This integration intends to mitigate spatial information loss caused by consecutive downsampling and upsampling operations, thereby achieving more precise segmentation outcomes. Extensive experiments are conducted on the privately collected gastric endoscopy dataset and five publicly available colonoscopy polyp datasets, aiming to assess the effectiveness and generalization of the proposed method. The proposed method achieves 88.3% mDice, 93.6% mDice and 94.8% mDice on the gastroscopy dataset, Kvasir-SEG and CVC-ClinicDB respectively. The source code will be available at https://github.com/0xChunFeng/MSNet.

Keyword:

Artificial intelligence Multi-scale feature representation Deep learning Medical image segmentation

Author Community:

  • [ 1 ] [He, Dongzhi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Li, Chenxi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Ma, Zeyuan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Li, Yunqi]Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Gastroenterol, Beijing 100853, Peoples R China

Reprint Author's Address:

  • [He, Dongzhi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China;;[Li, Yunqi]Chinese Peoples Liberat Army Gen Hosp, Med Ctr 1, Dept Gastroenterol, Beijing 100853, Peoples R China

Show more details

Related Keywords:

Source :

SIGNAL IMAGE AND VIDEO PROCESSING

ISSN: 1863-1703

Year: 2025

Issue: 1

Volume: 19

2 . 3 0 0

JCR@2022

Cited Count:

WoS CC Cited Count: 1

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 10

Affiliated Colleges:

Online/Total:474/10579955
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.