Abstract:
The Transformer architecture has surpassed traditional CNN-based methods in learned image compression, primarily because it enlarges the receptive field. In learned image compression, the effective receptive field is crucial: although the Transformer theoretically has an extensive receptive field, in image compression models its effective receptive field is much smaller than the theoretical value, and it incurs higher computational costs. To address this challenge, this paper proposes an innovative Multi-scale Spatial Channel Fusion (MSCF) mechanism that not only brings the effective receptive field of CNNs on par with that of Transformers but also retains the low complexity and high efficiency of CNNs. Additionally, learned image compression tends to lose a significant amount of high-frequency components; to compensate for this deficiency, we introduce a High-Frequency Enhancement (HFE) module. We integrate the MSCF mechanism and the HFE module into the MLIC++ framework. Experimental results indicate that our proposed model, Multiscale Feature Extraction and High-Frequency Enhancement for Learned Image Compression (HMLIC), achieves a substantial performance improvement over the baseline model on the Kodak, CLIC Professional Validation, and Tecnick test datasets, while incurring only a minimal increase in model complexity. © 2024 IEEE.
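Note: the record does not include implementation details, so the following is a minimal PyTorch sketch of how a multi-scale spatial-channel fusion block and a high-frequency enhancement path might look. The class names, kernel sizes, and the average-pool high-pass are assumptions for illustration, not the authors' actual MSCF/HFE design.

import torch
import torch.nn as nn

class MultiScaleFusionSketch(nn.Module):
    """Hypothetical multi-scale spatial-channel fusion block.

    Parallel depthwise convolutions with growing kernel sizes approximate
    a large effective receptive field at CNN-level cost; a 1x1 convolution
    then fuses the branches across channels.
    """
    def __init__(self, channels: int, kernel_sizes=(3, 7, 11)):
        super().__init__()
        self.branches = nn.ModuleList(
            nn.Conv2d(channels, channels, k, padding=k // 2, groups=channels)
            for k in kernel_sizes
        )
        self.fuse = nn.Conv2d(channels * len(kernel_sizes), channels, 1)

    def forward(self, x):
        # Concatenate all spatial scales, then mix them channel-wise.
        return self.fuse(torch.cat([b(x) for b in self.branches], dim=1))

class HighFrequencyEnhancementSketch(nn.Module):
    """Hypothetical HFE module: isolate high frequencies as the residual
    of a local average, then reweight them and add them back."""
    def __init__(self, channels: int):
        super().__init__()
        self.low_pass = nn.AvgPool2d(3, stride=1, padding=1)
        self.gain = nn.Conv2d(channels, channels, 1)

    def forward(self, x):
        high = x - self.low_pass(x)   # high-frequency residual
        return x + self.gain(high)    # add back with learned per-channel gain

if __name__ == "__main__":
    x = torch.randn(1, 192, 64, 64)   # latent-sized feature map (assumed shape)
    y = HighFrequencyEnhancementSketch(192)(MultiScaleFusionSketch(192)(x))
    print(y.shape)                    # torch.Size([1, 192, 64, 64])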
Year: 2024
Language: English