Abstract:
In recent years, the amount of visual data generated by edge devices has increased substantially. Machines typically process this data to accomplish tasks such as object detection without human visual judgment, yet human viewing is still required in scenarios such as human-robot interaction, and the information that matters to humans differs significantly from the information that matters to machines. To address this, we propose an end-to-end learning-based image coding framework that balances human and machine vision tasks. Unlike compression frameworks that target only human vision, a portion of the latent space is shared by machine vision and human vision; because of this sharing, correlations remain between the two tasks, so we propose a partial-channel context model to exploit them and improve coding performance. Our scalable coding framework supports human and machine vision simultaneously by partitioning the latent space: machine vision tasks are handled by a subset of the latent space, referred to as the base layer, while the more demanding human visual reconstruction task uses the full latent space, comprising both the base and enhancement layers. In the experimental section, we evaluate human visual reconstruction and machine vision performance against other benchmarks. The experiments demonstrate that our framework achieves a 28.27%-38.16% bitrate reduction for machine vision tasks and matches the performance of state-of-the-art image codecs in input reconstruction. © 2024 IEEE.
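To make the layered design concrete, the following is a minimal, hypothetical PyTorch sketch of the latent-space partition described in the abstract: a base subset of latent channels feeds a machine-vision head, while the full latent (base plus enhancement channels) feeds the reconstruction decoder. The module names, channel counts, and layer choices are illustrative assumptions, not the authors' implementation.

    # Hypothetical sketch: partition the latent space into a base layer
    # (machine vision) and an enhancement layer (human-vision reconstruction).
    import torch
    import torch.nn as nn

    class ScalableCodec(nn.Module):
        def __init__(self, latent_channels=192, base_channels=64):
            super().__init__()
            self.base_channels = base_channels
            # Analysis transform: image -> latent y with `latent_channels` channels.
            self.encoder = nn.Sequential(
                nn.Conv2d(3, 128, 5, stride=2, padding=2), nn.ReLU(),
                nn.Conv2d(128, latent_channels, 5, stride=2, padding=2),
            )
            # Machine-vision branch decodes only the base subset of channels.
            self.task_head = nn.Conv2d(base_channels, 80, 3, padding=1)
            # Human-vision branch decodes the full latent (base + enhancement).
            self.recon_decoder = nn.Sequential(
                nn.ConvTranspose2d(latent_channels, 128, 5, stride=2,
                                   padding=2, output_padding=1), nn.ReLU(),
                nn.ConvTranspose2d(128, 3, 5, stride=2,
                                   padding=2, output_padding=1),
            )

        def forward(self, x):
            y = self.encoder(x)
            y_base = y[:, : self.base_channels]     # base layer (machine vision)
            task_features = self.task_head(y_base)  # e.g. detection features
            reconstruction = self.recon_decoder(y)  # base + enhancement layers
            return task_features, reconstruction

    # Usage:
    # codec = ScalableCodec()
    # feats, x_hat = codec(torch.randn(1, 3, 256, 256))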
Year: 2024
Page: 1852-1857
Language: English
ESI Highly Cited Papers on the List: 0