• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Shi, Y. (Shi, Y..) | Ren, J. (Ren, J..) | Wang, L. (Wang, L..) | Wang, J. (Wang, J..) | Liu, J. (Liu, J..)

Indexed by:

CPCI-S EI Scopus

Abstract:

In recent years, there has been a substantial increase in the amount of visual data generated by edge devices. Machines typically process this data to accomplish tasks such as object detection without human visual judgment. However, human viewing is sometimes required during human-robot interaction. Here, there exists a significant difference in the focus of information between humans and machines. To tackle this issue, we propose an end-to-end learning-based image coding framework, aiming to strike a balance between human and machine vision tasks. Also, a portion of the latent space is used for both machine vision and human vision. This is different from a compression framework that only targets human vision. Because of this difference, correlations still exist between tasks. So we propose a partial-channel context model to improve coding performance.Our scalable coding framework achieves simultaneous support for both human and machine vision by partitioning the latent space. Machine vision tasks are handled by a subset of the latent space, referred to as the base layer. More complex human visual reconstruction tasks are accomplished by an additional subset of the latent space, comprising both base and enhancement layers. In the experimental section, we present the performance of human visual reconstruction and machine vision tasks, comparing them with other benchmarks. The experiments demonstrate that our framework achieves a 28.27%-38.16% reduction in bitrate for machine vision tasks and matches the performance of state-of-the-art image codecs in terms of input reconstruction. © 2024 IEEE.

Keyword:

Context model Deep neural network Video Coding for Machines Image compression

Author Community:

  • [ 1 ] [Shi Y.]Faculty Of Information Technology, Beijing University Of Technology, Beijing, China
  • [ 2 ] [Ren J.]Faculty Of Information Technology, Beijing University Of Technology, Beijing, China
  • [ 3 ] [Wang L.]Faculty Of Information Technology, Beijing University Of Technology, Beijing, China
  • [ 4 ] [Wang J.]Faculty Of Information Technology, Beijing University Of Technology, Beijing, China
  • [ 5 ] [Liu J.]Faculty Of Information Technology, Beijing University Of Technology, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Year: 2024

Page: 1852-1857

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:686/10700273
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.