• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Jia, Maoshen (Jia, Maoshen.) (Scholars:贾懋珅) | Zhang, Jiaming (Zhang, Jiaming.) | Bao, Changchun (Bao, Changchun.) (Scholars:鲍长春) | Zheng, Xiguang (Zheng, Xiguang.)

Indexed by:

Scopus SCIE

Abstract:

Rendering spatial sound scenes via audio objects has become popular in recent years, since it can provide more flexibility for different auditory scenarios, such as 3D movies, spatial audio communication and virtual classrooms. To facilitate high-quality bitrate-efficient distribution for spatial audio objects, an encoding scheme based on intra-object sparsity (approximate k-sparsity of the audio object itself) is proposed in this paper. The statistical analysis is presented to validate the notion that the audio object has a stronger sparseness in the Modified Discrete Cosine Transform (MDCT) domain than in the Short Time Fourier Transform (STFT) domain. By exploiting intra-object sparsity in the MDCT domain, multiple simultaneously occurring audio objects are compressed into a mono downmix signal with side information. To ensure a balanced perception quality of audio objects, a Psychoacoustic-based time-frequency instants sorting algorithm and an energy equalized Number of Preserved Time-Frequency Bins (NPTF) allocation strategy are proposed, which are employed in the underlying compression framework. The downmix signal can be further encoded via Scalar Quantized Vector Huffman Coding (SQVH) technique at a desirable bitrate, and the side information is transmitted in a lossless manner. Both objective and subjective evaluations show that the proposed encoding scheme outperforms the Sparsity Analysis (SPA) approach and Spatial Audio Object Coding (SAOC) in cases where eight objects were jointly encoded.

Keyword:

audio object coding sparsity multi-channel audio coding psychoacoustic model

Author Community:

  • [ 1 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 2 ] [Zhang, Jiaming]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 4 ] [Zheng, Xiguang]Univ Wollongong, Fac Engn & Informat Sci, Wollongong, NSW 2522, Australia

Reprint Author's Address:

  • 贾懋珅

    [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

APPLIED SCIENCES-BASEL

ISSN: 2076-3417

Year: 2017

Issue: 12

Volume: 7

2 . 7 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:165

CAS Journal Grade:4

Cited Count:

WoS CC Cited Count: 6

SCOPUS Cited Count: 8

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:778/10657358
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.