• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Liu, X. (Liu, X..) (Scholars:刘晓) | Bao, C. (Bao, C..) | Sr. (Sr..)

Indexed by:

Scopus

Abstract:

The bandwidth limitation of wideband audio systems degrades the subjective quality and naturalness of audio signals. In this paper, a new method for blind bandwidth extension of wideband audio signals is proposed based on ensemble echo state network with temporal evolution. The high-frequency components in the band of 7 ∼ 14 kHz are artificially restored only from the information in the wideband audio. For each region in the wideband feature space, a specific echo state network with recurrent structure is explored to dynamically model the local mapping relationship between wideband audio features and highfrequency spectral envelope. The transition process among regions is modeled by a hidden Markov model, and a network ensemble technique based on temporal evolution is used to fuse multiple echo state networks such that the high-frequency spectral envelope is estimated. Combining the high-frequency fine spectrum extended by spectral translation, the proposed method can effectively extend the wideband audio to super wideband. In addition, the proposed extension method is applied to the ITU-T G.729.1 wideband audio codec and is further evaluated in comparison with the ITU-T G.729.1 Annex E super-wideband audio codec and the hidden Markov model-based reference bandwidth extension method. Objective quality evaluation results indicate that the proposed method is preferred over the hidden Markov model-based reference bandwidth extension method in terms of log spectral distortion, cosh measure, and differential log spectral distortion. Further, the proposed method improves the auditory quality of the wideband audio and also gains a good performance in the subjective listening tests. ©2016 IEEE.

Keyword:

Audio bandwidth extension; Audio coding; Echo state network; Hidden markov model

Author Community:

  • [ 1 ] [Liu, X.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Bao, C.]School of Electronic Information and Control Engineering, Speech and Audio Signal Processing Laboratory, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

ACM Transactions on Audio Speech and Language Processing

ISSN: 2329-9290

Year: 2016

Issue: 3

Volume: 24

Page: 594-607

5 . 4 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:166

CAS Journal Grade:2

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 7

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Online/Total:344/10621214
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.