• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Huang, J.-W. (Huang, J.-W..) | Bao, C.-C. (Bao, C.-C..) | Zhou, J. (Zhou, J..)

Indexed by:

EI Scopus

Abstract:

For the neural network-based speech Packet Loss Concealment (PLC), the input features are crucial factors that directly affect the final recovery performance. Additionally, the challenge of restoring high natural speech through PLC remains to be addressed. To effectively recover packet loss speech and improve its naturalness, this paper proposes a PLC method of speech signal based on the priori Mel-spectrum and neural vocoder. The proposed method adopts an asymmetric encoding and decoding network structure. At the encoding stage, this method utilizes two independent encoding networks to extract the latent time-frequency features from the waveform and Mel-spectrogram, respectively. At the decoding stage, the latent time-frequency features are jointly fed into a neural vocoder which is composed of several temporal adaptive denor-malization layer to restore the lost speech signals and enhance the naturalness. Simulation experiments demonstrate that the proposed method outperforms two existing packet loss concealment algorithms in terms of perceptual evaluation of speech quality and short-time objective intelligibility. © 2024 Chinese Institute of Electronics. All rights reserved.

Keyword:

Mel-spectrum packet loss concealment temporal adaptive de-normalization layer time-frequency features neural vocoder

Author Community:

  • [ 1 ] [Huang J.-W.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 2 ] [Bao C.-C.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
  • [ 3 ] [Zhou J.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Acta Electronica Sinica

ISSN: 0372-2112

Year: 2024

Issue: 8

Volume: 52

Page: 2581-2590

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 7

Affiliated Colleges:

Online/Total:694/10672497
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.