Author:

Deng, Shuhao | Bao, Changchun

Indexed by:

EI Scopus SCIE

Abstract:

Recently, deep neural networks (DNNs) have been employed for packet loss concealment (PLC) in digital speech transmission. Owing to the strong mapping ability of DNNs, DNN-based PLC methods usually achieve better speech recovery than traditional methods. However, because the phase of speech is confined to the interval between -pi and pi and, unlike the magnitude spectrum, has no obvious spectral structure, it is poorly suited to DNN learning. DNN-based PLC methods therefore usually struggle to estimate the phase of the lost speech accurately, which limits the quality of the recovered speech. To address this inaccurate phase estimation, this paper proposes a new PLC method based on phase unwrapping. The method has two stages: a training stage and a test stage. In the training stage, cellular automata (CA) are first used to unwrap the phase of speech so that its spectrum becomes structured. The unwrapped phase spectrum then serves as both the input feature and the training target of the DNN model: the input feature consists of the unwrapped phase spectra of a few frames preceding the lost speech, and the training target is the unwrapped phase spectrum of the lost speech. In the test stage, the unwrapped phase of a few frames preceding the lost speech is first extracted and fed to the DNN to obtain the unwrapped phase of the lost speech. This unwrapped phase is then re-wrapped into the phase of the lost speech. Finally, the lost speech is recovered by combining the estimated phase spectrum with the logarithmic power spectrum (LPS) of the lost speech. Experimental results show that, compared with the existing DNN-based PLC method, the proposed method recovers the lost speech better and improves speech quality, making it suitable for PLC in digital speech transmission.
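
The unwrap, re-wrap, and reconstruct steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: `numpy.unwrap` stands in for the cellular-automata unwrapping, the DNN is omitted (the true unwrapped phase is passed straight through), unwrapping along the frequency axis is an assumption, and the helper names `rewrap` and `frame_from_lps_and_phase` are hypothetical.

```python
import numpy as np

def rewrap(phase):
    """Map an arbitrary (unwrapped) phase back to the principal interval (-pi, pi]."""
    return np.angle(np.exp(1j * phase))

def frame_from_lps_and_phase(lps, phase, n_fft):
    """Rebuild one time-domain frame from a log-power spectrum and a phase spectrum."""
    magnitude = np.sqrt(np.exp(lps))          # LPS = log(|X|^2)  ->  |X|
    spectrum = magnitude * np.exp(1j * phase)  # recombine magnitude and phase
    return np.fft.irfft(spectrum, n=n_fft)

# Toy "speech" frame (random noise, for illustration only).
n_fft = 256
rng = np.random.default_rng(0)
frame = rng.standard_normal(n_fft)

spectrum = np.fft.rfft(frame)
lps = np.log(np.abs(spectrum) ** 2 + 1e-12)   # small floor avoids log(0)
wrapped = np.angle(spectrum)                  # phase confined to (-pi, pi]

# Stand-in for the CA-based unwrapping (assumption: along the frequency axis).
unwrapped = np.unwrap(wrapped)

# In the described method a DNN would predict `unwrapped` for the lost frame;
# here the true value is used so that only the wrap/unwrap round trip is shown.
recovered = frame_from_lps_and_phase(lps, rewrap(unwrapped), n_fft)
```

Because unwrapping only adds integer multiples of 2*pi, re-wrapping recovers a phase equivalent to the original one, so in this sketch `recovered` matches `frame` up to numerical precision.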

Keyword:

Phase estimation; phase unwrapping; cellular automata; packet loss concealment; deep neural network

Author Community:

  • [ 1 ] [Deng, Shuhao]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Source:

SPEECH COMMUNICATION

ISSN: 0167-6393

Year: 2022

Volume: 138

Page: 88-97

3.2 (JCR@2022)

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold: 46

JCR Journal Grade: 2

CAS Journal Grade: 3

ESI Highly Cited Papers on the List: 0
