A Speech Packet Loss Concealment Method Based on Priori Mel-Spectrum and Neural Vocoder; [基于先验梅尔谱和神经声码器的语音丢包隐藏方法] - Details

Author：

Huang, J.-W. (Huang, J.-W..) | Bao, C.-C. (Bao, C.-C..) | Zhou, J. (Zhou, J..)

Indexed by：

EI Scopus

Abstract：

For　the　neural　network-based　speech　Packet　Loss　Concealment　(PLC),　the　input　features　are　crucial　factors　that　directly　affect　the　final　recovery　performance.　Additionally,　the　challenge　of　restoring　high　natural　speech　through　PLC　remains　to　be　addressed.　To　effectively　recover　packet　loss　speech　and　improve　its　naturalness,　this　paper　proposes　a　PLC　method　of　speech　signal　based　on　the　priori　Mel-spectrum　and　neural　vocoder.　The　proposed　method　adopts　an　asymmetric　encoding　and　decoding　network　structure.　At　the　encoding　stage,　this　method　utilizes　two　independent　encoding　networks　to　extract　the　latent　time-frequency　features　from　the　waveform　and　Mel-spectrogram,　respectively.　At　the　decoding　stage,　the　latent　time-frequency　features　are　jointly　fed　into　a　neural　vocoder　which　is　composed　of　several　temporal　adaptive　denor-malization　layer　to　restore　the　lost　speech　signals　and　enhance　the　naturalness.　Simulation　experiments　demonstrate　that　the　proposed　method　outperforms　two　existing　packet　loss　concealment　algorithms　in　terms　of　perceptual　evaluation　of　speech　quality　and　short-time　objective　intelligibility.　©　2024　Chinese　Institute　of　Electronics.　All　rights　reserved.

Keyword：

Mel-spectrum packet loss concealment temporal adaptive de-normalization layer time-frequency features neural vocoder

Author Community：

[ 1 ] [Huang J.-W.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 2 ] [Bao C.-C.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China
[ 3 ] [Zhou J.]Institute of Speech and Audio Signal Processing, Faculty of Information Technology, Beijing University of Technology, Beijing, 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

A Neural Vocoder Based Packet Loss Concealment Algorithm
2022，
Packet Loss Concealment Based on Phase Correction and Deep Neural Network
2022，APPLIED SCIENCES-BASEL
Packet Loss Concealment Method Based On The Simplified Residual Network
2022，2022 16TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP2022), VOL 1
Research advance in packet loss processing techniques for VoIP
2007，Journal on Communications

Source ：

Acta Electronica Sinica

ISSN： 0372-2112

Year： 2024

Issue： 8

Volume： 52

Page： 2581-2590

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 7

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to