DNN-based speech enhancement using MBE model - Details

Author：

Huang, Qizheng (Huang, Qizheng.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春) | Wang, Xianyun (Wang, Xianyun.) | Xiang, Yang (Xiang, Yang.)

Indexed by：

EI Scopus

Abstract：

This　paper　provides　a　novel　deep　neural　networks　(DNN)　based　speech　enhancement　method　using　multi-band　excitation　(MBE)　model.　Generally,　the　proposed　system　contains　two　stages,　namely　training　stage　and　enhancing　stage.　In　the　training　stage,　two　DNNs　with　different　targets　are　trained.　The　training　targets　are　harmonic　magnitude　and　band　difference　function　of　clean　speech,　respectively.　The　input　feature　for　two　DNNs　is　log-power　spectra　(LPS)　of　noisy　speech.　In　the　enhancing　stage,　using　the　output　of　DNNs　and　online　estimated　pitch　period,　the　enhanced　speech　can　be　obtained　by　MBE　speech　synthesis.　Using　the　proposed　method,　the　parameters　of　MBE　model　can　be　accurately　estimated　to　synthesize　the　enhanced　speech　with　the　high　quality.　At　the　same　time,　the　noise　between　the　harmonics　is　effectively　eliminated.　The　experiments　show　that　the　proposed　method　outperforms　the　reference　methods　for　speech　quality　and　intelligibility.　©　2018　IEEE.

Keyword：

Acoustic waves Speech synthesis Deep neural networks Speech intelligibility Continuous speech recognition Speech enhancement

Author Community：

[ 1 ] [Huang, Qizheng]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Laboratory, Beijing; 100124, China
[ 2 ] [Bao, Changchun]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Laboratory, Beijing; 100124, China
[ 3 ] [Wang, Xianyun]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Laboratory, Beijing; 100124, China
[ 4 ] [Xiang, Yang]Faculty of Information Technology, Beijing University of Technology, Speech and Audio Signal Processing Laboratory, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Speech enhancement via generative adversarial LSTM networks
2018，16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
Phase unwrapping based speech enhancement
2019，2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019
Speech enhancement based on cepstral mapping and deep neural networks
2018，4th IEEE International Conference on Computer and Communications, ICCC 2018
A Weekly Supervised Speech Enhancement Strategy using Cycle-GAN
2020，2020 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2020

Source ：

Year： 2018

Page： 196-200

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 11

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to