Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter - Details

Author：

Wang, Dujuan (Wang, Dujuan.) | Bao, Changchun (Bao, Changchun.)

Indexed by：

EI Scopus

Abstract：

Deep　neural　network　(DNN)　based　ideal　ratio　mask　(IRM)　estimation　methods　have　yielded　good　performance　in　monaural　speech　enhancement.　Meanwhile,　these　methods　have　also　shown　considerable　potential　for　beamforming　and　multichannel　speech　enhancement.　It　is　crucial　for　minimum　variance　distortionless　response　(MVDR)　beamformer　to　estimate　the　covariance　matrix　of　the　speech　and　noise　accurately.　The　accurate　estimation　of　time-frequency　(T-F)　mask　has　significant　impact　on　the　estimation　of　the　covariance　matrices.　So,　in　this　paper,　a　complex　real　and　imaginary　ratio　mask　(CRIRM)　based　MVDR　beamformer　for　speech　enhancement　using　residual　network　is　proposed.　First,　the　real　and　imaginary　masks　of　speech　and　noise　are　estimated　by　taking　advantage　of　a　residual　neural　network.　After　that,　the　estimations　of　speech　and　noise　are　obtained　by　using　the　estimated　masks.　Finally,　the　covariance　matrices　of　speech　and　noise　are　estimated,　and　applied　into　the　MVDR　beamformer.　In　addition,　in　order　to　further　reduce　residual　noise　interference,　the　output　of　the　MVDR　beamformer　is　further　processed　by　an　end-to-end　monaural　speech　enhancement　module.　Experiments　show　that,　the　proposed　method　can　better　improve　the　quality　and　intelligibility　of　the　enhanced　speech.　©　2020　IEEE.

Keyword：

Speech intelligibility Deep neural networks Frequency estimation Covariance matrix Speech enhancement Beamforming

Author Community：

[ 1 ] [Wang, Dujuan]Beijing University of Technology, Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing, China
[ 2 ] [Bao, Changchun]Beijing University of Technology, Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Speech enhancement based on cepstral mapping and deep neural networks
2018，4th IEEE International Conference on Computer and Communications, ICCC 2018
A Weekly Supervised Speech Enhancement Strategy using Cycle-GAN
2020，2020 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2020
An ideal wiener filter correction-based cIRM speech enhancement method using deep neural networks with skip connections
2018，14th IEEE International Conference on Signal Processing, ICSP 2018
Multi-channel Speech Enhancement with Multiple-target GANs
2020，2020 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2020

Source ：

Year： 2020

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 8

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to