• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Deng, Shuhao (Deng, Shuhao.) | Bao, Changchun (Bao, Changchun.) | Cheng, Rui (Cheng, Rui.)

Indexed by:

EI Scopus

Abstract:

Beamforming method can effectively remove background noise, even in the complex environment, so it is widely used in speech enhancement. We propose a novel Generalized Eigenvalue (GEV) beamforming with Blind Analytic Normalization (BAN) method. In this method, the GEV beamformer coefficients are constructed by estimating logarithmic power spectrum (LPS), which are used to filter multichannel speech signals, and post filter technology is used to further remove noise in the beamformed signals. Firstly, in order to estimate the LPS of speech signal in each channel, we use the data-driven method to train the deep neural network (DNN) model. Then, we use the well trained DNN model to estimate LPS, which is used to calculate the power spectral density (PSD) matrix of speech, and further obtain the coefficients of the GEV beamformer. Since the GEV beamformer will cause speech distortion, the BAN is employed to post-process the beamformed signal. Furthermore, single channel speech enhancement is used to reduce residual noise. Our experiment is conducted in 8-channel simulation data set. The experimental results show that, compared with some existing speech enhancement methods, the proposed method can effectively remove background noise and achieve better speech enhancement effect. © 2020 IEEE.

Keyword:

Audio signal processing Deep neural networks Spectral density Beamforming Speech enhancement Speech communication Eigenvalues and eigenfunctions

Author Community:

  • [ 1 ] [Deng, Shuhao]Beijing University of Technology, Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing, China
  • [ 2 ] [Bao, Changchun]Beijing University of Technology, Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing, China
  • [ 3 ] [Cheng, Rui]Beijing University of Technology, Speech and Audio Signal Processing Lab, Faculty of Information Technology, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Year: 2020

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Affiliated Colleges:

Online/Total:338/10505119
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.