• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Bai, Zhigang (Bai, Zhigang.) | Bao, Changchun (Bao, Changchun.) (Scholars:鲍长春) | Yan, Bofang (Yan, Bofang.)

Indexed by:

CPCI-S

Abstract:

In this paper, we present a novel approach for speech enhancement based on nonnegative matrix factorization (NMF) with the speech magnitude spectrum constrained by a codebook. First, we utilize a codebook to model the magnitude spectrum of clean speech and a speech magnitude spectrum codebook is trained containing the priori information of speech. Second, a classic noise estimation algorithm is employed to estimate the power spectral density (PSD) of noise to avoid noise classification. Then, we obtain the basis matrix of the noisy speech by combining the noise spectral with the optimal entry from the speech codebook. The magnitude spectrum of the noisy speech is decomposed by performing NMF and the estimated speech and noise components are obtained. Finally, the obtained speech and noise components are used to enhance the noisy speech. Moreover, the residual noise is further eliminated by applying the speech presence probability (SPP). The objective evaluations demonstrate that the proposed algorithm outperforms the conventional NMF based method for all the evaluated noise types at various input signal-to-noise ratios.

Keyword:

Nonnegative matrix factorization priori codebook NMF speech enhancement

Author Community:

  • [ 1 ] [Bai, Zhigang]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 3 ] [Yan, Bofang]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address:

  • [Bai, Zhigang]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP)

Year: 2018

Page: 361-365

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Online/Total:688/10526028
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.