• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Cui, Zihao (Cui, Zihao.) | Bao, Changchun (Bao, Changchun.)

Indexed by:

EI Scopus SCIE

Abstract:

In conventional speech enhancement methods, the target of noise mask in the time-frequency domain is based on deep neural networks (DNN), such as ideal ratio mask and phase-sensitive mask, in which they do not consider the dependency of spectrum. In this paper, an ideal real-valued ratio mask (IRVRM) extraction method is proposed based on the analysis-by-synthesis (ABS) for utilizing the dependency of spectrum. In the synthesis process, the enhanced speech is obtained by inverse short-time Fourier transform (ISTFT) of the masked spectrum, whereas in the analysis process, the IRVRM is determined by maximizing speech quality of the reconstructed speech from mask space. The ABS loop algorithm is proposed to reduce computational complexity, namely, the best mask in the specifically generated subspace is conducted in each loop. After the ABS loop, the approximated IRVRM is conducted. This IRVRM is further utilized as the training target of the DNN. The experimental results show that when the extracted IRVRM with the ABS loop is employed as the training target of the DNN, the speech quality is effectively improved in the DNN-based noise masking. © 2022 Elsevier B.V.

Keyword:

Frequency domain analysis Extraction Spectrum analysis Speech enhancement Quality control Deep neural networks

Author Community:

  • [ 1 ] [Cui, Zihao]Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Bao, Changchun]Speech and Audio Signal Processing Laboratory, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Speech Communication

ISSN: 0167-6393

Year: 2022

Volume: 144

Page: 26-41

3 . 2

JCR@2022

3 . 2 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:46

JCR Journal Grade:2

CAS Journal Grade:3

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 5

Affiliated Colleges:

Online/Total:555/10663285
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.