• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Hao, Yue (Hao, Yue.) | Bao, Feng (Bao, Feng.) | Bao, Changchun (Bao, Changchun.) (Scholars:鲍长春)

Indexed by:

EI Scopus SCIE

Abstract:

This paper proposed a data-driven speech enhancement method based on the modeled long-range temporal dynamics (LRTDs). First, by extracting the Mel-Frequency Cepstral coefficient (MFCC) features from speech and noise corpora, the Gaussian Mixture Models (GMMs) of the speech and noise were trained respectively based on the expectation-maximization (EM) algorithm. Then, the LRTDs were obtained from the GMM models. Next, based on the LRTDs, a modified maximum a posterior (MAP) based adaptive longest matching segment searching (ALMSS) method derived from A* search technique was combined with the Vector Taylor Series (VTS) approximation algorithm in order to search the longest matching speech and noise segments (LMSNS) from speech and noise corpora. Finally, using the obtained LMSNS, the estimation of speech spectrum was achieved. Furthermore, a modified Wiener filter was constructed to further eliminate residual noise. The objective and subjective test results show that the proposed method outperforms the reference methods. (C) 2017 Elsevier B.V. All rights reserved.

Keyword:

Speech enhancement ALMSS VTS GMM A* search technique Modified Wiener filter LRTDs

Author Community:

  • [ 1 ] [Hao, Yue]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Feng]Univ Auckland, Dept Elect & Comp Engn, Auckland 1010, New Zealand

Reprint Author's Address:

  • 鲍长春

    [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

SPEECH COMMUNICATION

ISSN: 0167-6393

Year: 2017

Volume: 92

Page: 142-151

3 . 2 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:175

CAS Journal Grade:4

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 7

Online/Total:330/10642431
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.