A data-driven speech enhancement method based on A* longest segment searching technique

Author：

Hao, Yue | Bao, Feng | Bao, Changchun (Scholars：鲍长春)

Indexed by：

EI; Scopus; SCIE

Abstract：

This paper proposes a data-driven speech enhancement method based on modeled long-range temporal dynamics (LRTDs). First, Mel-frequency cepstral coefficient (MFCC) features are extracted from speech and noise corpora, and Gaussian mixture models (GMMs) of the speech and the noise are trained separately with the expectation-maximization (EM) algorithm. The LRTDs are then obtained from the GMMs. Next, based on the LRTDs, a modified maximum a posteriori (MAP) based adaptive longest matching segment searching (ALMSS) method, derived from the A* search technique, is combined with the Vector Taylor Series (VTS) approximation algorithm to search the speech and noise corpora for the longest matching speech and noise segments (LMSNS). Finally, the speech spectrum is estimated from the obtained LMSNS, and a modified Wiener filter is constructed to further eliminate residual noise. Objective and subjective test results show that the proposed method outperforms the reference methods. (C) 2017 Elsevier B.V. All rights reserved.
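
The last stage described in the abstract, filtering with a Wiener-type gain once speech and noise spectra have been estimated, can be illustrated with a minimal sketch. This shows only the classical Wiener gain, not the paper's modified Wiener filter, whose form the abstract does not give. The function names, the `eps` guard, and the 4-bin power spectra are assumptions made up for illustration.

```python
import numpy as np

def wiener_gain(speech_psd, noise_psd, eps=1e-12):
    """Classical Wiener gain per frequency bin: S / (S + N).

    eps is an assumed small constant that avoids division by zero.
    """
    speech_psd = np.asarray(speech_psd, dtype=float)
    noise_psd = np.asarray(noise_psd, dtype=float)
    return speech_psd / (speech_psd + noise_psd + eps)

def enhance_frame(noisy_spectrum, speech_psd, noise_psd):
    """Scale a noisy complex spectrum by the gain; the noisy phase is kept."""
    return wiener_gain(speech_psd, noise_psd) * np.asarray(noisy_spectrum)

# Hypothetical estimates for one frame (values invented for this example).
speech_psd = np.array([4.0, 1.0, 0.0, 9.0])
noise_psd = np.array([4.0, 3.0, 2.0, 1.0])
noisy_spectrum = np.array([2.0 + 0.0j, 0.0 + 2.0j, 1.0 + 1.0j, -3.0 + 0.0j])

gain = wiener_gain(speech_psd, noise_psd)
enhanced = enhance_frame(noisy_spectrum, speech_psd, noise_psd)
```

Bins where the estimated noise dominates are attenuated towards zero, while bins dominated by speech pass almost unchanged. In the method described above, the two power spectra would come from the matched speech and noise segments rather than from fixed arrays.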

Keyword：

Speech enhancement; ALMSS; VTS; GMM; A* search technique; Modified Wiener filter; LRTDs

Author Community：

[ 1 ] [Hao, Yue]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 3 ] [Bao, Feng]Univ Auckland, Dept Elect & Comp Engn, Auckland 1010, New Zealand

Reprint Author's Address：

鲍长春
[Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

baochch@bjut.edu.cn


Related Keywords：

A Data-Driven Speech Enhancement Method Based on Modeled Long-Range Temporal Dynamics
2015，16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015)
A codebook-driven speech enhancement method by exploiting speech harmonicity
2017，7th IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2017
An Improved Dictionary Learning Method for Speech Enhancement
2015，Asia-Pacific-Signal-and-Information-Processing-Association Annual Summit and Conference (APSIPA ASC)

Source ：

SPEECH COMMUNICATION

ISSN： 0167-6393

Year： 2017

Volume： 92

Page： 142-151

3.200 (JCR@2022)

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：175

CAS Journal Grade：4

Cited Count：

WoS CC Cited Count： 0

ESI Highly Cited Papers on the List： 0

30 Days PV： 7

Affiliated Colleges：

Faculty of Information Technology: records not explicitly assigned to a department within the faculty
