A Data-Driven Speech Enhancement Method Based on Modeled Long-Range Temporal Dynamics - Details

Author：

Hao, Yue (Hao, Yue.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春) | Bao, Feng (Bao, Feng.) | Deng, Feng (Deng, Feng.)

Indexed by：

CPCI-S Scopus

Abstract：

In　this　paper,　a　data-driven　speech　enhancement　method　based　on　modeled　long-range　temporal　dynamics　(LRTDs)　is　proposed.　First,　given　speech　and　noise　corpora,　Gaussian　Mixture　Models　(GMMs)　of　the　speech　and　noise　can　be　trained　respectively　based　on　the　expectation-maximization　(EM)　algorithm.　Then,　the　LRTDs　are　obtained　from　the　GMM　models.　Next,　based　on　the　LRTDs,　a　noise　robustness　longest　segment　searching　(NRLSS)　method　combined　with　the　Vector　Taylor　Series　(VTS)　approximation　algorithm　is　adopted　to　search　the　longest　matching　speech　and　noise　segments　(LMSNS)　from　speech　and　noise　corpora.　Finally,　using　the　obtained　LMSNS,　the　estimation　of　speech　spectrum　is　achieved.　Furthermore,　a　modified　Wiener　filter　is　constructed　to　further　eliminate　residual　noise.　The　test　results　show　that　the　proposed　method　outperforms　the　state-of-the-art　speech　enhancement　methods.

Keyword：

modified Wiener filter LRTDs GMM speech enhancement VTS NRLSS

Author Community：

[ 1 ] [Hao, Yue]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 3 ] [Bao, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 4 ] [Deng, Feng]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

[Hao, Yue]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

s201302071@emails.bjut.edu.cn |
baochch@bjut.edu.cn |
baofeng@emails.bjut.edu.cn |
dengfeng@emails.bjut.edu.cn

Show more details

Related Keywords：

A data-driven speech enhancement method based on A* longest segment searching technique
2017，SPEECH COMMUNICATION
HMM-based speech enhancement using vector Taylor series and parallel modeling in Mel-frequency domain
2014，2014 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2014
HMM-Based Speech Enhancement Using Vector Taylor Series and Parallel Modeling in Mel-Frequency Domain
2014，IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)
Compressed domain speech enhancement based on Gaussian mixture model
2012，Acta Electronica Sinica

Source ：

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5

Year： 2015

Page： 1790-1794

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to