HMM-based speech enhancement using vector Taylor series and parallel modeling in Mel-frequency domain - Details

Author：

Gao, Zhen-Zhen (Gao, Zhen-Zhen.) | Bao, Chang-Chun (Bao, Chang-Chun.) (Scholars：鲍长春) | Bao, Feng (Bao, Feng.) | Jia, Mao-Shen (Jia, Mao-Shen.)

Indexed by：

EI Scopus

Abstract：

Speech　enhancement　based　on　hidden　Markov　model　(HMM)　and　the　minimum　mean　square　error　(MMSE)　criterion　in　Mel-frequency　domain　is　generally　considered　as　a　weighted-sum　filtering　of　the　noisy　speech.　The　weights　of　filters　are　often　estimated　by　the　HMM　of　noisy　speech,　and　the　estimation　of　filters　usually　requires　an　inverse　operation　from　the　Mel-frequency　to　the　spectral　domain　which　often　causes　spectral　distortion.　In　order　to　obtain　a　more　accurate　HMM　of　noisy　speech,　the　vector　Taylor　series　(VTS)　is　used　to　estimated　the　mean　vectors　and　covariance　matrices　of　HMM　for　noisy　speech.　To　reduce　the　distortion　derived　from　inversion　operation,　a　parallel　Mel-frequency　and　log-magnitude　(PMLM)　modeling　approach　is　proposed.　In　PMLM,　a　simultaneous　modeling　in　both　Mel-frequency　domain　and　log-magnitude　(LOG-MAG)　domain　is　performed　to　train　the　HMMs　of　the　clean　speech　and　noise.　Experimental　results　show　that,　in　comparison　with　the　reference　methods,　the　proposed　method　can　get　better　performance　for　different　noise　environments　and　input　SNRs.　©　2014　IEEE.

Keyword：

Inverse problems Hidden Markov models Frequency estimation Signal processing Taylor series Frequency domain analysis Speech enhancement Mean square error Covariance matrix Vectors

Author Community：

[ 1 ] [Gao, Zhen-Zhen]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
[ 2 ] [Bao, Chang-Chun]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
[ 3 ] [Bao, Feng]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China
[ 4 ] [Jia, Mao-Shen]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Research and analysis of ocular artifact automatic removal
2014，Chinese Journal of Scientific Instrument
HMM-Based Speech Enhancement Using Vector Taylor Series and Parallel Modeling in Mel-Frequency Domain
2014，IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)
Estimation of bessel operator inversion by shearlet
2011，
Speech enhancement based on energy-focused nmf
2018，4th IEEE International Conference on Computer and Communications, ICCC 2018

Source ：

Year： 2014

Page： 733-737

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to