Speech enhancement method with geometric phase estimation by incorporating MIXMAX model - Details

Author：

Wang, Xianyun (Wang, Xianyun.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

EI Scopus

Abstract：

In　this　paper,　we　propose　a　frequency-domain　speech　enhancement　algorithm　with　phase　estimation,　in　which　the　speech　model　is　modeled　by　a　Gaussian　mixture　model　(GMM)　in　the　log-spectral　domain　and　two　closed-form　log-spectral　amplitude　estimators　for　speech　and　noise　are　derived　directly　by　using　a　Mixture-Maximum　(MIXMAX)　model.　Because　the　accurate　estimation　of　speech　phase　could　help　to　reduce　the　undesired　noise　residues　in　the　enhanced　signal,　our　two　log-spectral　estimators　are　also　used　to　construct　a　geometric　approach　for　phase　estimation　in　each　frequency　bin.　In　order　to　solve　the　ambiguity　problem　in　phase　estimation,　we　utilize　the　complex　linear　predictive　analysis　(CLPA)　and　inconsistency　constraint　to　find　an　appropriate　phase.　Experimental　results　show　that,　in　comparison　with　the　reference　methods,　the　proposed　method　achieves　an　efficient　improvement　in　speech　quality.　©　2016　Asia　Pacific　Signal　and　Information　Processing　Association.

Keyword：

Predictive analytics Frequency estimation Speech enhancement Frequency domain analysis Gaussian distribution

Author Community：

[ 1 ] [Wang, Xianyun]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Bao, Changchun]Speech and Audio Signal Processing Laboratory, School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Public Traffic Passenger Flow Prediction Model for Short-Term Large Scale Activities Based on Wavelet Analysis
2020，9th International Conference on Green Intelligent Transportation Systems and Safety, 2018
A Multiple-motion-pattern Trajectory Prediction Model for Uncertain Moving Objects
2018，Acta Automatica Sinica
HMM-based speech enhancement using vector Taylor series and parallel modeling in Mel-frequency domain
2014，2014 IEEE International Conference on Signal Processing, Communications and Computing, ICSPCC 2014
Generative bridging network for neural sequence prediction
2018，2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2018

Source ：

Year： 2016

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to