An Improved Dictionary Learning Method for Speech Enhancement - Details

Author：

Hao, Yue (Hao, Yue.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春)

Indexed by：

CPCI-S

Abstract：

In　this　paper,　an　improved　dictionary　learning　method　for　speech　enhancement　is　proposed.　Given　prior　information　of　the　noise,　the　dictionaries　of　speech　and　noise　are　firstly　trained　by　an　approximate　KSVD　algorithm,　respectively.　Then,　the　estimated　short-time　Fourier　transform　(STFT)　magnitudes　of　speech　and　noise　can　be　sparsely　represented　by　multiplying　the　dictionary　with　sparse　coefficients,　which　are　calculated　by　the　least　angle　regression　(LAR)　algorithm.　A　geometrical　stopping　criterion　with　an　adaptive　threshold　is　utilized　to　adjust　the　conventional　stopping　criterion　in　LAR　algorithm　so　that　it　can　increase　the　adaptability　of　LAR.　Next,　we　propose　a　framework　that　utilizes　the　expectation　maximization　(EM)　method　to　refine　the　energy　of　the　estimated　speech　and　noise　in　order　to　obtain　more　accurate　estimation　of　STFT　magnitudes.　Finally,　a　modified　wiener　filter　is　constructed　to　further　eliminate　residual　noise.　When　the　prior　information　of　noise　is　unknown,　an　online　noise　estimation　method　is　applied　to　replace　the　noise　dictionary.　The　test　results　show　that　the　proposed　method　outperforms　the　reference　speech　enhancement　methods.

Keyword：

Dictionary learning EM framework Noise estimation Modified Wiener filtering Speech enhancement Sparse representation

Author Community：

[ 1 ] [Hao, Yue]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China
[ 2 ] [Bao, Changchun]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Reprint Author's Address：

[Hao, Yue]Beijing Univ Technol, Sch Elect Informat & Control Engn, Speech & Audio Signal Proc Lab, Beijing 100124, Peoples R China

Email：

S201302071@emails.bjut.edu.cn |
baochch@bjut.edu.cn

Show more details

Related Keywords：

Sparse representation of acoustic emission signals and its application in pipeline leak location
2023，Measurement: Journal of the International Measurement Confederation
INCOHERENT DICTIONARY LEARNING FOR SPARSE REPRESENTATION BASED IMAGE DENOISING
2014，IEEE International Conference on Image Processing (ICIP)
Non-parametric Bayesian dictionary learning based on Laplace noise
2021，MULTIMEDIA TOOLS AND APPLICATIONS
Nonparametric tensor dictionary learning with beta process priors
2016，NEUROCOMPUTING

Source ：

2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA)

ISSN： 2309-9402

Year： 2015

Page： 144-147

Language： English

Cited Count：

WoS CC Cited Count： 2

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 11

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to