Speaker segmentation based on discriminative deep belief networks - Details

Author：

Ma, Yong (Ma, Yong.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春) | Xia, Bingyin (Xia, Bingyin.)

Indexed by：

EI Scopus PKU CSCD

Abstract：

A　discriminative　deep　belief　network　(DDBN)　based　on　the　Fisher　criterion　is　used　here　to　calculate　the　super-vector　feature　space　of　speech　signals.　The　network　extracts　the　feature　codebook　of　the　speaker　that　is　superior　to　the　one　from　the　traditional　deep　belief　network　(DBN)　algorithm　for　multi-speaker　clustering　and　segmentation.　Evaluations　on　the　multi-speaker　audio　stream　corpus　generated　from　the　TIMIT　database　show　that　the　speaker　segmentation　algorithm　based　on　the　DDBN　with　the　Fisher　criterion　performs　better　than　the　traditional　Bayesian　information　criterion　(BIC)　method　and　the　DBN　method.

Keyword：

Vector spaces Clustering algorithms Bayesian networks

Author Community：

[ 1 ] [Ma, Yong]School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China
[ 2 ] [Ma, Yong]School of Physics and Electronic Engineering, Jiangsu Normal University, Xuzhou 221009, China
[ 3 ] [Bao, Changchun]School of Physics and Electronic Engineering, Jiangsu Normal University, Xuzhou 221009, China
[ 4 ] [Xia, Bingyin]School of Electronic Information and Control Engineering, Beijing University of Technology, Beijing 100124, China

Reprint Author's Address：

Email：

baochch@bjut.edu.cn

Show more details

Related Keywords：

A multilevel trusted clustering mechanism for the awareness layer of the internet of things
2019，12th Chinese Conference on Trusted Computing and Information Security, CTCIS 2018
Exploring performance of clustering methods on document sentiment analysis
2017，Journal of Information Science
Uniform color space based facial complexion recognition for Traditional Chinese Medicine
2014，2014 13th International Conference on Control Automation Robotics and Vision, ICARCV 2014
A novel hybrid apporach based on normalized color spaces and 2DPCA for color face recognition
2010，

Source ：

Journal of Tsinghua University

ISSN： 1000-0054

Year： 2013

Issue： 6

Volume： 53

Page： 804-807,812

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to