Indexed by:
Abstract:
A discriminative deep belief network (DDBN) based on the Fisher criterion is used here to calculate the super-vector feature space of speech signals. The network extracts the feature codebook of the speaker that is superior to the one from the traditional deep belief network (DBN) algorithm for multi-speaker clustering and segmentation. Evaluations on the multi-speaker audio stream corpus generated from the TIMIT database show that the speaker segmentation algorithm based on the DDBN with the Fisher criterion performs better than the traditional Bayesian information criterion (BIC) method and the DBN method.
Keyword:
Reprint Author's Address:
Email:
Source :
Journal of Tsinghua University
ISSN: 1000-0054
Year: 2013
Issue: 6
Volume: 53
Page: 804-807,812
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count:
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 3
Affiliated Colleges: