• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Jia, Maoshen (Jia, Maoshen.) (Scholars:贾懋珅) | Sun, Jundai (Sun, Jundai.) | Bao, Changchun (Bao, Changchun.) (Scholars:鲍长春) | Ritz, Christian (Ritz, Christian.)

Indexed by:

EI Scopus SCIE

Abstract:

This paper proposes a blind source separation (BSS) method for recovering multiple speech sources from sound fields recorded by a B-format microphone. This microphone provides a four channel representation that can be used to derive the direction of arrival (DOA) of spatially distinct time-frequency (TF) components. Such sparse components correspond to bins where only one speech source is active and are identified based on the inter correlation among the mixture signals. They are recovered via a degenerate unmixing estimation technique (DUET)-like method. Proposed is a "local-zone stationarity" assumption, where the amplitude of a speech signal remains approximately constant within a small band of TF components. This assumption is validated through statistical analysis of a quantitative measure of stationarity. Under this assumption, the non-sparse components (TF points where more than one speech source is active) are recovered via a Wiener-filter-like approach where the separated sparse components is utilized as a guide. The final separated sources are obtained by combining the separated sparse and non-sparse components. Both objective and subjective evaluations show that the proposed method achieves better separation quality compared to some existing BSS approaches where up to six simultaneous speech sources are considered.

Keyword:

Multiple speech source separation Sparsity B-format microphone

Author Community:

  • [ 1 ] [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Sun, Jundai]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Bao, Changchun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Ritz, Christian]Univ Wollongong, ICT Res Inst, Wollongong, NSW 2500, Australia
  • [ 5 ] [Ritz, Christian]Univ Wollongong, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2500, Australia

Reprint Author's Address:

  • 贾懋珅

    [Jia, Maoshen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

SPEECH COMMUNICATION

ISSN: 0167-6393

Year: 2018

Volume: 96

Page: 184-196

3 . 2 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:161

JCR Journal Grade:3

Cited Count:

WoS CC Cited Count: 15

SCOPUS Cited Count: 25

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 2

Online/Total:498/10633531
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.