Multiple-Speech-Source DOA Estimation Based on Single-Source Cluster Detection

Author：

Li, L. | Jia, M. | Wang, J. | Cao, R.

Indexed by：

EI Scopus SCIE

Abstract：

This study proposes a multiple-speech-source direction-of-arrival (DOA) estimation method based on the distribution characteristics of time-frequency (TF) points dominated by a single-source component (i.e., single-source points, SSPs). By exploring the TF distribution characteristics of SSPs, we found that most are distributed in clusters in the TF domain. Hence, we introduce the concept of a single-source cluster (SSC), a set of adjacent TF points dominated by one sound source. Because SSCs have different shapes and sizes, an SSC detection method based on point-to-cluster expansion is designed, which is the research focus of this paper. A two-dimensional Gaussian function is introduced to model the theoretical distribution of the DOAs of SSPs, and a cluster expansion rule is proposed based on hypothesis testing of the DOA of a source. Two-dimensional kernel density estimation and peak search are then applied to the detected SSCs to estimate the DOAs and the number of sources. Experimental results in both simulated and real environments show that the proposed method can achieve better DOA estimation performance than some current techniques.
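
The final stage named in the abstract (two-dimensional kernel density estimation followed by peak search) can be illustrated with a minimal sketch. This is not the authors' implementation. The choice of azimuth and elevation as the two dimensions, the isotropic Gaussian kernel, the bandwidth, the grid step, the relative threshold, and the synthetic per-point DOA estimates are all assumptions made for illustration. The helper names `kde2d` and `find_peaks` are hypothetical.

```python
import math

def kde2d(points, grid_x, grid_y, bandwidth):
    """Evaluate an isotropic 2-D Gaussian KDE on a rectangular grid.

    Assumption: azimuth wrap-around at 0/360 degrees is ignored here.
    """
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2 * len(points))
    two_h2 = 2.0 * bandwidth ** 2
    density = []
    for gx in grid_x:
        row = []
        for gy in grid_y:
            total = sum(
                math.exp(-((gx - px) ** 2 + (gy - py) ** 2) / two_h2)
                for px, py in points
            )
            row.append(norm * total)
        density.append(row)
    return density

def find_peaks(density, grid_x, grid_y, rel_threshold=0.5):
    """Return interior grid cells that are strict local maxima of the density
    and reach at least rel_threshold times the global maximum."""
    top = max(max(row) for row in density)
    peaks = []
    for i in range(1, len(grid_x) - 1):
        for j in range(1, len(grid_y) - 1):
            value = density[i][j]
            if value < rel_threshold * top:
                continue
            neighbours = [
                density[i + di][j + dj]
                for di in (-1, 0, 1)
                for dj in (-1, 0, 1)
                if (di, dj) != (0, 0)
            ]
            if all(value > n for n in neighbours):
                peaks.append((grid_x[i], grid_y[j], value))
    return sorted(peaks, key=lambda p: -p[2])

# Synthetic stand-in for per-TF-point DOA estimates taken from detected SSCs:
# two hypothetical sources at (azimuth, elevation) = (60, 10) and (200, -20)
# degrees, each with a small symmetric spread of estimates around it.
offsets = (-4, -2, 0, 2, 4)
true_sources = [(60, 10), (200, -20)]
doa_points = [
    (az + da, el + de)
    for az, el in true_sources
    for da in offsets
    for de in offsets
]

azimuth_grid = list(range(0, 360, 5))      # degrees
elevation_grid = list(range(-90, 91, 5))   # degrees

density = kde2d(doa_points, azimuth_grid, elevation_grid, bandwidth=5.0)
peaks = find_peaks(density, azimuth_grid, elevation_grid)

# The number of peaks serves as the source-count estimate,
# and the peak locations serve as the DOA estimates.
for az, el, _ in peaks:
    print(f"estimated source at azimuth={az}, elevation={el}")
```

In this sketch the source count is never supplied as an input. It falls out of the peak search, which mirrors the abstract's statement that both the DOAs and the number of sources are estimated from the detected SSCs.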

Keyword：

hypothesis testing | Microphone arrays | Reflection | DOA estimation | Recording | Estimation | Reverberation | Location awareness | single-source cluster detection | Direction-of-arrival estimation

Author Affiliations：

[ 1 ] [Li L.] Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Jia M.] Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 3 ] [Wang J.] School of Information and Electronics, Beijing Institute of Technology, Beijing, China
[ 4 ] [Cao R.] Faculty of Science, Beijing University of Technology, Beijing, China

Related Articles：

Multi-Source Localization Using Optimized Time-Frequency Representation and Sparsity Component Analysis
2023，IEEE/ACM Transactions on Audio, Speech, and Language Processing
First-Order Relative Harmonic Coefficient-Based Time-Frequency Points Selection for Multi-Source DOA Estimation
2024，IEEE/ACM Transactions on Audio, Speech, and Language Processing
Multi-source DOA estimation in reverberant environments using potential single-source points enhancement
2021，APPLIED ACOUSTICS
DOA estimation of multiple speech sources based on the single-source point detection using an FOA microphone
2022，APPLIED ACOUSTICS

Source ：

IEEE/ACM Transactions on Audio, Speech, and Language Processing

ISSN： 2329-9290

Year： 2023

Volume： 31

Page： 1-14

Impact Factor： 5.400 (JCR@2022)

ESI Discipline： ENGINEERING;

ESI HC Threshold：19

Cited Count：

WoS CC Cited Count： 36

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0

30 Days PV： 8
