• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Li, L. (Li, L..) | Jia, M. (Jia, M..) | Wang, J. (Wang, J..) | Cao, R. (Cao, R..)

Indexed by:

EI Scopus SCIE

Abstract:

This study proposes multiple-speech-source direction-of-arrival (DOA) estimation based on the distribution characteristic of the time-frequency (TF) point dominated by a single-source component (i.e., single-source point, SSP). By exploring the TF distribution characteristics of SSPs, we found that most are distributed in clusters in the TF domain. Hence, the concept of a single-source cluster (SSC) is given, each composed of adjacent TF points from one dominant sound source. Considering that SSCs have different shapes and sizes, an SSC detection method is designed based on point-to-cluster expansion, which is the research focus of this paper. A two-dimensional Gaussian function is introduced to model the theoretical distribution of the DOAs of SSPs, and a cluster expansion rule is proposed based on hypothesis testing of the DOA of a source. Two-dimensional kernel density estimation and peak search are adopted to estimate the DOAs and the number of sources using the detected SSCs. Experimental results in both simulated and real environments show that the proposed method can achieve better DOA estimation performance than some current techniques. IEEE

Keyword:

hypothesis testing Microphone arrays Reflection DOA estimation Recording Estimation Reverberation Location awareness single-source cluster detection Direction-of-arrival estimation

Author Community:

  • [ 1 ] [Li L.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Jia M.]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 3 ] [Wang J.]School of Information and Electronics, Beijing Institute of Technology, Beijing, China
  • [ 4 ] [Cao R.]Faculty of Science, Beijing University of Technology, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

ACM Transactions on Audio Speech and Language Processing

ISSN: 2329-9290

Year: 2023

Volume: 31

Page: 1-14

5 . 4 0 0

JCR@2022

ESI Discipline: ENGINEERING;

ESI HC Threshold:19

Cited Count:

WoS CC Cited Count: 36

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Affiliated Colleges:

Online/Total:683/10626243
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.