Multi-Source DOA Estimation in Reverberant Environments by Jointing Detection and Modeling of Time-Frequency Points - Details

Author：

Jia, Maoshen (Jia, Maoshen.) | Wu, Yuxuan (Wu, Yuxuan.) | Bao, Changchun (Bao, Changchun.) (Scholars：鲍长春) | Ritz, Christian (Ritz, Christian.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　article,　the　direction　of　arrival　(DOA)　estimation　of　multiple　speech　sources　in　reverberant　environments　is　investigated　based　on　the　recording　of　a　soundfield　microphone.　First,　the　recordings　are　analyzed　in　the　time-frequency　(T-F)　domain　to　detect　both　＇points＇　(single　T-F　points)　and　＇regions＇　(multiple,　adjacent　T-F　points)　corresponding　to　a　single　source　with　low　reverberation　(known　as　low-reverberant-single-source　(LRSS)　points).　Then,　a　LRSS　point　detection　algorithm　is　proposed　based　on　a　joint　dominance　measure　and　instantaneous　single-source　point　(SSP)　identification.　Following　this,　initial　DOA　estimates　obtained　for　the　detected　LRSS　points　are　analyzed　using　a　Gaussian　Mixture　Model　(GMM)　derived　by　the　Expectation-Maximization　(EM)　algorithm　to　cluster　components　into　sources　or　outliers　using　a　rule-based　method.　Finally,　the　DOA　of　each　actual　source　is　obtained　from　the　estimated　source　components.　Experiments　on　both　simulated　data　and　data　recorded　in　an　actual　acoustic　chamber　demonstrate　that　the　proposed　algorithm　exhibits　improved　performance　for　the　DOA　estimation　in　reverberant　environments　when　compared　to　several　existing　approaches.　©　2014　IEEE.

Keyword：

Gaussian distribution Reverberation Frequency domain analysis Frequency estimation Clustering algorithms Maximum principle Direction of arrival Image segmentation Audio recordings

Author Community：

[ 1 ] [Jia, Maoshen]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Wu, Yuxuan]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Bao, Changchun]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 4 ] [Ritz, Christian]School of Electrical Computer and Telecommunications Engineering, University of Wollongong, Wollongong; NSW; 2500, Australia

Reprint Author's Address：

[jia, maoshen]faculty of information technology, beijing university of technology, beijing; 100124, china

Email：

jiamaoshen@bjut.edu.cnemailchchbao@bjut.edu.cn) Bao, Changchun(chchbao@bjut.edu.cn

Show more details

Related Keywords：

Speech enhancement method with geometric phase estimation by incorporating MIXMAX model
2016，2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2016
Short-term traffic flow prediction by a Sugeno fuzzy system based on Gaussian mixture models
2012，Journal of Theoretical and Applied Information Technology
Online energy adjustment using AR-HMM for speech enhancement
2014，Acta Electronica Sinica
Comparisons on segmentation of brain MR image
2009，9th International Conference on Electronic Measurement and Instruments, ICEMI 2009

Source ：

ACM Transactions on Audio Speech and Language Processing

ISSN： 2329-9290

Year： 2021

Volume： 29

Page： 379-392

5 . 4 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：87

JCR Journal Grade：1

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 21

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to