Binaural Target Sound Source Localization Based on Time-frequency Units Selection

Author：

Li, Ruwei | Li, Tao | Sun, Xiaoyue | Yang, Dengcai | Wang, Qi

Indexed by：

EI PKU CSCD

Abstract：

The performance of existing target localization algorithms is unsatisfactory in complex acoustic environments. To address this problem, a novel binaural target sound localization algorithm is presented. First, the algorithm uses binaural spectral features as the input of a deep-learning-based time-frequency unit selector. Then, to reduce the negative impact of noise-dominated time-frequency units on localization accuracy, the selector is employed to select the reliable time-frequency units from the binaural input signal. At the same time, a Deep Neural Network (DNN)-based localization system maps the binaural cues of each time-frequency unit to an azimuth posterior probability. Finally, the target is localized according to the azimuth posterior probabilities of the reliable time-frequency units. Experimental results show that the proposed algorithm outperforms the comparison algorithms and achieves a significant improvement in target localization accuracy in low Signal-to-Noise Ratio (SNR) and various reverberant environments, especially when there is noise similar to the target sound source. © 2019, Science Press. All rights reserved.
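
The final step described in the abstract (localizing from the azimuth posteriors of the reliable time-frequency units only) can be illustrated with a minimal sketch. The abstract does not state how the posteriors are combined, so the rule below (summing log-posteriors over the selected units and taking the arg-max) is an assumption, as are the function name `localize_target`, the `threshold` parameter, and the toy data. The selector and the DNN themselves are not modeled; their outputs are simply passed in as nested lists.

```python
import math
from typing import Sequence


def localize_target(
    posteriors: Sequence[Sequence[Sequence[float]]],
    reliability: Sequence[Sequence[float]],
    azimuths: Sequence[float],
    threshold: float = 0.5,
) -> float:
    """Pick the azimuth with the highest summed log-posterior over reliable units.

    posteriors[t][f][k]: azimuth posterior of candidate k for frame t, channel f
                         (stand-in for the output of the DNN localization system).
    reliability[t][f]:   selector score for the same unit (hypothetical format).
    threshold:           units scoring below this are discarded (assumed rule).
    """
    eps = 1e-12  # avoids log(0) for zero-valued posteriors
    scores = [0.0] * len(azimuths)
    n_selected = 0
    for frame_post, frame_rel in zip(posteriors, reliability):
        for unit_post, rel in zip(frame_post, frame_rel):
            if rel < threshold:
                continue  # unit judged to be dominated by noise: skip it
            n_selected += 1
            for k, p in enumerate(unit_post):
                scores[k] += math.log(p + eps)
    if n_selected == 0:
        raise ValueError("no reliable time-frequency units selected")
    best = max(range(len(azimuths)), key=scores.__getitem__)
    return azimuths[best]


# Toy data: 2 frames x 2 frequency channels x 3 candidate azimuths (degrees).
azimuths = [-30.0, 0.0, 30.0]
posteriors = [
    [[0.1, 0.8, 0.1], [0.9, 0.05, 0.05]],
    [[0.2, 0.7, 0.1], [0.9, 0.05, 0.05]],
]
# The second channel in each frame is marked unreliable by the (assumed) selector.
reliability = [[0.9, 0.1], [0.8, 0.2]]

print(localize_target(posteriors, reliability, azimuths))       # selection applied
print(localize_target(posteriors, reliability, azimuths, 0.0))  # all units kept
```

In this toy case the unreliable units point strongly at -30 degrees, so keeping every unit pulls the estimate toward the interfering source, while discarding them leaves the estimate at 0 degrees. This is only meant to show why unit selection can matter, not to reproduce the paper's results.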

Keyword：

Signal to noise ratio; Deep neural networks; Acoustic generators; Deep learning; Acoustic noise

Author Affiliations：

[ 1 ] [Li, Ruwei] Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Li, Tao] Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Sun, Xiaoyue] Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 4 ] [Yang, Dengcai] Institute of Science and Technology Development, Beijing University of Technology, Beijing; 100124, China
[ 5 ] [Wang, Qi] Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

[Li, Ruwei] Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Email：

liruwei@bjut.edu.cn

Related Articles：

Deep learning for binaural sound source localization with low signal-to-noise ratio
2021, 2020 International Symposium on Automation, Information and Computing, ISAIC 2020
Super-resolution reconstruction based on compressed sensing and deep learning model
2016, 2016 International Conference on Communication and Electronics Systems, ICCES 2016
IRM with phase parameterization for speech enhancement
2019, 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2019
Physical-Layer Adversarial Robustness for Deep Learning-Based Semantic Communications
2023, IEEE Journal on Selected Areas in Communications

Source ：

Journal of Electronics and Information Technology

ISSN： 1009-5896

Year： 2019

Issue： 12

Volume： 41

Page： 2932-2938

Affiliated Colleges：

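School of Information Science and Technology (data not explicitly attributed to a specific unit within this school/faculty)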