• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Li, Ruwei (Li, Ruwei.) | Li, Tao (Li, Tao.) | Sun, Xiaoyue (Sun, Xiaoyue.) | Yang, Dengcai (Yang, Dengcai.) | Wang, Qi (Wang, Qi.)

Indexed by:

EI PKU CSCD

Abstract:

The performance of the existing target localization algorithms is not ideal in complex acoustic environment. In order to improve this problem, a novel target binaural sound localization algorithm is presented. First, the algorithm uses binaural spectral features as input of a time-frequency units selector based on deep learning. Then, to reduce the negative impact of the time-frequency unit belonging to noise on the localization accuracy, the selector is emploied to select the reliable time-frequency units from binaural input sound signal. At the same time, a Deep Neural Network (DNN)-based localization system maps the binaural cues of each time-frequency unit to the azimuth posterior probability. Finally, the target localization is completed according to the azimuth posterior probability belonging to the reliable time-frequency units. Experimental results show that the performance of the proposed algorithm is better than comparison algorithms and achieves a significant improvement in target localization accuracy in low Signal-to-Noise Ratio(SNR) and various reverberation environments, especially when there is noise similar to the target sound source. © 2019, Science Press. All right reserved.

Keyword:

Signal to noise ratio Deep neural networks Acoustic generators Deep learning Acoustic noise

Author Community:

  • [ 1 ] [Li, Ruwei]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Li, Tao]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 3 ] [Sun, Xiaoyue]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Yang, Dengcai]Institute of Science and Technology Development, Beijing University of Technology, Beijing; 100124, China
  • [ 5 ] [Wang, Qi]Laboratory of Speech and Audio Signal Processing and Institute of Artificial Intelligence, Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address:

  • [li, ruwei]laboratory of speech and audio signal processing and institute of artificial intelligence, faculty of information technology, beijing university of technology, beijing; 100124, china

Show more details

Related Keywords:

Related Article:

Source :

Journal of Electronics and Information Technology

ISSN: 1009-5896

Year: 2019

Issue: 12

Volume: 41

Page: 2932-2938

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 13

Online/Total:823/10670953
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.