• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Gao, Shang (Gao, Shang.) | Jia, Maoshen (Jia, Maoshen.) | Dong, Ruihai (Dong, Ruihai.)

Indexed by:

SCIE

Abstract:

First Order Ambisonics has attracted significant attention for direction-of-arrival estimation. Combining with the time-frequency analysis techniques, existing methods could realize accurate localization results in most of acoustic scenarios. However, these methods encounter challenges such as energy leakage due to the uncertainty of linear time-frequency transformation, and insufficient localization cues in high-reverberant noisy acoustic environments. This paper addresses these challenges by integrating the localization cues from spectrums with varying time-frequency resolutions. Specifically, by applying analysis window with different sizes, the spectrums with non-redundant localization cues can be generated. Then, a joint mask is designed by combining Hoyer sparsity and inter-channel energy measurements to assess the localization contribution at each point within spectrums. The accurate direct-of-arrival estimation can be achieved by applying the masked DOA cues of T-F points from spectrums with different resolutions. Objective evaluations are conducted in both simulation and actual recording environments. Corresponding results prove that the proposed method could exhibit a superior localization performance than several existing localization estimators.

Keyword:

Direction-of-arrival estimation Multiple signal classification sparsity Microphone arrays Acoustics Estimation time-frequency analysis Location awareness Energy measurement direction-of-arrival estimation Acoustic sensors Accuracy array signal processing Energy resolution Signal resolution

Author Community:

  • [ 1 ] [Gao, Shang]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Jia, Maoshen]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Dong, Ruihai]Univ Coll Dublin, Insight Ctr Data Analyt, Dublin D04V1W8, Ireland

Reprint Author's Address:

  • [Jia, Maoshen]Beijing Univ Technol, Sch Informat Sci & Technol, Beijing 100124, Peoples R China

Show more details

Related Keywords:

Related Article:

Source :

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

ISSN: 1558-7916

Year: 2025

Volume: 33

Page: 1590-1603

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:1152/10990774
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.