Abstract:
In many speech separation methods, the contextual information contained in the feature sequence is mainly modeled by recurrent layers and/or self-attention mechanisms. However, how to combine these two powerful approaches more effectively remains to be explored. In this paper, a recurrent attention with parallel branches is proposed: the attention first fully exploits the contextual information contained in the time-frequency (T-F) features, and this information is then further modeled by recurrent modules in a conventional manner. Specifically, the proposed recurrent attention with parallel branches stacks two attention modules sequentially. Each attention module has two parallel self-attention branches that model dependencies along the two axes, plus one convolutional layer for feature fusion. Thus, the contextual information contained in the T-F features can be fully exploited and further modeled by the recurrent modules. Experimental results show the effectiveness of the proposed method. © 2023 International Speech Communication Association. All rights reserved.
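The sketch below illustrates, in PyTorch, one plausible reading of the architecture described in the abstract: two stacked attention modules, each with parallel self-attention branches over the time and frequency axes fused by a convolution, followed by a conventional recurrent module. It is not the authors' implementation; the class names, layer sizes, and the choices of nn.MultiheadAttention and a bidirectional LSTM are assumptions made purely for illustration.

```python
# Hypothetical sketch of "recurrent attention with parallel branches".
# All module names and hyperparameters are illustrative assumptions.
import torch
import torch.nn as nn


class ParallelBranchAttention(nn.Module):
    """Two parallel self-attention branches (time axis, frequency axis) + conv fusion."""

    def __init__(self, channels: int, n_heads: int = 4):
        super().__init__()
        self.time_attn = nn.MultiheadAttention(channels, n_heads, batch_first=True)
        self.freq_attn = nn.MultiheadAttention(channels, n_heads, batch_first=True)
        self.fuse = nn.Conv2d(2 * channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, time, freq)
        b, c, t, f = x.shape

        # Branch 1: self-attention along the time axis (one sequence per frequency bin).
        xt = x.permute(0, 3, 2, 1).reshape(b * f, t, c)
        xt, _ = self.time_attn(xt, xt, xt)
        xt = xt.reshape(b, f, t, c).permute(0, 3, 2, 1)

        # Branch 2: self-attention along the frequency axis (one sequence per time frame).
        xf = x.permute(0, 2, 3, 1).reshape(b * t, f, c)
        xf, _ = self.freq_attn(xf, xf, xf)
        xf = xf.reshape(b, t, f, c).permute(0, 3, 1, 2)

        # Fuse the two branches with a 1x1 convolution; keep a residual connection.
        return x + self.fuse(torch.cat([xt, xf], dim=1))


class RecurrentAttentionSeparator(nn.Module):
    """Two stacked attention modules followed by a conventional recurrent module."""

    def __init__(self, channels: int = 64, hidden: int = 128):
        super().__init__()
        self.attn1 = ParallelBranchAttention(channels)
        self.attn2 = ParallelBranchAttention(channels)
        self.rnn = nn.LSTM(channels, hidden, batch_first=True, bidirectional=True)
        self.proj = nn.Linear(2 * hidden, channels)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, channels, time, freq)
        x = self.attn2(self.attn1(x))
        b, c, t, f = x.shape
        # Recurrent modeling over time, one sequence per frequency bin.
        seq = x.permute(0, 3, 2, 1).reshape(b * f, t, c)
        seq, _ = self.rnn(seq)
        seq = self.proj(seq)
        return seq.reshape(b, f, t, c).permute(0, 3, 2, 1)


if __name__ == "__main__":
    feats = torch.randn(2, 64, 100, 65)  # (batch, channels, frames, freq bins)
    out = RecurrentAttentionSeparator(channels=64)(feats)
    print(out.shape)  # torch.Size([2, 64, 100, 65])
```

In this reading, the attention stack exposes long-range context along both T-F axes before the recurrent module processes the sequence, which matches the abstract's "first fully exploit, then further model" ordering; the exact fusion and recurrence details would need the paper itself to confirm.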
Source: Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
ISSN: 2308-457X
Year: 2023
Volume: 2023-August
Page: 3794-3798
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0