• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zhao, Yuhao (Zhao, Yuhao.) | Jia, Maoshen (Jia, Maoshen.) | Ru, Jiawei (Ru, Jiawei.) | Tai, Junqi (Tai, Junqi.)

Indexed by:

EI

Abstract:

In this paper, we proposed a novel low bitrate neural speech codec based on sequence modeling networks. The proposed method consists of a convolution-based encoder and decoder, a DFSMN-Mamba module, and a vector quantizer. In the proposed method, a DFSMN-Mamba module is designed by combining Deep Feedforward Sequential Memory Network (DFSMN) with selective state space model Mamba, which is used to model the input features in parallel in both time and frequency dimensions. An adversarial loss is used to train the entire codec framework, which enables compression of speech waveforms into compact discrete representations at low bitrates. Experimental results show that the proposed method achieves better performance than the baseline in both subjective and objective evaluation. ©2024 IEEE.

Keyword:

Feedforward neural networks Audio signal processing Network coding Vector quantization Deep neural networks

Author Community:

  • [ 1 ] [Zhao, Yuhao]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Jia, Maoshen]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 3 ] [Ru, Jiawei]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 4 ] [Tai, Junqi]Beijing-Dublin International College, Beijing University of Technology, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

Year: 2024

Page: 41-45

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

Affiliated Colleges:

Online/Total:719/10708451
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.