• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Ru, Jiawei (Ru, Jiawei.) | Jia, Maoshen (Jia, Maoshen.) | Zhao, Yuhao (Zhao, Yuhao.) | Tao, Liang (Tao, Liang.)

Indexed by:

EI

Abstract:

In this paper, we propose a neural speech coding method based on the dual-path conformer, which mainly consists of three steps: (1) the encoding and decoding of the time-frequency spectrum are performed by a structure that combines the CNN and the dual-path conformer, (2) residual vector quantization is employed to quantize the output features of encoder and form a compact discrete representation, and (3) multi-period and multi-scale discriminators are used to improve the perceptual quality of speech during adversarial training. Experimental results, from both subjective and objective evaluations, demonstrate that the proposed codec outperforms the state-of-the-art neural codec AudioDEC and the leading conventional codec Opus in terms of performance. ©2024 IEEE.

Keyword:

Audio signal processing Network coding Speech enhancement Quantization (signal) Vector quantization

Author Community:

  • [ 1 ] [Ru, Jiawei]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Jia, Maoshen]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 3 ] [Zhao, Yuhao]School of Information Science and Technology, Beijing University of Technology, Beijing, China
  • [ 4 ] [Tao, Liang]School of Information Science and Technology, Beijing University of Technology, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Year: 2024

Page: 661-665

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 8

Affiliated Colleges:

Online/Total:644/10700369
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.