Abstract:
Domain adaptation is an effective solution for inadequate translation performance in specific domains. However, naively mixing data from multiple domains to obtain a multi-domain neural machine translation (NMT) model can give rise to parameter interference between domains, degrading overall performance. To address this, we introduce a multi-domain adaptive NMT method that learns domain-specific sub-layer latent variables, employing the Gumbel-Softmax reparameterization technique to jointly train the model parameters and the domain-specific sub-layer latent variables. This approach learns private domain-specific knowledge while sharing common domain-invariant knowledge, effectively mitigating the parameter interference problem. Experimental results show that our proposed method improves over the baseline model by up to 7.68 and 3.71 BLEU on the English-German and Chinese-English public multi-domain datasets, respectively. Copyright © 2024 held by the owner/author(s). Publication rights licensed to ACM.
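The abstract's core trick is the Gumbel-Softmax reparameterization, which lets discrete latent choices (here, which sub-layer a domain uses) be relaxed into differentiable samples so they can be trained jointly with model parameters. Below is a minimal NumPy sketch of the sampling step only; the function name, temperature value, and the two-option "sub-layer choice" example are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def gumbel_softmax(logits, tau=1.0, rng=None):
    """Sample a relaxed (differentiable) one-hot vector via Gumbel-Softmax.

    logits: unnormalized log-probabilities over discrete options.
    tau:    temperature; lower values push samples closer to one-hot.
    """
    rng = np.random.default_rng() if rng is None else rng
    # Gumbel(0, 1) noise: g = -log(-log(U)), U ~ Uniform(0, 1)
    u = rng.uniform(low=1e-12, high=1.0, size=np.shape(logits))
    g = -np.log(-np.log(u))
    z = (np.asarray(logits, dtype=float) + g) / tau
    e = np.exp(z - z.max())  # numerically stable softmax
    return e / e.sum()

# Illustrative use: a soft choice between two candidate sub-layers
# for one domain (weights sum to 1 and are differentiable w.r.t. logits).
weights = gumbel_softmax(np.log([0.7, 0.3]), tau=0.5)
```

In a full model, such relaxed weights would gate candidate sub-layers during training, and the temperature is typically annealed so the selection hardens toward a discrete routing decision.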
Source:
ACM Transactions on Asian and Low-Resource Language Information Processing
ISSN: 2375-4699
Year: 2024
Issue: 6
Volume: 23
Impact Factor: 2.000 (JCR@2022)
ESI Highly Cited Papers on the List: 0