Bridging the Gap between Pre-Training and Fine-Tuning for Commonsense Generation - Details

Author：

Indexed by：

Abstract：

Commonsense　generation　aims　to　generate　a　plausible　sentence　containing　all　given　unordered　concept　words.　Previous　methods　focusing　on　this　task　usually　directly　concatenate　these　words　as　the　input　of　a　pre-trained　language　model　(PLM).　However,　in　PLMs’　pretraining　process,　the　inputs　are　often　corrupted　sentences　with　correct　word　order.　This　input　distribution　discrepancy　between　pre-training　and　fine-tuning　makes　the　model　difficult　to　fully　utilize　the　knowledge　of　PLMs.　In　this　paper,　we　propose　a　two-stage　framework　to　alleviate　this　issue.　Firstly,　in　pre-training　stage,　we　design　a　new　format　of　input　to　endow　PLMs　the　ability　to　deal　with　masked　sentences　with　incorrect　word　order.　Secondly,　during　fine-tuning,　we　insert　the　special　token　[MASK]　between　two　consecutive　concept　words　to　make　the　input　distribution　more　similar　to　the　input　distribution　in　pre-training.　We　conduct　extensive　experiments　and　provide　a　thorough　analysis　to　demonstrate　the　effectiveness　of　our　proposed　method.　The　code　is　available　at　https://github.com/LHRYANG/CommonGen.　©　2023　Association　for　Computational　Linguistics.

Keyword：

Author Community：

[ 1 ] [Yang, Haoran]The Chinese University of Hong Kong, Hong Kong
[ 2 ] [Wang, Yan]Tencent AI Lab., United States
[ 3 ] [Li, Piji]Tencent AI Lab., United States
[ 4 ] [Bi, Wei]Tencent AI Lab., United States
[ 5 ] [Lam, Wai]The Chinese University of Hong Kong, Hong Kong
[ 6 ] [Xu, Chen]Beijing University of Technology, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Source ：

Year： 2023

Page： 376-383

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to