SimEmotion: A Simple Knowledgeable Prompt Tuning Method for Image Emotion Classification - Details

Author：

Indexed by：

CPCI-S EI Scopus

Abstract：

Image　emotion　classification　is　an　important　computer　vision　task　to　extract　emotions　from　images.　The　state-of-the-art　methods　for　image　emotion　classification　are　primarily　based　on　proposing　new　architectures　and　fine-tuning　them　on　pre-trained　Convolutional　Neural　Networks.　Recently,　learning　transferable　visual　models　from　natural　language　supervision　has　shown　great　success　in　zero-shot　settings　due　to　the　easily　accessible　web-scale　training　data,　i.e.,　CLIP.　In　this　paper,　we　present　a　conceptually　simple　while　empirically　powerful　framework　for　supervised　image　emotion　classification,　SimEmotion,　to　effectively　leverage　the　rich　image　and　text　semantics　entailed　in　CLIP.　Specifically,　we　propose　a　prompt-based　fine-tuning　strategy　to　learn　task-specific　representations　while　preserving　knowledge　contained　in　CLIP.　As　image　emotion　classification　tasks　lack　text　descriptions,　sentiment-level　concept　and　entity-level　information　are　introduced　to　enrich　text　semantics,　forming　knowledgeable　prompts　and　avoiding　considerable　bias　introduced　by　fixed　designed　prompts,　further　improving　the　model’s　ability　to　distinguish　emotion　categories.　Evaluations　on　four　widely-used　affective　datasets,　namely,　Flickr　and　Instagram　(FI),　EmotionROI,　Twitter　I,　and　Twitter　II,　demonstrate　that　the　proposed　algorithm　outperforms　the　state-of-the-art　methods　to　a　large　margin　(i.e.,　5.27%　absolute　accuracy　gain　on　FI)　on　image　emotion　classification　tasks.　©　2022,　The　Author(s),　under　exclusive　license　to　Springer　Nature　Switzerland　AG.

Keyword：

Semantics Visual languages Image classification Image enhancement Classification (of information) Large dataset Text processing Social networking (online) Convolutional neural networks

Author Community：

[ 1 ] [Deng, Sinuo]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Shi, Ge]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 3 ] [Wu, Lifang]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 4 ] [Xing, Lehao]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 5 ] [Hu, Wenjin]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 6 ] [Zhang, Heng]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 7 ] [Xiang, Ye]Faculty of Information Technology, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Image-text dual neural network with decision strategy for small-sample image classification
2019，Neurocomputing
Landform Image Classification Based on Discrete Cosine Transformation and Deep Network
2018，Acta Optica Sinica
Learning effective representations from sparse mutlimodal data on content curation social networks
2019，19th IEEE International Conference on Data Mining Workshops, ICDMW 2019
Topic Modeling for Short Texts Via Dual View Collaborate optimization
2022，7th IEEE International Conference on Data Science in Cyberspace, DSC 2022

Source ：

ISSN： 0302-9743

Year： 2022

Volume： 13247 LNCS

Page： 222-229

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 8

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 15

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to