Learning effective representations from sparse mutlimodal data on content curation social networks - Details

Author：

Indexed by：

Abstract：

Content　curation　social　networks　(CCSNs),　which　provide　users　a　platform　to　share　their　interests　by　multimedia　information,　are　the　most　rapidly　growing　social　networks　in　recent　years.　Since　large-scale　multimodal　data　have　been　generated　by　CCSN　users,　learning　multimodal　representations　for　contents　have　become　the　key　to　the　progress　of　many　applications　such　as　user　interest　analysis　and　recommender　system　for　curation　networks.　Learning　representations　for　CCSNs　faces　a　vital　challenge:　the　sparsity　of　multimodal　data.　It　is　difficult　for　most　existing　approaches　to　learn　effective　representations　for　multimodal　CCSNs　because　they　didn＇t　provide　a　solution　on　how　to　model　sparse　and　noisy　multimodal　data.　In　this　paper,　we　propose　a　2-step　approach　to　learn　accurate　multimodal　representations　from　sparse　multimodal　data.　First,　we　propose　a　novel　Board-Image-Word　(BIW)　graph　to　model　the　multimodal　data.　Benefited　from　the　unique　board-image　relation　on　CCSNs,　embeddings　of　images　and　texts　which　endow　semantic　relations　are　learned　from　the　network　topology　of　the　BIW　graph.　As　the　second　step,　a　deep　vision　model　with　modified　loss　function　are　trained　by　minimizing　the　distance　between　the　visual　features　of　contents　and　their　corresponding　semantic　relation　embeddings　to　learn　representations　which　incorporate　visual　information　and　graph-based　semantic　relations.　Experiments　on　the　dataset　from　Huaban.com　demonstrate　that　under　the　circumstance　of　sparser　text　modality,　our　method　significantly　outperformed　multimodal　DBN,　DBM　and　unimodal　representation　learning　methods　on　pin　classification　and　board　recommendation　tasks.　©　2019　IEEE.

Keyword：

Semantics Embeddings Classification (of information) Text processing Modal analysis Deep learning Social networking (online) Data mining Learning systems Topology Graphic methods

Author Community：

[ 1 ] [Wu, Lifang]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 2 ] [Yang, Bowen]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 3 ] [Jian, Meng]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 4 ] [Zhang, Xiuzhen]School of Computer Science and Information Technology, RMIT University, Melbourne, Australia
[ 5 ] [Zhang, Heng]Faculty of Information Technology, Beijing University of Technology, Beijing, China
[ 6 ] [Liu, Xu]Faculty of Information Technology, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Topic Modeling for Short Texts Via Dual View Collaborate optimization
2022，7th IEEE International Conference on Data Science in Cyberspace, DSC 2022
An Entity Linking Method Based on Entity Category and Word Embedding
2019，2019 3rd International Conference on Data Mining, Communications and Information Technology, DMCIT 2019
SimEmotion: A Simple Knowledgeable Prompt Tuning Method for Image Emotion Classification
2022，27th International Conference on Database Systems for Advanced Applications, DASFAA 2022
MDVT: A Multi-modal Fake News Detection Framework based on Vision Transformer
2023，6th International Conference on Machine Learning and Natural Language Processing, MNLP 2023

Source ：

ISSN： 2375-9232

Year： 2019

Volume： 2019-November

Page： 665-672

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 14

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to