Cross-Modal 3D Shape Retrieval via Heterogeneous Dynamic Graph Representation - Details

Author：

Dai, Yue (Dai, Yue.) | Feng, Yifan (Feng, Yifan.) | Ma, Nan (Ma, Nan.) (Scholars：马楠) | Zhao, Xibin (Zhao, Xibin.) | Gao, Yue (Gao, Yue.)

Indexed by：

EI Scopus SCIE

Abstract：

Cross-modal　3D　shape　retrieval　is　a　crucial　and　widely　applied　task　in　the　field　of　3D　vision.　Its　goal　is　to　construct　retrieval　representations　capable　of　measuring　the　similarity　between　instances　of　different　3D　modalities.　However,　existing　methods　face　challenges　due　to　the　performance　bottlenecks　of　single-modal　representation　extractors　and　the　modality　gap　across　3D　modalities.　To　tackle　these　issues,　we　propose　a　Heterogeneous　Dynamic　Graph　Representation　(HDGR)　network,　which　incorporates　context-dependent　dynamic　relations　within　a　heterogeneous　framework.　By　capturing　correlations　among　diverse　3D　objects,　HDGR　overcomes　the　limitations　of　ambiguous　representations　obtained　solely　from　instances.　Within　the　context　of　varying　mini-batches,　dynamic　graphs　are　constructed　to　capture　proximal　intra-modal　relations,　and　dynamic　bipartite　graphs　represent　implicit　cross-modal　relations,　effectively　addressing　the　two　challenges　above.　Subsequently,　message　passing　and　aggregation　are　performed　using　Dynamic　Graph　Convolution　(DGConv)　and　Dynamic　Bipartite　Graph　Convolution　(DBConv),　enhancing　features　through　heterogeneous　dynamic　relation　learning.　Finally,　intra-modal,　cross-modal,　and　self-transformed　features　are　redistributed　and　integrated　into　a　heterogeneous　dynamic　representation　for　cross-modal　3D　shape　retrieval.　HDGR　establishes　a　stable,　context-enhanced,　structure-aware　3D　shape　representation　by　capturing　heterogeneous　inter-object　relationships　and　adapting　to　varying　contextual　dynamics.　Extensive　experiments　conducted　on　the　ModelNet10,　ModelNet40,　and　real-world　ABO　datasets　demonstrate　the　state-of-the-art　performance　of　HDGR　in　cross-modal　and　intra-modal　retrieval　tasks.　Moreover,　under　the　supervision　of　robust　loss　functions,　HDGR　achieves　remarkable　cross-modal　retrieval　against　label　noise　on　the　3D　MNIST　dataset.　The　comprehensive　experimental　results　highlight　the　effectiveness　and　efficiency　of　HDGR　on　cross-modal　3D　shape　retrieval.

Keyword：

Representation learning dynamic graph Noise measurement 3D vision Shape Three-dimensional printing Solid modeling Convolution Correlation Cross modal retrieval heterogeneous graph Point cloud compression representation learning Cross-modal retrieval Bipartite graph

Author Community：

[ 1 ] [Dai, Yue]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China
[ 2 ] [Feng, Yifan]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China
[ 3 ] [Zhao, Xibin]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China
[ 4 ] [Gao, Yue]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China
[ 5 ] [Ma, Nan]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

Reprint Author's Address：

[Zhao, Xibin]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China;;[Gao, Yue]Tsinghua Univ, Sch Software, BNRist, KLISS, Beijing 100084, Peoples R China

Email：

daiyue1225@gmail.com |
evanfeng97@gmail.com |
manan123@bjut.edu.cn |
zxb@tsinghua.edu.cn |
kevin.gaoy@gmail.com

Show more details

Related Keywords：

Transformer-Based Discriminative and Strong Representation Deep Hashing for Cross-Modal Retrieval
2023，IEEE ACCESS
Graph Influence Network
2022，IEEE TRANSACTIONS ON CYBERNETICS
CASCE: A Contrastive Representation Learning Framework for Motor Imagery EEG-Based Unilateral Upper Limb Decoding
2025，IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT
Redundancy Is Not What You Need: An Embedding Fusion Graph Auto-Encoder for Self-Supervised Graph Representation Learning
2024，IEEE Transactions on Neural Networks and Learning Systems

Source ：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

ISSN： 0162-8828

Year： 2025

Issue： 4

Volume： 47

Page： 2370-2387

2 3 . 6 0 0

JCR@2022

Cited Count：

WoS CC Cited Count： 1

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 8

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to