Details - 北京工业大学机构库

Query：

学者姓名：孔德慧

Refining：

Year

2025 (2)
2022 (2)
2021 (19)
2020 (9)
2019 (13)
2018 (15)
2017 (6)
2016 (6)
2015 (18)
2014 (18)
2013 (12)
2012 (26)
2011 (12)
2010 (7)
2009 (28)
2008 (10)
2007 (15)
2006 (22)
2005 (9)
2004 (8)
2003 (13)
2002 (3)
2001 (1)
2000 (3)

Submit Unfold

Type

期刊论文 (150)
会议论文 (75)
专利 (52)

Submit Unfold

Indexed by

Scopus (130)
EI (116)
CSCD (83)
PKU (81)
CNKI (64)
万方 (62)
CQVIP (54)
incoPat (52)
zhihuiya (52)
SCIE (35)
CPCI-S (33)

Submit Unfold

Source

北京工业大学学报 (32)
Journal of Beijing University of Technology (23)
Journal of Information and Computational Science (15)
系统仿真学报 (7)
IEEE TRANSACTIONS ON MULTIMEDIA (6)
MULTIMEDIA TOOLS AND APPLICATIONS (6)
4th International Conference on Digital Home, ICDH 2012 (5)
5th International Conference on Digital Home (ICDH) (4)
Journal of System Simulation (4)
计算机教育 (4)
International Conference on Neural Networks and Signal Processing (3)
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION (3)
第十七届全国计算机辅助设计与图形学学术会议(CAD/CG’ 2012)暨第九届全国智能CAD与数字娱乐学术会议(CID’ 2012) (3)
10th Pacific Rim Conference on Multimedia (2)
10th Pacific Rim Conference on Multimedia, PCM 2009 (2)
2009 International Conference on Computational Intelligence and Software Engineering, CiSE 2009 (2)
2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年 (2)
APPLIED SCIENCES-BASEL (2)
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING (2)
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS (2)
中国图象图形学报A辑 (2)
图学学报 (2)
第十二届中国虚拟现实大会 (2)
计算机工程与应用 (2)
计算机研究与发展 (2)
11th EAI International Conference on Simulation Tools and Techniques, SIMUTools 2019 (1)
19th International Conference on Advances in Multimedia Modeling, MMM 2013 (1)
1st International Conference on Communications and Information Processing (ICCIP 2012) (1)
1st International Congress on Image and Signal Processing (1)
2004 IEEE International Conference on Multimedia and Expo (ICME) (1)
2008 9th International Conference on Signal Processing, ICSP 2008 (1)
2010国际计算机科学技术与应用论坛 (1)
2012 IEEE International Conference on Computer Science and Automation Engineering, CSAE 2012 (1)
2012 IEEE Symposium on Electrical and Electronics Engineering, EEESYM 2012 (1)
2012 International Conference of Intelligence Computation and Evolutionary Computation, ICEC 2012 (1)
2012 International Conference on Communications and Information Processing, ICCIP 2012 (1)
2012 International Conference on Computer Science and Service System, CSSS 2012 (1)
2013 5th International Conference on Computational and Information Sciences, ICCIS 2013 (1)
2018第12届全国计算机图形学大会Chinagraph 2018 (1)
30th International Conference on Artificial Neural Networks (ICANN) (1)
3rd International Conference on Computer Science and Service System (CSSS) (1)
3rd International Conference on Natural Computation (ICNC 2007) (1)
4th International Conference on Image and Graphics (1)
5th International Conference on Visual Information Engineering, VIE 2008 (1)
6th International Conference on Advanced Language Processing and Web Information Technology (1)
6th International Conference on Digital Home, ICDH 2016 (1)
6th World Congress on Intelligent Control and Automation (1)
6th World Congress on Intelligent Control and Automation, WCICA 2006 (1)
7th International Conference on Digital Home (ICDH) (1)
8th IEEE International Conference on Computer and Information Technology (1)
8th International Conference on Signal Processing (1)
8th International Conference on Signal Processing, ICSP 2006 (1)
9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON) (1)
9th IEEE Annual Information Technology, Electronics and Mobile Communication Conference, IEMCON 2018 (1)
9th International Conference on Signal Processing (1)
9th Pacific Conference on Computer Graphics and Applications, Pacific Graphics 2001 (1)
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (1)
CHINA COMMUNICATIONS (1)
Conference on Computer Graphics, Imaging and Visualisation (1)
Frontiers of Information Technology & Electronic Engineering (1)
IADIS International Conference Computer Graphics, Visualization, Computer Vision and Image Processing 2011, Part of the IADIS Multi Conference on Computer Science and Information Systems 2011, MCCSIS 2011 (1)
IEEE International Conference on Acoustics, Speech, and Signal Processing (1)
IEEE International Conference on Multimedia and Expo (ICME 2007) (1)
IEEE International Conference on Multimedia and Expo (ICME) (1)
IEEE TRANSACTIONS ON CYBERNETICS (1)
IEEE TRANSACTIONS ON IMAGE PROCESSING (1)
IEEE/WIC International Conference on Web Intelligence (WI 2003) (1)
IET COMPUTER VISION (1)
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING (1)
International Conference of Intelligence Computation and Evolutionary Computation (ICEC 2012) (1)
International Conference on Computational-Intelligence and Security (1)
International Conference on Computer Science and Software Engineering, CSSE 2008 (1)
International Conference on Software Technology and Engineering (1)
International Journal of Advancements in Computing Technology (1)
International Journal of Digital Multimedia Broadcasting (1)
International Journal of Simulation: Systems, Science and Technology (1)
International Workshop on Information and Electronics Engineering (IWIEE) / International Conference on Information, Computing and Telecommunications (ICICT) (1)
JOURNAL ON MULTIMODAL USER INTERFACES (1)
Journal of Fiber Bioengineering and Informatics (1)
Journal of Optoelectronics Laser (1)
Journal of Software (1)
Journal of South China University of Technology (Natural Science) (1)
MATHEMATICAL PROBLEMS IN ENGINEERING (1)
MULTIMEDIA SYSTEMS (1)
NEUROCOMPUTING (1)
Proceedings - IEEE International Conference on Acoustics, Speech, and Signal Processing (1)
SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES (1)
SIAM JOURNAL ON DISCRETE MATHEMATICS (1)
VISUAL COMPUTER (1)
WIRELESS NETWORKS (1)
中国图象图形学报 (1)
中国科学F辑 (1)
中国科学（F辑:信息科学） (1)
中国科学：信息科学 (1)
光电子·激光 (1)
全国第16届计算机科学与技术应用（CACIS）学术会议 (1)
华南理工大学学报(自然科学版) (1)
孔德慧 (1)
微计算机信息 (1)
电脑应用技术 (1)
第一届中国情感计算及智能交互学术会议 (1)
第六届全国几何设计与计算学术会议 (1)
第十五届中国虚拟现实大会暨虚拟现实与可视化技术国际会议 (1)
计算机学报 (1)
计算机工程 (1)
计算机应用 (1)
软件学报 (1)
郑州大学学报(理学版) (1)

Submit Unfold

Complex

First Author (43)
Reprint Author (8)
First Comm (79)
Reprint Comm (79)

Submit Unfold

Co-Author

尹宝才 (131)
Yin, Baocai (80)
王立春 (46)
王少帆 (42)
李敬华 (36)
Wang, Shaofan (29)
Zhang, Yong (29)
张勇 (23)
Yin, Bao-Cai (22)
Wang, Lichun (20)
Li, Jinghua (19)
Shi, Yunhui (15)
YIN Bao-cai (13)
孙艳丰 (13)
王玉萍 (8)
蔡鹏 (8)
Wang, Ru (7)
Li, Xin (6)
Yin, BC (6)
王雁来 (6)
高荣华 (6)
Cai, Peng (5)
Ding, Wenpeng (5)
Sun, Bin (5)
Sun, Yanfeng (5)
Wang, Li-Chun (5)
Yin Baocai (5)
Yin, B.-C. (5)
施云惠 (5)
薛娟 (5)
Lu, Tailong (4)
Xue, Juan (4)
司慧琳 (4)
孙磊 (4)
张楠 (4)
肖小芳 (4)
郭金铜 (4)
Du, Xiaohui (3)
Huo, Yi (3)
Ji, Peng-Fei (3)
Liu, Caixia (3)
Si, Huilin (3)
Sun, Lei (3)
Wang, Pengcheng (3)
Wang, Yuping (3)
Wu, Qianjun (3)
Xiao, Xiao-Fang (3)
Zang, Yuding (3)
季鹏飞 (3)
徐振华 (3)
杜晓晖 (3)
淮华瑞 (3)
王文通 (3)
田鹏宇 (3)
胡永利 (3)
荣子豪 (3)
谷春亮 (3)
赵欣欣 (3)
郭荆玮 (3)
闫会霞 (3)
霍奕 (3)
Gao, Junbin (2)
Gao, Ronghua (2)
Guo, Jing-Wei (2)
Guo, Jin-Tong (2)
Huang, Qingming (2)
Jing, Guodong (2)
Kang, Liang (2)
Kuang, Yun (2)
Li, Qianxing (2)
Li, Yan (2)
Rong, Zihao (2)
Si, Hui-Lin (2)
Sun, Xiaowei (2)
Tian, Pengyu (2)
Wang, Ke (2)
Wang Lichun (2)
Wang, YL (2)
Wang, Zhiyong (2)
Wu, Yongpeng (2)
Xue, J. (2)
Xu, Zhen-Hua (2)
Yang, Guang-Wei (2)
Zhang, Juan (2)
Zhang, Y. (2)
Zhang, Yang (2)
ZHANG Yong (2)
Zhao, Xinxin (2)
信建佳 (2)
刘媛媛 (2)
刘彩霞 (2)
刘蓬燕 (2)
张雯晖 (2)
李文超 (2)
李爽 (2)
林菁 (2)
王文东 (2)
王玉田 (2)
王珂 (2)
王茹 (2)
胡玉杰 (2)
许梦文 (2)
谭斐 (2)
贾思宇 (2)
马淑燕 (2)
Bai, Zhuowei (1)
Baocai, Yin (1)
BaoCai, Yin (1)
Cai, P. (1)
CAI Peng (1)
Chen, Dongpan (1)
Cheng, Shi-Quan (1)
Chen Ran (1)
Chen, T.-B. (1)
Deng, Zhengjie (1)
Du, Xiao-Hui (1)
Gao, R.-H. (1)
Gao, Rong-Hua (1)
Gu, Chun-Liang (1)
Guo, Yaxin (1)
Huai, Huarui (1)
Huang, W.-J. (1)
Huang, Yaoda (1)
Hu, YL (1)
Hu, Yong-Li (1)
Jia, XB (1)
Jia, Xibin (1)
Jinghua, Li (1)
Jin, Wei (1)
Li, Chun (1)
Li, Chunjing (1)
Lichun, Wang (1)
Li, Jiazhen (1)
Li jinghua (1)
Li, Jing-Hua (1)
Li, Lanxiao (1)
Li, Li-Yan (1)
Li, Mine (1)
Li, Shuang (1)
Li, Shuo (1)
Liu, Honglin (1)
Liu, Panbiao (1)
Liu, Wentao (1)
Liu, WT (1)
Li, Wenchao (1)
Li, Xuelong (1)
Lu, Bo-Xue (1)
Luo, XiaoNan (1)
Nan, Z (1)
Qin, Xu-Guo (1)
RongHua, Gao (1)
Roth, Hubert (1)
Ruan, Xiaogang (1)
Shen, Bowei (1)
Shi, Lina (1)
Shi, Yun-Hui (1)
Si, H.-L. (1)
Song, Cai-Fang (1)
Sun, Bo (1)
Wang, Huai-Bin (1)
Wang, Jin (1)
Wang, K (1)
Wang lichun (1)
Wang, Peng-Tao (1)
Wang, Renhong (1)
Wang Ru (1)
Wang, Shao-fan (1)
Wang, WD (1)
Wang, Wen-Dong (1)
Wang wentong (1)
Wang, Xiaotian (1)
Wang, Yanlai (1)
Wang, Yan Lai (1)
Wang, Yufei (1)
Wang, Yu-Ping (1)
Wang, Yu-Tian (1)
Wang, Zhen (1)
Wen, Wen (1)
Wu, S.-N. (1)
Xia, Ting-Ting (1)
Xin, Li (1)
Xin, Yongjia (1)
Xiong, Ruiqin (1)
XUE Juan (1)
Xu, Min (1)
Yan, Huixia (1)
Yin baocai (1)
Yin, Bao-cai (1)
Yin, BaoCai (1)
Yin, Bao Cai (1)
Yingxin, Xing (1)
Yong Zhang (1)
Yue, Wenying (1)
Yue, WY (1)
Yu, Yilan (1)
Zhang, J. (1)
Zhang, Nan (1)
Zhang, Wenhui (1)
Zhang, Xiangwu (1)
Zhang, Zhen (1)
Zheng, Chong-Yu (1)
Zhu, Weijia (1)
于沁杨 (1)
代晋玮 (1)
冯会晓 (1)
刘洋 (1)
刘洪林 (1)
刘润泽 (1)
北京工业大学学报 (1)
吕博学 (1)
吴思宁 (1)
吴鑫 (1)
夏婷婷 (1)
孙彬 (1)
孙文胜 (1)
孙晓伟 (1)
孙杰 (1)
孙首乙 (1)
宋彩芳 (1)
岳文颖 (1)
崔洁 (1)
左琳 (1)
巩林昊 (1)
康亮 (1)
张娟 (1)
张彬 (1)
张洋 (1)
朱江 (1)
朱碧焓 (1)
李丽岩 (1)
李倩星 (1)
李学龙 (1)
李新海 (1)
李燕 (1)
李素琴 (1)
杨光伟 (1)
段学浩 (1)
汪洋 (1)
沈伯伟 (1)
王国良 (1)
王志勇 (1)
王怀彬 (1)
王振 (1)
王鹏涛 (1)
石丽娜 (1)
秦旭果 (1)
程世铨 (1)
程可 (1)
蒋春燕 (1)
虞义兰 (1)
贾文浩 (1)
辛永佳 (1)
邢迎新 (1)
邬玉洁 (1)
邵广翠 (1)
邹自强 (1)
郑重雨 (1)
闫鹏飞 (1)
陈晟 (1)
陈通波 (1)
靳威 (1)
马春玲 (1)
马胜蕾 (1)
高宁 (1)
高明 (1)
黄万军 (1)

Submit Unfold

Language

Chinese (159)
English (117)
Other (1)

Submit

Clean All

Select All Sort by：

Default

Default
Title
Year
WOS Cited Count
Impact factor
Ascending
Descending

< Page ，Total 28 >

3d human pose estimation based on conditional dual-branch diffusion SCIE

期刊论文 | 2025 , 31 (1) | MULTIMEDIA SYSTEMS

Abstract&Keyword Cite

Abstract ：

Thanks to the development of 2D keypoint detectors, monocular 3D human pose estimation (HPE) via 2D-to-3D lifting approaches have achieved remarkable improvements. However, monocular 3D HPE is still a challenging problem due to the inherent depth ambiguities and occlusions. Recently, diffusion models have achieved great success in the field of image generation. Inspired by this, we transform 3D human pose estimation problem into a reverse diffusion process, and propose a dual-branch diffusion model so as to handle the indeterminacy and uncertainty of 3D pose and fully explore the global and local correlations between joints. Furthermore, we propose conditional dual-branch diffusion model to enhance the performance of 3D human pose estimation, in which the joint-level semantic information are regarded as the condition of the diffusion model, and integrated into the joint-level representations of 2D pose to enhance the expression of joints. The proposed method is verified on two widely used datasets and the experimental results have demonstrated the superiority.

Keyword ：

Human pose estimation Human pose estimation Diffusion model Diffusion model Joint semantics Joint semantics Dual-branch Dual-branch

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Li, Jinghua , Bai, Zhuowei , Kong, Dehui et al. 3d human pose estimation based on conditional dual-branch diffusion [J]. \| MULTIMEDIA SYSTEMS , 2025 , 31 (1) .
MLA	Li, Jinghua et al. "3d human pose estimation based on conditional dual-branch diffusion" . \| MULTIMEDIA SYSTEMS 31 . 1 (2025) .
APA	Li, Jinghua , Bai, Zhuowei , Kong, Dehui , Chen, Dongpan , Li, Qianxing , Yin, Baocai . 3d human pose estimation based on conditional dual-branch diffusion . \| MULTIMEDIA SYSTEMS , 2025 , 31 (1) .
Export to	NoteExpress RIS BibTex

MMF-Net: A novel multi-feature and multi-level fusion network for 3D human pose estimation SCIE

期刊论文 | 2025 , 19 (1) | IET COMPUTER VISION

Li, Qianxing | Kong, Dehui | Li, Jinghua | Yin, Baocai

Abstract&Keyword Cite

Abstract ：

Human pose estimation based on monocular video has always been the focus of research in the human computer interaction community, which suffers mainly from depth ambiguity and self-occlusion challenges. While the recently proposed learning-based approaches have demonstrated promising performance, they do not fully explore the complementarity of features. In this paper, the authors propose a novel multi-feature and multi-level fusion network (MMF-Net), which extracts and combines joint features, bone features and trajectory features at multiple levels to estimate 3D human pose. In MMF-Net, firstly, the bone length estimation module and the trajectory multi-level fusion module are used to extract the geometric size information of the human body and multi-level trajectory information of human motion, respectively. Then, the fusion attention-based combination (FABC) module is used to extract multi-level topological structure information of the human body, and effectively fuse topological structure information, geometric size information and trajectory information. Extensive experiments show that MMF-Net achieves competitive results on Human3.6M, HumanEva-I and MPI-INF-3DHP datasets.

Keyword ：

image processing image processing pose estimation pose estimation computer vision computer vision image reconstruction image reconstruction

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Li, Qianxing , Kong, Dehui , Li, Jinghua et al. MMF-Net: A novel multi-feature and multi-level fusion network for 3D human pose estimation [J]. \| IET COMPUTER VISION , 2025 , 19 (1) .
MLA	Li, Qianxing et al. "MMF-Net: A novel multi-feature and multi-level fusion network for 3D human pose estimation" . \| IET COMPUTER VISION 19 . 1 (2025) .
APA	Li, Qianxing , Kong, Dehui , Li, Jinghua , Yin, Baocai . MMF-Net: A novel multi-feature and multi-level fusion network for 3D human pose estimation . \| IET COMPUTER VISION , 2025 , 19 (1) .
Export to	NoteExpress RIS BibTex

一种视频语义结构信息辅助的弱监督时序动作定位方法

会议论文 | 2022 | 2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年

孔德慧 | 许梦文 | 李敬华 | 王少帆 | 尹宝才

Abstract&Keyword Cite

Abstract ：

弱监督时序动作定位任务的目标是在只有视频级标签的情况下,对未分割的视频中的动作进行分类和时序上的定位。目前基于神经网络模型的方法,大多训练分类器以预测视频片段级的类别分数,再融合其为视频级的类别分数。这些方法只关注视频的视觉特征,却忽视了视频语义结构信息。为进一步提升视频动作定位的质量,本文提出了一种视频语义结构信息辅助的弱监督时序动作定位方法。该方法首先以分类模块作为基础模型,然后基于视频在时序结构上的稀疏性和语义连续性等辅助信息设计一种平滑注意力模块,修正分类结果;另外,加入视频片段级语义标签预测模块,改善弱监督标签信息不充足问题;最后将三个模块共同训练以融合提升时序动作定位的精度。通过在THUMOS14和ActivityNet数据集上的实验,表明本文方法的性能指标明显优于目前现有方法。

Keyword ：

语义结构信息语义结构信息伪标签伪标签注意力值注意力值动作定位动作定位弱监督弱监督

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	孔德慧 , 许梦文 , 李敬华 et al. 一种视频语义结构信息辅助的弱监督时序动作定位方法 [C] //2021中国自动化大会论文集 . 2022 .
MLA	孔德慧 et al. "一种视频语义结构信息辅助的弱监督时序动作定位方法" 2021中国自动化大会论文集 . (2022) .
APA	孔德慧 , 许梦文 , 李敬华 , 王少帆 , 尹宝才 . 一种视频语义结构信息辅助的弱监督时序动作定位方法 2021中国自动化大会论文集 . (2022) .
Export to	NoteExpress RIS BibTex

HPGCN: Hierarchical poselet-guided graph convolutional network for 3D pose estimation SCIE

期刊论文 | 2022 , 487 , 243-256 | NEUROCOMPUTING

Wu, Yongpeng | Kong, Dehui | Wang, Shaofan | Li, Jinghua | Yin, Baocai

WoS CC Cited Count： 15

Abstract&Keyword Cite

Abstract ：

3D pose estimation remains a challenging task since human poses exhibit high ambiguity and multigranularity. Traditional graph convolution networks (GCNs) accomplish the task by modeling all skeletons as an entire graph, and are unable to fuse combinable part-based features. By observing that human movements occur due to part of human body (i.e. related skeletons and body components, known as the poselet) and those poselets contribute to each movement in a hierarchical fashion, we propose a hierarchical poselet-guided graph convolutional network (HPGCN) for 3D pose estimation from 2D poses. HPGCN sets five primitives of human body as basic poselets, and constitutes high-level poselets according to the kinematic configuration of human body. Moreover, HPGCN forms a fundamental unit by using a diagonally dominant graph convolution layer and a non-local layer, which corporately capture the multi-granular feature of human poses from local to global perspective. Finally HPGCN designs a geometric constraint loss function with constraints on lengths and directions of bone vectors, which help produce reasonable pose regression. We verify the effectiveness of HPGCN on three public 3D human pose benchmarks. Experimental results show that HPGCN outperforms several state-of-the-art methods. (c) 2021 Elsevier B.V.

Keyword ：

Graph convolutional network Graph convolutional network Hierarchical poselet Hierarchical poselet Geometric constraint Geometric constraint Diagonally dominant graph convolution Diagonally dominant graph convolution Pose estimation Pose estimation

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Wu, Yongpeng , Kong, Dehui , Wang, Shaofan et al. HPGCN: Hierarchical poselet-guided graph convolutional network for 3D pose estimation [J]. \| NEUROCOMPUTING , 2022 , 487 : 243-256 .
MLA	Wu, Yongpeng et al. "HPGCN: Hierarchical poselet-guided graph convolutional network for 3D pose estimation" . \| NEUROCOMPUTING 487 (2022) : 243-256 .
APA	Wu, Yongpeng , Kong, Dehui , Wang, Shaofan , Li, Jinghua , Yin, Baocai . HPGCN: Hierarchical poselet-guided graph convolutional network for 3D pose estimation . \| NEUROCOMPUTING , 2022 , 487 , 243-256 .
Export to	NoteExpress RIS BibTex

一种视频语义结构信息辅助的弱监督时序动作定位方法

会议论文 | 2021 | 2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年

孔德慧 | 许梦文 | 李敬华 | 王少帆 | 尹宝才

Abstract&Keyword Cite

Abstract ：

Keyword ：

语义结构信息语义结构信息弱监督弱监督动作定位动作定位注意力值注意力值伪标签伪标签

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	孔德慧 , 许梦文 , 李敬华 et al. 一种视频语义结构信息辅助的弱监督时序动作定位方法 [C] //2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年 . 2021 .
MLA	孔德慧 et al. "一种视频语义结构信息辅助的弱监督时序动作定位方法" 2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年 . (2021) .
APA	孔德慧 , 许梦文 , 李敬华 , 王少帆 , 尹宝才 . 一种视频语义结构信息辅助的弱监督时序动作定位方法 2021中国自动化大会——中国自动化学会60周年会庆暨纪念钱学森诞辰110周年 . (2021) .
Export to	NoteExpress RIS BibTex

Real-Time Human Action Recognition Using Locally Aggregated Kinematic-Guided Skeletonlet and Supervised Hashing-by-Analysis Model SCIE

期刊论文 | 2021 , 52 (6) , 4837-4849 | IEEE TRANSACTIONS ON CYBERNETICS

Sun, Bin | Wang, Shaofan | Kong, Dehui | Wang, Lichun | Yin, Baocai

WoS CC Cited Count： 6

Abstract&Keyword Cite

Abstract ：

3-D action recognition is referred to as the classification of action sequences which consist of 3-D skeleton joints. While many research works are devoted to 3-D action recognition, it mainly suffers from three problems: 1) highly complicated articulation; 2) a great amount of noise; and 3) low implementation efficiency. To tackle all these problems, we propose a real-time 3-D action-recognition framework by integrating the locally aggregated kinematic-guided skeletonlet (LAKS) with a supervised hashing-by-analysis (SHA) model. We first define the skeletonlet as a few combinations of joint offsets grouped in terms of the kinematic principle and then represent an action sequence using LAKS, which consists of a denoising phase and a locally aggregating phase. The denoising phase detects the noisy action data and adjusts it by replacing all the features within it with the features of the corresponding previous frame, while the locally aggregating phase sums the difference between an offset feature of the skeletonlet and its cluster center together over all the offset features of the sequence. Finally, the SHA model combines sparse representation with a hashing model, aiming at promoting the recognition accuracy while maintaining high efficiency. Experimental results on MSRAction3D, UTKinectAction3D, and Florence3DAction datasets demonstrate that the proposed method outperforms state-of-the-art methods in both recognition accuracy and implementation efficiency.

Keyword ：

Joints Joints skeletonlet skeletonlet sparse representation sparse representation skeleton joints skeleton joints Solid modeling Solid modeling Feature extraction Feature extraction Computational modeling Computational modeling Kinematics Kinematics Action recognition Action recognition Real-time systems Real-time systems hashing hashing Analytical models Analytical models

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Sun, Bin , Wang, Shaofan , Kong, Dehui et al. Real-Time Human Action Recognition Using Locally Aggregated Kinematic-Guided Skeletonlet and Supervised Hashing-by-Analysis Model [J]. \| IEEE TRANSACTIONS ON CYBERNETICS , 2021 , 52 (6) : 4837-4849 .
MLA	Sun, Bin et al. "Real-Time Human Action Recognition Using Locally Aggregated Kinematic-Guided Skeletonlet and Supervised Hashing-by-Analysis Model" . \| IEEE TRANSACTIONS ON CYBERNETICS 52 . 6 (2021) : 4837-4849 .
APA	Sun, Bin , Wang, Shaofan , Kong, Dehui , Wang, Lichun , Yin, Baocai . Real-Time Human Action Recognition Using Locally Aggregated Kinematic-Guided Skeletonlet and Supervised Hashing-by-Analysis Model . \| IEEE TRANSACTIONS ON CYBERNETICS , 2021 , 52 (6) , 4837-4849 .
Export to	NoteExpress RIS BibTex

Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition SCIE

期刊论文 | 2021 , 15 (2) | ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

Sun, Bin | Kong, Dehui | Wang, Shaofan | Wang, Lichun | Yin, Baocai

WoS CC Cited Count： 8

Abstract&Keyword Cite

Abstract ：

Multi-view human action recognition remains a challenging problem due to large view changes. In this article, we propose a transfer learning-based framework called transferable dictionary learning and view adaptation (TDVA) model for multi-view human action recognition. In the transferable dictionary learning phase, TDVA learns a set of view-specific transferable dictionaries enabling the same actions from different views to share the same sparse representations, which can transfer features of actions from different views to an intermediate domain. In the view adaptation phase, TDVA comprehensively analyzes global, local, and individual characteristics of samples, and jointly learns balanced distribution adaptation, locality preservation, and discrimination preservation, aiming at transferring sparse features of actions of different views from the intermediate domain to a common domain. In other words, TDVA progressively bridges the distribution gap among actions from various views by these two phases. Experimental results on IXMAS, ACT4(2), and NUCLA action datasets demonstrate that TDVA outperforms state-of-the-art methods.

Keyword ：

sparse representation sparse representation Action recognition Action recognition transfer learning transfer learning multi-view multi-view

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Sun, Bin , Kong, Dehui , Wang, Shaofan et al. Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition [J]. \| ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA , 2021 , 15 (2) .
MLA	Sun, Bin et al. "Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition" . \| ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA 15 . 2 (2021) .
APA	Sun, Bin , Kong, Dehui , Wang, Shaofan , Wang, Lichun , Yin, Baocai . Joint Transferable Dictionary Learning and View Adaptation for Multi-view Human Action Recognition . \| ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA , 2021 , 15 (2) .
Export to	NoteExpress RIS BibTex

DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction SCIE

期刊论文 | 2021 , 23 , 2843-2856 | IEEE TRANSACTIONS ON MULTIMEDIA

Liu, Caixia | Kong, Dehui | Wang, Shaofan | Li, Jinghua | Yin, Baocai

WoS CC Cited Count： 11

Abstract&Keyword Cite

Abstract ：

Although deep networks based methods outperform traditional 3D reconstruction methods which require multiocular images or class labels to recover the full 3D geometry, they may produce incomplete recovery and unfaithful reconstruction when facing occluded parts of 3D objects. To address these issues, we propose Depth-preserving Latent Generative Adversarial Network (DLGAN) which consists of 3D Encoder-Decoder based GAN (EDGAN, serving as a generator and a discriminator) and Extreme Learning Machine (ELM, serving as a classifier) for 3D reconstruction from a monocular depth image of an object. Firstly, EDGAN decodes a latent vector from the 2.5D voxel grid representation of an input image, and generates the initial 3D occupancy grid under common GAN losses, a latent vector loss and a depth loss. For the latent vector loss, we design 3D deep AutoEncoder (AE) to learn a target latent vector from ground truth 3D voxel grid and utilize the vector to penalize the latent vector encoded from the input 2.5D data. For the depth loss, we utilize the input 2.5D data to penalize the initial 3D voxel grid from 2.5D views. Afterwards, ELM transforms float values of the initial 3D voxel grid to binary values under a binary reconstruction loss. Experimental results show that DLGAN not only outperforms several state-of-the-art methods by a large margin on both a synthetic dataset and a real-world dataset, but also predicts more occluded parts of 3D objects accurately without class labels.

Keyword ：

depth loss depth loss monocular depth image monocular depth image Three-dimensional displays Three-dimensional displays 3D reconstruction 3D reconstruction Transforms Transforms ELM ELM Image reconstruction Image reconstruction Shape Shape Generative adversarial networks Generative adversarial networks latent vector latent vector Two dimensional displays Two dimensional displays Gallium nitride Gallium nitride

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	Liu, Caixia , Kong, Dehui , Wang, Shaofan et al. DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction [J]. \| IEEE TRANSACTIONS ON MULTIMEDIA , 2021 , 23 : 2843-2856 .
MLA	Liu, Caixia et al. "DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction" . \| IEEE TRANSACTIONS ON MULTIMEDIA 23 (2021) : 2843-2856 .
APA	Liu, Caixia , Kong, Dehui , Wang, Shaofan , Li, Jinghua , Yin, Baocai . DLGAN: Depth-Preserving Latent Generative Adversarial Network for 3D Reconstruction . \| IEEE TRANSACTIONS ON MULTIMEDIA , 2021 , 23 , 2843-2856 .
Export to	NoteExpress RIS BibTex

基于时空上下文模型的RGB-D序列目标跟踪方法 CSCD

期刊论文 | 2021 , 47 (03) , 224-230 | 北京工业大学学报

孔德慧 | 荣子豪 | 贾思宇 | 王少帆 | 尹宝才

Abstract&Keyword Cite

Abstract ：

为了实现更为精确的视频目标跟踪,提出一种以时空上下文模型为基础的RGB-D序列目标跟踪算法.通过引入更新模板的深度信息,该模型精准地区分了输入序列的目标区域与背景区域,实现了深度权值和颜色权值的融合;基于目标序列的深度及目标动量计算,该模型有效地实现了尺度更新与遮挡处理.通过在RGB-D图像序列数据集上的详细实验评估,该时空上下文模型相对于其他先进的同类方法表现出更好的性能.因此,该方法实现了更为精确可靠的视频目标跟踪.

Keyword ：

目标跟踪目标跟踪机器学习机器学习 RGB-D RGB-D 目标动量目标动量计算机视觉计算机视觉时空上下文时空上下文

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	孔德慧 , 荣子豪 , 贾思宇 et al. 基于时空上下文模型的RGB-D序列目标跟踪方法 [J]. \| 北京工业大学学报 , 2021 , 47 (03) : 224-230 .
MLA	孔德慧 et al. "基于时空上下文模型的RGB-D序列目标跟踪方法" . \| 北京工业大学学报 47 . 03 (2021) : 224-230 .
APA	孔德慧 , 荣子豪 , 贾思宇 , 王少帆 , 尹宝才 . 基于时空上下文模型的RGB-D序列目标跟踪方法 . \| 北京工业大学学报 , 2021 , 47 (03) , 224-230 .
Export to	NoteExpress RIS BibTex

基于时空上下文模型的RGB-D序列目标跟踪方法 CQVIP

期刊论文 | 2021 , 47 (3) , 224-230 | 孔德慧

孔德慧 | 荣子豪 | 贾思宇 | 王少帆 | 尹宝才 | 北京工业大学学报

Abstract&Keyword Cite

Abstract ：

基于时空上下文模型的RGB-D序列目标跟踪方法

Keyword ：

目标跟踪目标跟踪时空上下文时空上下文计算机视觉计算机视觉 RGB-D RGB-D 目标动量目标动量机器学习机器学习

Cite：

Copy from the list or Export to your reference management。

GB/T 7714	孔德慧 , 荣子豪 , 贾思宇 et al. 基于时空上下文模型的RGB-D序列目标跟踪方法 [J]. \| 孔德慧 , 2021 , 47 (3) : 224-230 .
MLA	孔德慧 et al. "基于时空上下文模型的RGB-D序列目标跟踪方法" . \| 孔德慧 47 . 3 (2021) : 224-230 .
APA	孔德慧 , 荣子豪 , 贾思宇 , 王少帆 , 尹宝才 , 北京工业大学学报 . 基于时空上下文模型的RGB-D序列目标跟踪方法 . \| 孔德慧 , 2021 , 47 (3) , 224-230 .
Export to	NoteExpress RIS BibTex

10| 20| 50 per page

< Page ，Total 28 >

Type
Departments

All Years Choose Year From to