Query:
Scholar name: Zhang Jing
Abstract :
Domain adaptation object detection (DAOD) uses the labeled data of one scene (i.e., the source domain) and the unlabeled data of another, unfamiliar scene (i.e., the target domain) to train a cross-domain object detector. Most existing methods align the overall feature distribution through adversarial adaptation. Despite their success, these methods are primarily designed for two-stage detectors that are challenging to deploy, which limits their practical application. In addition, owing to the instability of adversarial domain-discriminator training, it is difficult to induce the detector to extract instance-level domain-invariant features and align the overall distribution using an adversarial adaptive strategy alone. To address these issues, we propose a new cross-domain object detection framework based on the You Only Look Once (YOLO) series of algorithms, named Disentanglement Representation YOLO (DRY). The method achieves feature disentanglement in the channel and spatial dimensions through domain-invariant feature disentanglement (DIFD) and instance-level feature disentanglement (ILFD) modules, respectively, prompting the detector to extract domain-invariant features. Experiments demonstrate that our model outperforms existing methods. It achieves an average accuracy of 42.7 on the Cityscapes to FoggyCityscapes benchmark and significantly outperforms all other methods on human and car objects. The average accuracy values of 49.0 and 49.5 achieved on the SIM10K to Cityscapes and KITTI to Cityscapes scenarios, respectively, are also superior to those of existing methods. Extensive experimental results on various datasets verify that the proposed DRY method is effective and widely applicable. The code is available at https://github.com/BJUTsipl/DRY.
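For readers unfamiliar with the adversarial adaptive strategy that the abstract builds on and critiques, the sketch below shows a standard gradient-reversal domain discriminator used for feature alignment in many DAOD methods. It is a generic baseline, not DRY's DIFD/ILFD modules; all layer sizes and names are illustrative assumptions.

```python
# Sketch of a gradient-reversal domain discriminator (generic adversarial
# baseline, not DRY's DIFD/ILFD modules); names and sizes are illustrative.
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        # Reversed gradients push the backbone toward domain-invariant features
        # while the discriminator learns to tell source from target.
        return -ctx.lambd * grad_output, None

class DomainDiscriminator(nn.Module):
    def __init__(self, in_dim=256, lambd=1.0):
        super().__init__()
        self.lambd = lambd
        self.net = nn.Sequential(
            nn.Linear(in_dim, 256), nn.ReLU(inplace=True), nn.Linear(256, 1))

    def forward(self, feat):                      # feat: (N, in_dim) pooled features
        rev = GradReverse.apply(feat, self.lambd)
        return self.net(rev)                      # domain logits (source vs. target)
```

A binary cross-entropy loss on these logits, summed with the detection loss, gives the usual adversarial alignment objective whose training instability the abstract refers to.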
Keyword :
Object detection; Cross-domain detection; Unsupervised domain adaptation; Disentangled representation learning
Cite:
GB/T 7714: Li, Jiafeng, Zhi, Mengxun, Zheng, Yongyu, et al. Coarse-to-fine domain adaptation object detection with feature disentanglement [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025.
MLA: Li, Jiafeng, et al. "Coarse-to-fine domain adaptation object detection with feature disentanglement." INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS (2025).
APA: Li, Jiafeng, Zhi, Mengxun, Zheng, Yongyu, Zhuo, Li, Zhang, Jing. Coarse-to-fine domain adaptation object detection with feature disentanglement. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2025.
Abstract :
Oriented object detection (OOD) in remote sensing images (RSIs) is of increasing interest. Since RSIs often contain many oriented objects, it is valuable and challenging to discover the geometric invariance of geospatial objects to improve the model's perception of rotation angle and scale. In this paper, we propose a twin-tower detector (T2Det) for OOD in RSIs. Specifically, T2Det overcomes the challenges posed by the angles and scales of oriented objects by developing a self-supervised (SS) branch that exploits geometric invariance on top of the main branch. We then design a twin-tower (T2) loss function to enhance the network's ability to perceive the geometric invariance of geospatial objects, where a coarse loss function and a fine loss function are introduced for the two branches to optimize the model from coarse to fine. In addition, a T2 loss optimization strategy based on global or refinement modes is developed to achieve a trade-off between the main branch and the SS branch. On three benchmark datasets, VEDAI, HRSC2016, and NUAA-SIRST, our T2Det achieves competitive performance of 85.15% and 90.66% mAP and 99.28 P_d, respectively, without unnecessary extra features.
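As a rough illustration of how a self-supervised branch can exploit geometric invariance, the sketch below penalizes feature discrepancy between an image and a rotated copy. It is a generic rotation-consistency objective with assumed names, not the T2 loss defined in the paper.

```python
# Generic rotation-consistency objective (assumed names; not the paper's T2 loss).
import torch
import torch.nn.functional as F

def rotation_consistency_loss(backbone, images, k=1):
    """images: (N, 3, H, W); backbone returns (N, C) embeddings."""
    feats = backbone(images)
    rotated = torch.rot90(images, k, dims=(2, 3))   # rotate by k * 90 degrees
    feats_rot = backbone(rotated)
    # 0 when features are rotation-invariant, larger otherwise.
    return 1.0 - F.cosine_similarity(feats, feats_rot, dim=1).mean()
```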
Keyword :
geometric invariance; twin-tower detector; Remote sensing images; oriented object detection; self-supervised learning
Cite:
GB/T 7714: Wang, Liuqian, Zhang, Jing, Li, Jiafeng, et al. T2Det: twin-tower detector with geometric invariance for oriented object detection [J]. REMOTE SENSING LETTERS, 2025, 16(5): 494-505.
MLA: Wang, Liuqian, et al. "T2Det: twin-tower detector with geometric invariance for oriented object detection." REMOTE SENSING LETTERS 16.5 (2025): 494-505.
APA: Wang, Liuqian, Zhang, Jing, Li, Jiafeng, Zhuo, Li. T2Det: twin-tower detector with geometric invariance for oriented object detection. REMOTE SENSING LETTERS, 2025, 16(5), 494-505.
Abstract :
Graph convolutional networks (graph models for short) are crucial for understanding model decisions through mathematical white-box interpretation, which can radically improve the performance and credibility of downstream artificial intelligence applications. To address the limitations of existing interpretability work on over-smoothing and over-squashing, we propose an explainable graph model based on nonlinear catastrophe theory and apply it to group activity recognition to validate the usefulness of interpretability. (1) We introduce catastrophe theory to explore the internal processes of graph models and construct explainable dynamical equations of the graph convolutional network; (2) because graph node features that lose uniqueness lead to over-smoothing, which reduces the discriminative power of the graph model, we propose a mathematical method to predict over-smoothing; (3) to counter over-squashing, in which node feature values are excessively compressed, we design a channel expansion unit that extends the transmission paths of graph nodes and alleviates over-squashing in the graph structure. Finally, we apply our model to group activity recognition tasks to capture complex interactions within groups. We obtain competitive results on five publicly available graph-structure datasets (Actor, Chameleon, Texas, Cornell, Cora) and our self-built group activity dataset. Our model can effectively capture node- and graph-level features with stronger generalization capabilities. For complex and diverse real-world group activity data, our model offers intuitive graph-level explanations for group activity analysis. Through the analysis of over-smoothing and over-squashing, our method extends new theoretical approaches in explainable artificial intelligence.
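A common way to make over-smoothing measurable, which is the phenomenon this paper predicts with catastrophe theory, is to track how similar node features become after each layer. The sketch below is such a generic diagnostic, not the paper's dynamical-equation analysis; the function name is an assumption.

```python
# Generic over-smoothing diagnostic: mean pairwise cosine distance of node
# features after a layer (not the paper's catastrophe-theory predictor).
import torch
import torch.nn.functional as F

def mean_pairwise_distance(node_feats):
    """node_feats: (N, C) node embeddings produced by one GCN layer."""
    x = F.normalize(node_feats, dim=1)
    sim = x @ x.t()                                  # (N, N) cosine similarities
    n = x.size(0)
    off_diag = sim.sum() - sim.diag().sum()
    return 1.0 - off_diag / (n * (n - 1))            # approaches 0 as features collapse
```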
Keyword :
Explainable; Group activity recognition; Graph convolutional network; Over-smoothing; Catastrophe theory
Cite:
GB/T 7714: Kang, Junpeng, Zhang, Jing, Chen, Lin, et al. Explainable graph convolutional network based on catastrophe theory and its application to group activity recognition [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 150.
MLA: Kang, Junpeng, et al. "Explainable graph convolutional network based on catastrophe theory and its application to group activity recognition." ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE 150 (2025).
APA: Kang, Junpeng, Zhang, Jing, Chen, Lin, Zhang, Hui, Zhuo, Li. Explainable graph convolutional network based on catastrophe theory and its application to group activity recognition. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 150.
Abstract :
The rampant use of forgery techniques poses a significant threat to the security of celebrities' identities. Although current deepfake detection methods have shown effectiveness on specific public face forgery datasets, their reliability diminishes when applied to open data. Moreover, these methods are susceptible to re-compression and rely mainly on pixel-level abnormalities in forged faces. In this study, we present a novel approach to detecting face forgery by leveraging individual speaking patterns of facial expressions and head movements. Our method utilizes potential motion patterns and inter-frame variations to effectively differentiate between fake and real videos. We propose an end-to-end dual-branch detection network, named the spatial-temporal transformer (STT), which aims to safeguard the identity of the person-of-interest (POI) from deepfaking. The STT incorporates a spatial transformer (ST) to establish the connection between facial expressions and head movements, while a temporal transformer (TT) exploits inconsistencies in facial attribute changes. Additionally, we introduce a central compression loss to enhance detection performance. Extensive experiments demonstrate the effectiveness of the STT and its superiority over other state-of-the-art (SOTA) methods in detecting forged videos involving POIs. Furthermore, our network exhibits resilience to pixel-level re-compression perturbations, making it a robust solution in the face of evolving forgery techniques.
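The exact form of the central compression loss is not given in the abstract; the sketch below shows one plausible center-loss-style reading, in which features of real videos are compressed toward a learned center so fakes fall outside the compact cluster. All names, dimensions, and the label convention are assumptions.

```python
# One plausible reading of a "central compression" objective (assumption):
# real-video features are pulled toward a learned center; labels: 1 = real, 0 = fake.
import torch
import torch.nn as nn

class CentralCompressionLoss(nn.Module):
    def __init__(self, feat_dim=512):
        super().__init__()
        self.center = nn.Parameter(torch.zeros(feat_dim))

    def forward(self, feats, labels):
        real = feats[labels == 1]                    # (M, feat_dim)
        if real.numel() == 0:
            return feats.new_zeros(())
        return ((real - self.center) ** 2).sum(dim=1).mean()
```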
Keyword :
Central compression loss; Speaking pattern; Person-of-interest; Spatial-temporal transformer; Deepfake video detection; Eye gaze
Cite:
GB/T 7714: Lu, Dingyu, Liu, Zihou, Zhang, Dongming, et al. Spatial-temporal transformer network for protecting person-of-interest from deepfaking [J]. MULTIMEDIA SYSTEMS, 2025, 31(1).
MLA: Lu, Dingyu, et al. "Spatial-temporal transformer network for protecting person-of-interest from deepfaking." MULTIMEDIA SYSTEMS 31.1 (2025).
APA: Lu, Dingyu, Liu, Zihou, Zhang, Dongming, Zhang, Jing, Jin, Guoqing. Spatial-temporal transformer network for protecting person-of-interest from deepfaking. MULTIMEDIA SYSTEMS, 2025, 31(1).
Abstract :
Group activity recognition can remarkably improve the understanding of video content by analyzing human behaviors and activities in videos. We propose a random walk graph convolutional network (RWGCN) for group activity recognition. (1) Considering the limitations of the convolutional structure in capturing the visual information of group activities, a position feature extraction module is used to compensate for the loss of visual information. (2) A graph convolutional network (GCN) with distance-adaptive edge relations is constructed, using individuals as graph nodes, to identify the intrinsic relationships among the individuals in group activities. (3) A Lévy flight random walk mechanism is introduced into the GCN to obtain information from different nodes and integrate the previous position information to recognize group activity. Extensive experiments on the publicly available CAD and CAE datasets and the self-built BJUT-GAD dataset show that our RWGCN achieves MPCA of 95.49%, 94.82%, and 96.02%, respectively, making it more competitive than other group activity recognition methods.
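To illustrate the Lévy flight idea in a graph setting, the sketch below samples a heavy-tailed hop count and propagates node features accordingly, so occasional long jumps complement ordinary one-hop aggregation. The distribution choice and parameters are assumptions, not the paper's formulation.

```python
# Lévy-flight-style propagation (parameters are assumptions): a heavy-tailed
# hop count occasionally makes long jumps instead of only 1-hop updates.
import torch

def levy_flight_propagate(x, adj_norm, alpha=1.5, max_hops=5):
    """x: (N, C) node features; adj_norm: (N, N) normalized adjacency."""
    hops = int(torch.distributions.Pareto(1.0, alpha).sample().clamp(max=max_hops).item())
    out = x
    for _ in range(hops):
        out = adj_norm @ out                         # one hop of neighbourhood aggregation
    return out
```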
Keyword :
Random walk; Graph convolutional network; Group activity recognition; Levy flight; Position information
Cite:
GB/T 7714: Kang, Junpeng, Zhang, Jing, Chen, Lin, et al. RWGCN: Random walk graph convolutional network for group activity recognition [J]. APPLIED INTELLIGENCE, 2025, 55(6).
MLA: Kang, Junpeng, et al. "RWGCN: Random walk graph convolutional network for group activity recognition." APPLIED INTELLIGENCE 55.6 (2025).
APA: Kang, Junpeng, Zhang, Jing, Chen, Lin, Zhang, Hui, Zhuo, Li. RWGCN: Random walk graph convolutional network for group activity recognition. APPLIED INTELLIGENCE, 2025, 55(6).
Abstract :
Standardized regulation of livestreaming is an important element of cyberspace governance. Temporal action localization (TAL) localizes the occurrence of specific actions to better understand human activities. Because of the short duration and inconspicuous boundaries of human-specific actions, obtaining sufficient labeled training data from untrimmed livestreaming is very cumbersome. The point-supervised approach requires only a single-frame annotation for each action instance and can effectively balance cost and performance. We therefore propose a memory knowledge propagation network (MKP-Net) for point-supervised temporal action localization in livestreaming, in which (1) a plug-and-play memory module is introduced to model prototype features of foreground actions and background knowledge using point-level annotations, (2) a memory knowledge propagation mechanism is used to generate discriminative feature representations in a multi-instance learning pipeline, and (3) localization completeness learning is performed by designing a dual optimization loss for refining and localizing temporal actions. Experimental results show that our method achieves state-of-the-art results of 61.4% and 49.1% on the THUMOS14 and self-built BJUT-PTAL datasets, respectively, with an inference speed of 711 FPS.
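A minimal sketch of a prototype memory of the kind the abstract describes: snippet features are scored against stored foreground/background prototypes, and the resulting scores can feed a multi-instance learning head. The prototype count and feature size are placeholder assumptions.

```python
# Minimal prototype memory (sizes are placeholders): snippet features are
# scored against learnable action/background prototypes.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PrototypeMemory(nn.Module):
    def __init__(self, num_prototypes=21, feat_dim=2048):
        super().__init__()
        self.prototypes = nn.Parameter(torch.randn(num_prototypes, feat_dim))

    def forward(self, snippet_feats):
        """snippet_feats: (T, feat_dim) features of one untrimmed video."""
        sim = F.normalize(snippet_feats, dim=1) @ F.normalize(self.prototypes, dim=1).t()
        return sim                                   # (T, num_prototypes) similarity scores
```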
Keyword :
Memory knowledge propagation; Point-supervised; Livestreaming; Dual optimization loss; Temporal action localization
Cite:
GB/T 7714: Chen, Lin, Zhang, Jing, Zhang, Yian, et al. MKP-Net: Memory knowledge propagation network for point-supervised temporal action localization in livestreaming [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248.
MLA: Chen, Lin, et al. "MKP-Net: Memory knowledge propagation network for point-supervised temporal action localization in livestreaming." COMPUTER VISION AND IMAGE UNDERSTANDING 248 (2024).
APA: Chen, Lin, Zhang, Jing, Zhang, Yian, Kang, Junpeng, Zhuo, Li. MKP-Net: Memory knowledge propagation network for point-supervised temporal action localization in livestreaming. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 248.
Abstract :
Livestreaming platforms attract many active streamers and daily users, and their public opinion power poses a major challenge to network regulation. Video scene understanding can improve the efficiency and quality of network regulation, and video instance segmentation is a fundamental task for scene understanding. Given the presence of small, dense instances and fast-changing scenes in livestreaming scenarios, we propose Gp3Former, a Gaussian prior tri-cascaded Transformer for video instance segmentation. First, the Mask2Former-VIS encoder is used to enhance the representation of video features at different scales for small-instance segmentation. Then, a tri-cascaded Transformer decoder is designed to adapt to the fast-changing scenes in livestreaming; it extracts global, balanced, and local instance features while sacrificing as little scene information as possible. Finally, to cope with the dense instances in livestreaming, a Gaussian prior is imposed during instance association and segmentation to learn the Gaussian distribution of a series of cross-frame instances. The experimental results show that, with an inference efficiency of 19.6 FPS, the proposed method reaches 50.6% AP and 50.0% AR on YouTube-VIS 2019 and 82.9% AP and 82.3% AR on the self-built BJUT-LSD, demonstrating effective and superior video instance segmentation in livestreaming scenarios.
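The sketch below shows one way a Gaussian prior can be imposed during cross-frame instance association: appearance similarity is re-weighted by a Gaussian of the center displacement between frames, reflecting that the same instance rarely moves far between adjacent frames. The exact formulation in Gp3Former may differ; sigma and the normalization are assumptions.

```python
# Gaussian re-weighting of cross-frame appearance similarity (sigma is an assumption).
import torch

def gaussian_weighted_association(sim, centers_t, centers_t1, sigma=0.1):
    """sim: (M, N) appearance similarity; centers_*: (M, 2) and (N, 2) normalized box centers."""
    dist2 = ((centers_t[:, None, :] - centers_t1[None, :, :]) ** 2).sum(-1)   # (M, N)
    prior = torch.exp(-dist2 / (2 * sigma ** 2))     # near 1 for small displacement
    return sim * prior                               # feed into Hungarian/greedy matching
```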
Keyword :
Livestreaming; tri-cascaded; video scenarios; Gaussian prior; video instance segmentation
Cite:
GB/T 7714: Li, Wensheng, Zhang, Jing, Zhuo, Li. Gp3Former: Gaussian Prior Tri-Cascaded Transformer for Video Instance Segmentation in Livestreaming Scenarios [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 9(1): 770-784.
MLA: Li, Wensheng, et al. "Gp3Former: Gaussian Prior Tri-Cascaded Transformer for Video Instance Segmentation in Livestreaming Scenarios." IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE 9.1 (2024): 770-784.
APA: Li, Wensheng, Zhang, Jing, Zhuo, Li. Gp3Former: Gaussian Prior Tri-Cascaded Transformer for Video Instance Segmentation in Livestreaming Scenarios. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 9(1), 770-784.
Abstract :
Since large-scale annotation of streamer actions is expensive, training with generic action data is a practical approach. Nevertheless, the spatiotemporal differences between generic actions and streamer actions decrease recognition accuracy. Domain adaptation utilizes labeled data from both the source domain and the target domain to mitigate the performance degradation on target-domain data, but it relies on (1) the feature distribution of each category satisfying the clustering assumption and (2) the distributions of features of the same category in different domains having minimal discrepancy. Considering that streamer action recognition in live video does not meet these assumptions, we propose a domain adaptation method with optimized feature distribution for streamer action recognition in live video. The method generates diverse features for each sample through a style transfer module and then uses the proposed metric learning loss to constrain the features within a similar feature space so that the above assumptions are satisfied. The experimental results show that our method achieves an accuracy of 86.35%, exceeding the SOTA by 4.71%, and an inference speed of 1500 FPS, making it capable of performing streamer action recognition in live video.
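As an illustration of the metric-learning constraint described above, the sketch below pulls together features of the same action class (across domains and style-transferred variants) and pushes apart different classes. It is a generic contrastive form with an assumed margin, not the paper's exact loss.

```python
# Generic class-contrastive constraint (assumed margin), not the paper's exact loss.
import torch
import torch.nn.functional as F

def class_contrastive_loss(feats, labels, margin=1.0):
    """feats: (N, C) embeddings of source, target, and stylized samples; labels: (N,)."""
    d = torch.cdist(feats, feats)                    # (N, N) pairwise distances
    same = labels[:, None] == labels[None, :]
    eye = torch.eye(len(labels), dtype=torch.bool, device=feats.device)
    pos = d[same & ~eye].mean() if (same & ~eye).any() else d.new_zeros(())
    neg = F.relu(margin - d[~same]).mean() if (~same).any() else d.new_zeros(())
    return pos + neg                                 # pull same class together, push others apart
```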
Keyword :
Optimized feature distribution; Action recognition; Live video; Domain adaptation; Streamer
Cite:
GB/T 7714: He, Chen, Zhang, Jing, Chen, Lin, et al. Domain adaptation with optimized feature distribution for streamer action recognition in live video [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 16(1): 107-125.
MLA: He, Chen, et al. "Domain adaptation with optimized feature distribution for streamer action recognition in live video." INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS 16.1 (2024): 107-125.
APA: He, Chen, Zhang, Jing, Chen, Lin, Zhang, Hui, Zhuo, Li. Domain adaptation with optimized feature distribution for streamer action recognition in live video. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 16(1), 107-125.
Abstract :
The detection of unknown objects is a challenging task in computer vision because, although real-world object categories are diverse, existing object-detection training sets cover a limited number of categories. Most existing approaches use two-stage networks to improve a model's ability to characterize objects of unknown classes, which leads to slow inference. To address this issue, we propose a single-stage unknown object detection method based on the contrastive language-image pre-training (CLIP) model and pseudo-labeling, called CLIP-YOLO. First, a visual-language embedding alignment method is introduced, and a channel-grouped enhanced coordinate attention module is embedded into the YOLO-series detection head and feature-enhancing component to improve the model's ability to characterize and detect unknown category objects. Second, pseudo-label generation is optimized based on the CLIP model to expand the diversity of the training set and enhance coverage of unknown object categories. We validated this method on four challenging datasets: MSCOCO, ILSVRC, Visual Genome, and PASCAL VOC. The results show that our method achieves higher accuracy and faster speed, yielding better unknown object detection performance. The source code is available at https://github.com/BJUTsipl/CLIP-YOLO.
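CLIP-based pseudo-labeling can be prototyped with the public openai/CLIP package roughly as below: each candidate region crop is scored against class-name prompts, and confident matches become pseudo-labels for detector training. The prompt template and threshold are assumptions, and the paper's actual pipeline may differ.

```python
# Prototype of CLIP-based pseudo-labeling with the public openai/CLIP package;
# the prompt template and threshold are assumptions.
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

def pseudo_label(crop: Image.Image, class_names, threshold=0.5):
    image = preprocess(crop).unsqueeze(0).to(device)
    text = clip.tokenize([f"a photo of a {c}" for c in class_names]).to(device)
    with torch.no_grad():
        logits_per_image, _ = model(image, text)     # (1, num_classes)
        probs = logits_per_image.softmax(dim=-1)[0]
    score, idx = probs.max(dim=0)
    # Keep only confident matches as pseudo-labels for training.
    return (class_names[idx], score.item()) if score >= threshold else (None, score.item())
```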
Keyword :
Single-stage; Pseudo-labeling; Zero-shot detection; CLIP
Cite:
GB/T 7714: Li, Jiafeng, Sun, Shengyao, Zhang, Kang, et al. Single-stage zero-shot object detection network based on CLIP and pseudo-labeling [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 16(2): 1055-1070.
MLA: Li, Jiafeng, et al. "Single-stage zero-shot object detection network based on CLIP and pseudo-labeling." INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS 16.2 (2024): 1055-1070.
APA: Li, Jiafeng, Sun, Shengyao, Zhang, Kang, Zhang, Jing, Zhuo, Li. Single-stage zero-shot object detection network based on CLIP and pseudo-labeling. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 16(2), 1055-1070.
Abstract :
High-resolution remote sensing images (HR-RSIs) exhibit a strong dependency between geospatial objects and the background. Given the complex spatial structure and multiscale objects in HR-RSIs, how fully spatial information is mined directly determines the quality of semantic segmentation. In this paper, we focus on a Spatial-specific Transformer with involution for semantic segmentation of HR-RSIs. First, we integrate a spatial-specific involution branch with a self-attention branch to form a Spatial-specific Transformer backbone that produces multilevel features with global and spatial information without additional parameters. Then, we introduce multiscale feature representation with large window attention into the Swin Transformer to capture multiscale contextual information. Finally, we add a geospatial feature supplement branch to the semantic segmentation decoder to mitigate the loss of semantic information caused by down-sampling the multiscale features of geospatial objects. Experimental results demonstrate that our method achieves competitive semantic segmentation performance of 87.61% and 80.08% mIoU on the Potsdam and Vaihingen datasets, respectively.
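For reference, the involution operator that the spatial-specific branch builds on (Li et al., CVPR 2021) can be sketched as below: a per-position kernel is generated from the local feature and applied to the unfolded neighborhood, sharing weights across channels within each group. The hyper-parameters are illustrative, and the coupling with the self-attention branch is not shown.

```python
# Minimal involution layer (stride 1), after Li et al., CVPR 2021; hyper-parameters
# are illustrative and the coupling with self-attention is not shown.
import torch
import torch.nn as nn

class Involution(nn.Module):
    def __init__(self, channels, kernel_size=3, groups=1, reduction=4):
        super().__init__()
        self.k, self.groups, self.channels = kernel_size, groups, channels
        self.reduce = nn.Sequential(
            nn.Conv2d(channels, channels // reduction, 1),
            nn.BatchNorm2d(channels // reduction),
            nn.ReLU(inplace=True))
        self.span = nn.Conv2d(channels // reduction, kernel_size * kernel_size * groups, 1)
        self.unfold = nn.Unfold(kernel_size, padding=kernel_size // 2)

    def forward(self, x):
        b, c, h, w = x.shape
        # Kernels are generated per spatial position from the local feature.
        weight = self.span(self.reduce(x))                               # (B, K*K*G, H, W)
        weight = weight.view(b, self.groups, self.k * self.k, h, w).unsqueeze(2)
        # Neighborhoods are unfolded and weighted by the position-specific kernels.
        out = self.unfold(x).view(b, self.groups, c // self.groups, self.k * self.k, h, w)
        return (weight * out).sum(dim=3).view(b, c, h, w)
```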
Cite:
GB/T 7714: Wu, Xinjia, Zhang, Jing, Li, Wensheng, et al. Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images [J]. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44(4): 1280-1307.
MLA: Wu, Xinjia, et al. "Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images." INTERNATIONAL JOURNAL OF REMOTE SENSING 44.4 (2023): 1280-1307.
APA: Wu, Xinjia, Zhang, Jing, Li, Wensheng, Li, Jiafeng, Zhuo, Li, Zhang, Jie. Spatial-specific Transformer with involution for semantic segmentation of high-resolution remote sensing images. INTERNATIONAL JOURNAL OF REMOTE SENSING, 2023, 44(4), 1280-1307.