Bi-Source Reconstruction-Based Classification Network for Face Forgery Video Detection - Details

Author：

Zhang, D. (Zhang, D..) | Fu, C. (Fu, C..) | Lu, D. (Lu, D..) | Li, J. (Li, J..) | Zhang, Y. (Zhang, Y..)

Indexed by：

EI Scopus SCIE

Abstract：

Current　methods　for　detecting　deep　fakes　concentrate　on　specific　patterns　of　forgery　like　noise　characteristics,　local　textures,　or　frequency　statistics.　These　approaches　assume　training　and　test　sets　exhibit　similar　data　distributions,　which　bring　severe　performance　drops　and　further　limit　broader　applications　when　migrating　unseen　domains.　Existing　works　show　that　reconstruction　learning　is　effective　in　capturing　unseen　forgery　clues.　However,　2D　reconstruction　is　insufficient　and　can　not　handle　non-frontal　face　reconstruction,　while　3D　reconstruction　provides　more　critical　details　of　facial　structure　and　finds　accurate　forgery　regions.　In　this　paper,　we　propose　a　bi-source　reconstruction　based　classification　network　(BRCNet)　to　incorporate　2D　and　3D　reconstruction　as　the　supervisions　and　learn　the　optimal　feature　representation.　In　detail,　we　employ　an　encoder-decoder　architecture　to　facilitate　reconstruction　learning,　enhancing　the　learned　representations　to　detect　forgery　patterns　that　are　unknown.　To　further　capture　forgery　evidence　across　multiple　scales,　instead　of　using　encoder　features　from　the　reconstruction　network　only,　we　build　a　feature　improvement　network　to　combine　feature　details　from　encoder　and　decoder　features　in　a　multi-scale　fashion.　In　addition,　we　use　the　reconstruction　difference　to　supervise　the　feature　aggregation,　which　enables　detecting　the　subtle　and　trivial　discrepancies　between　fake　and　real　video　frames.　Extensive　experiments　are　conducted　to　validate　the　performance　of　our　proposed　method　on　several　deep　fake　benchmarks.　The　results　demonstrate　the　efficacy　of　our　approach,　offering　promising　results　and　showcasing　its　potential　for　practical　applications.　The　source　code　is　available　at　https://github.com/cccvl/BRCNet.　　©　1991-2012　IEEE.

Keyword：

multi-scale feature aggregation multi-scale attention feature alignment Face forgery video detection 3D reconstruction

Author Community：

[ 1 ] [Zhang D.]People's Daily Online, State Key Laboratory of Communication Content Cognition, Beijing, 100733, China
[ 2 ] [Fu C.]Beijing University of Technology, Faculty of Information Technology, Beijing, 100021, China
[ 3 ] [Lu D.]Beijing University of Technology, Faculty of Information Technology, Beijing, 100021, China
[ 4 ] [Li J.]People's Daily Online, State Key Laboratory of Communication Content Cognition, Beijing, 100733, China
[ 5 ] [Zhang Y.]University of Science and Technology of China, School of Information Science and Technology, Hefei, 230026, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

TP-Net: Two-Path Network for Retinal Vessel Segmentation
2023，IEEE Journal of Biomedical and Health Informatics
Vehicle Re-identification Based on Multi-Scale Attention Feature Fusion
2022，11th International Conference on Computing and Pattern Recognition, ICCPR 2022
Multi-Scale Attention-Based Deep Neural Network for Brain Disease Diagnosis
2022，CMC-COMPUTERS MATERIALS & CONTINUA
Road Scene Segmentation Based on Multi-scale Attention Mechanism
2022，

Source ：

IEEE Transactions on Circuits and Systems for Video Technology

ISSN： 1051-8215

Year： 2024

Issue： 6

Volume： 34

Page： 4257-4269

8 . 4 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 4

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to