Hybrid Transformer Network for Deepfake Detection

Khan, Sohail Ahmed; Dang Nguyen, Duc Tien

dc.contributor.author	Khan, Sohail Ahmed
dc.contributor.author	Dang Nguyen, Duc Tien
dc.date.accessioned	2023-03-31T13:15:20Z
dc.date.available	2023-03-31T13:15:20Z
dc.date.created	2022-11-23T11:42:09Z
dc.date.issued	2022
dc.identifier.isbn	9781450397209
dc.identifier.uri	https://hdl.handle.net/11250/3061520
dc.description.abstract	Deepfake media is becoming widespread nowadays because of the easily available tools and mobile apps which can generate realistic looking deepfake videos/images without requiring any technical knowledge. With further advances in this field of technology in the near future, the quantity and quality of deepfake media is also expected to flourish, while making deepfake media a likely new practical tool to spread mis/disinformation. Because of these concerns, the deepfake media detection tools are becoming a necessity. In this study, we propose a novel hybrid transformer network utilizing early feature fusion strategy for deepfake video detection. Our model employs two different CNN networks, i.e., (1) XceptionNet and (2) EfficientNet-B4 as feature extractors. We train both feature extractors along with the transformer in an end-to-end manner on FaceForensics++, DFDC benchmarks. Our model, while having relatively straightforward architecture, achieves comparable results to other more advanced state-of-the-art approaches when evaluated on FaceForensics++ and DFDC benchmarks. Besides this, we also propose novel face cut-out augmentations, as well as random cut-out augmentations. We show that the proposed augmentations improve the detection performance of our model and reduce overfitting. In addition to that, we show that our model is capable of learning from considerably small amount of data.	en_US
dc.language.iso	eng	en_US
dc.publisher	ACM	en_US
dc.relation.ispartof	CBMI '22: Proceedings of the 19th International Conference on Content-based Multimedia Indexing
dc.rights	Navngivelse 4.0 Internasjonal	*
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/deed.no	*
dc.title	Hybrid Transformer Network for Deepfake Detection	en_US
dc.type	Chapter	en_US
dc.description.version	publishedVersion	en_US
dc.rights.holder	Copyright 2022 The Author(s)	en_US
cristin.ispublished	true
cristin.fulltext	original
cristin.qualitycode	1
dc.identifier.doi	https://doi.org/10.1145/3549555.3549588
dc.identifier.cristin	2079125
dc.source.pagenumber	8-14	en_US
dc.relation.project	Norges forskningsråd: 309339	en_US
dc.identifier.citation	In: CBMI '22: Proceedings of the 19th International Conference on Content-based Multimedia Indexing, pp. 8-14.	en_US

Tilhørende fil(er)

Filnavn:: 3549555.3549588.pdf
Størrelse:: 2.927Mb
Format:: PDF
Beskrivelse:: PDF

Åpne

Denne innførselen finnes i følgende samling(er)

Department of Information Science and Media Studies [853]
Registrations from Cristin [9688]

Vis enkel innførsel

Med mindre annet er angitt, så er denne innførselen lisensiert som Navngivelse 4.0 Internasjonal