MIFAE-Forensics

This is the official implementation of MIFAE-Forensics for DeepFake detection.

Catalog

  • Visualization demo.
  • Pre-training code.
  • Fine-tuning code.

Network Structure.

image

The network is pre-trained with two pretext tasks: facial region guided masking in the spatial domain and high-frequency component masking in the frequency domain.

Visualization Results.

1. Frequency Visualization

Original image -> High-frequency components masking -> Network prediction -> Full reconstruction

image
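The "high-frequency components masking" step in the pipeline above can be illustrated with a small NumPy sketch. This is not the repository's code: the function name `mask_high_frequencies`, the circular low-pass mask, and the reading of the radius as a cutoff measured from the spectrum centre are all assumptions made for illustration.

```python
import numpy as np


def mask_high_frequencies(image: np.ndarray, radius: int) -> np.ndarray:
    """Remove frequency components farther than `radius` from the spectrum centre.

    `image` is a 2-D (grayscale) array. Hypothetical helper, not the repository's API.
    """
    h, w = image.shape
    # Move the zero-frequency component to the centre of the spectrum.
    spectrum = np.fft.fftshift(np.fft.fft2(image))
    # Distance of every frequency bin from the centre.
    yy, xx = np.ogrid[:h, :w]
    dist = np.sqrt((yy - h // 2) ** 2 + (xx - w // 2) ** 2)
    # Keep only the low-frequency disc; everything outside is masked out.
    low_pass = dist <= radius
    filtered = spectrum * low_pass
    # Back to the spatial domain; the imaginary part is numerical residue.
    return np.real(np.fft.ifft2(np.fft.ifftshift(filtered)))
```

In this sketch the network would receive the low-pass image and be trained to predict the removed components. Whether `--mask_radius` in the pre-training command has exactly this meaning is not stated in this README.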

2. Spatial Visualization

  • We first visualize the MAE with the facial region guided masking strategy proposed in our paper.

Original image -> Facial region guided masking -> Network prediction -> Full reconstruction

image
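One possible way to bias patch masking towards facial regions is sketched below. The README does not describe the sampling rule, so everything here is an assumption: the helper name `facial_guided_mask`, the boolean `face_patches` map (which would come from some face parser or landmark detector), and the rule "mask facial patches first, then fill up randomly".

```python
import numpy as np


def facial_guided_mask(face_patches, mask_ratio=0.75, rng=None):
    """Return a boolean mask over patches (True = masked).

    Hypothetical rule: patches flagged in `face_patches` are masked first,
    and the remaining budget is filled with randomly chosen patches.
    """
    rng = np.random.default_rng() if rng is None else rng
    face_patches = np.asarray(face_patches, dtype=bool).ravel()
    num_patches = face_patches.size
    num_masked = int(round(num_patches * mask_ratio))
    # Random priority in [0, 1); facial patches get +1 so they always rank first.
    priority = rng.random(num_patches) + face_patches
    order = np.argsort(-priority)
    mask = np.zeros(num_patches, dtype=bool)
    mask[order[:num_masked]] = True
    return mask
```

For a 224x224 input with 16x16 patches there are 196 patches, so a mask ratio of 0.75 masks 147 of them. Setting `face_patches` to all `False` reduces this sketch to the random masking of vanilla MAE.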

  • We also visualize the vanilla MAE reconstruction, without the facial region guided masking strategy, for comparison.

Original image -> Random masking -> Network prediction -> Full reconstruction

image

3. DeepFake detection via the reconstruction discrepancy.

image
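The idea of scoring an image by its reconstruction discrepancy can be sketched as follows. This is a minimal illustration, not the repository's detector: the function name, the absolute-difference error, and the mean-based score are assumptions, and a real pipeline would take `reconstruction` from the pre-trained model.

```python
import numpy as np


def reconstruction_discrepancy(image, reconstruction):
    """Return (error_map, score) for an image and its reconstruction.

    Hypothetical helper: the error map is the per-pixel absolute difference
    (averaged over channels if present) and the score is its mean.
    """
    image = np.asarray(image, dtype=np.float64)
    reconstruction = np.asarray(reconstruction, dtype=np.float64)
    error_map = np.abs(image - reconstruction)
    if error_map.ndim == 3:
        # Collapse the channel axis (H, W, C) -> (H, W).
        error_map = error_map.mean(axis=-1)
    return error_map, float(error_map.mean())
```

Under the assumption that a model pre-trained on real faces reconstructs real faces more faithfully than manipulated ones, a larger score would point towards a fake. Any decision threshold would have to be chosen on validation data.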

Usage

Pre-training instruction

To pre-train ViT-B/16 (recommended default) with multi-node distributed training, run the following on 8 nodes with 8 GPUs each:

python submitit_pretrain.py \
    --job_dir ${JOB_DIR} \
    --nodes 8 \
    --use_volta32 \
    --batch_size 64 \
    --model mae_vit_base_patch16 \
    --norm_pix_loss \
    --mask_ratio 0.75 \
    --mask_radius 16 \
    --epochs 800 \
    --warmup_epochs 40 \
    --blr 1.5e-4 --weight_decay 0.05 \
    --data_path ${IMAGENET_DIR}
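The command above implies an effective batch size and an absolute learning rate. The sketch below assumes this repository keeps the linear scaling rule of the MAE codebase it is built on (`lr = blr * effective_batch_size / 256`); the helper name `effective_lr` is hypothetical.

```python
def effective_lr(blr, batch_size_per_gpu, nodes, gpus_per_node, accum_iter=1):
    """Return (effective_batch_size, absolute_lr) under the linear scaling rule.

    Assumes the MAE convention: lr = blr * effective_batch_size / 256.
    """
    eff_batch_size = batch_size_per_gpu * nodes * gpus_per_node * accum_iter
    return eff_batch_size, blr * eff_batch_size / 256


# Values taken from the command above: 64 per GPU, 8 nodes, 8 GPUs per node.
eff, lr = effective_lr(blr=1.5e-4, batch_size_per_gpu=64, nodes=8, gpus_per_node=8)
print(eff, lr)
```

With these values the effective batch size is 64 * 8 * 8 = 4096 and the absolute learning rate is about 2.4e-3. When fewer GPUs are available, the same rule suggests keeping `--blr` fixed and letting the absolute learning rate shrink with the batch size.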

Fine-tuning instruction

You can choose different reconstruction strategies through:

  1. args.recon_real (reconstruction of real faces only),
  2. args.recon_dual (positive reconstruction on real faces and negative reconstruction on fake faces),
  3. direct fine-tuning without reconstruction (neither flag set).

python partial_finetuning_with_reconstruction.py \
    --finetune "" \
    --decoder "" \
    --recon_real
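How the three strategies might be selected from the command line is sketched below. The flag names come from the command above; treating them as boolean switches, making them mutually exclusive, and the helper names `build_parser` and `reconstruction_strategy` are assumptions rather than the script's actual argument handling.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags used in the command above."""
    parser = argparse.ArgumentParser()
    parser.add_argument("--finetune", default="", help="pre-trained encoder checkpoint")
    parser.add_argument("--decoder", default="", help="pre-trained decoder checkpoint")
    # Assumption: at most one reconstruction strategy is active at a time.
    group = parser.add_mutually_exclusive_group()
    group.add_argument("--recon_real", action="store_true")
    group.add_argument("--recon_dual", action="store_true")
    return parser


def reconstruction_strategy(args):
    """Map the parsed flags to one of the three strategies listed above."""
    if args.recon_real:
        return "real_only"
    if args.recon_dual:
        return "dual"
    return "none"
```

For example, `reconstruction_strategy(build_parser().parse_args(["--recon_real"]))` returns `"real_only"`, and passing no flag falls through to direct fine-tuning.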

Acknowledgement

This repository is built on MAE.
