darkk-pvt / avmod

AVmod is an audiovisual modulator built with deepfake techniques

License: MIT License

Python 29.89% Jupyter Notebook 70.11%
face-detection deep-learning gan lsgan python keras deepfakes tensorflow umeyama keras-vggface

AVmod

AVmod is an audiovisual modulator developed as a third-year project.

  • AVmod uses LSGAN and CycleGAN for face swapping and voice modulation.

Contributors

  • Himanshu
  • Jai
  • Karan
  • Sagar

Description

  • MTCNN_face_detection.ipynb

    • File-1
    • Handles frame extraction and face detection/alignment on the input video.
    • Detected faces are saved in ./faces/raw_face (non-aligned) and ./faces/aligned_faces (aligned).
    • Crude binary eye masks are saved in ./faces/binary_mask_eye.
  • prep_binary_masks.ipynb

    • File-2
    • Used for data preprocessing.
    • Creates binary masks from aligned_faces and saves the results in the ./binary_masks/faceA_eyes and ./binary_masks/faceB_eyes folders.
    • Requires the face_alignment package.
  • train.ipynb

    • File-3
    • Used for model training.
    • Requires the additional training images generated by prep_binary_masks.ipynb.
    • Saves models in the ./model directory.
    • Saves backup models in ./model/backup_iter{iteration_num}.
  • video_conversion.ipynb

    • File-4
    • Used for video conversion, based on the model trained in train.ipynb.
    • Uses five-point landmarks for face alignment.
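
To illustrate the five-point alignment step mentioned for video_conversion.ipynb, here is a minimal sketch of fitting a 2D similarity transform (rotation, uniform scale, translation) between detected landmarks and a reference template. It is not the notebook's code: the function names and the `TEMPLATE_256` coordinates are hypothetical, and the fit is a plain least-squares solution written with complex numbers so it needs only the standard library.

```python
from typing import List, Tuple

Point = Tuple[float, float]

# Hypothetical reference positions in a 256x256 crop
# (left eye, right eye, nose tip, left mouth corner, right mouth corner).
TEMPLATE_256: List[Point] = [
    (89.0, 102.0), (167.0, 102.0), (128.0, 148.0), (96.0, 189.0), (160.0, 189.0),
]


def fit_similarity(src: List[Point], dst: List[Point]) -> Tuple[complex, complex]:
    """Least-squares similarity transform mapping src onto dst.

    Each point (x, y) is treated as the complex number x + iy, so the
    transform is w = a * z + b: abs(a) is the scale, the phase of a is
    the rotation angle, and b is the translation.
    """
    if len(src) != len(dst) or len(src) < 2:
        raise ValueError("need at least two matching point pairs")
    z = [complex(x, y) for x, y in src]
    w = [complex(x, y) for x, y in dst]
    z_mean = sum(z) / len(z)
    w_mean = sum(w) / len(w)
    # Centre both point sets so the translation drops out of the fit.
    zc = [p - z_mean for p in z]
    wc = [p - w_mean for p in w]
    denom = sum(abs(p) ** 2 for p in zc)
    if denom == 0:
        raise ValueError("source points are all identical")
    a = sum(p.conjugate() * q for p, q in zip(zc, wc)) / denom
    b = w_mean - a * z_mean
    return a, b


def apply_similarity(a: complex, b: complex, points: List[Point]) -> List[Point]:
    """Apply w = a * z + b to a list of (x, y) points."""
    out = []
    for x, y in points:
        p = a * complex(x, y) + b
        out.append((p.real, p.imag))
    return out
```

In use, one would call `fit_similarity(detected_landmarks, TEMPLATE_256)` for each frame and warp the face crop with the resulting transform; the warping itself (for example with OpenCV) is left out of this sketch.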

Training Data

  • Training images are picked from ./facesA/aligned_faces and ./facesB/aligned_faces, one folder per target.
  • Images are resized to 256x256 for training.
  • Training runs for 40000 iterations by default; this can be increased to 80000 or more as required.
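
The data layout and iteration settings above can be sketched roughly as follows. This is an illustration rather than the code in train.ipynb: the helper names, the accepted file extensions, and the backup interval (`every=5000`) are assumptions, while the folder paths, the 256x256 size, the 40000-iteration default, and the backup_iter{iteration_num} naming come from this README.

```python
from pathlib import Path
from typing import List

# Assumed image extensions; the README does not say which formats are used.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}


def list_aligned_faces(folder) -> List[Path]:
    """Return the image files in one aligned_faces folder, sorted by path."""
    folder = Path(folder)
    return sorted(p for p in folder.glob("*") if p.suffix.lower() in IMAGE_SUFFIXES)


def load_resized(path, size: int = 256):
    """Read one image and resize it to size x size pixels."""
    import cv2  # imported lazily so the path helpers work without OpenCV

    img = cv2.imread(str(path))
    if img is None:
        raise IOError(f"could not read image: {path}")
    return cv2.resize(img, (size, size))


def backup_dir(iteration: int, model_dir="./model") -> Path:
    """Backup folder for a given iteration, following ./model/backup_iter{iteration_num}."""
    return Path(model_dir) / f"backup_iter{iteration}"


def backup_iterations(total: int = 40000, every: int = 5000) -> List[int]:
    """Iterations at which a backup would be written (the interval is hypothetical)."""
    return list(range(every, total + 1, every))
```

For example, `list_aligned_faces("./facesA/aligned_faces")` and `list_aligned_faces("./facesB/aligned_faces")` would give the two training sets, and `backup_dir(5000)` points at `model/backup_iter5000`.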

Requirements

  • python 3.6.4
  • tensorflow r1.15.2
  • keras r2.1.5
  • opencv
  • keras_vggface
  • moviepy
  • face_alignment
  • pathlib

Installing Dependencies

pip install -r requirements.txt
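
After installing, a quick check that the listed packages can be imported may save a failed notebook run. The snippet below is a small sketch, not part of the repository; the import names are the usual ones for each package (opencv is imported as `cv2`), and `find_missing` is a hypothetical helper.

```python
import importlib.util
from typing import Iterable, List

# Import names for the packages listed under Requirements.
REQUIRED_MODULES = [
    "tensorflow", "keras", "cv2", "keras_vggface",
    "moviepy", "face_alignment", "pathlib",
]


def find_missing(modules: Iterable[str] = REQUIRED_MODULES) -> List[str]:
    """Return the top-level module names that cannot be found."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing()
    print("missing modules:", ", ".join(missing) if missing else "none")
```

This only checks that each module can be located; it does not verify the versions listed above.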

Todo

  • Functionality for voice modulation
  • Increase face swapping area
  • Binary Mask for mouth
  • Interface for easy access to training and conversion
  • GPU support for py files (CUDA)

Acknowledgment

Code borrowed from tjwei and keras-contrib. The generative network is adopted from CycleGAN. Weights and scripts of MTCNN are from FaceNet.
