Giter Site home page Giter Site logo

ha0tang / gesturegan Goto Github PK

View Code? Open in Web Editor NEW
173.0 11.0 20.0 12.99 MB

[ACM MM 2018 Oral] GestureGAN for Hand Gesture-to-Gesture Translation in the Wild

Home Page: http://disi.unitn.it/~hao.tang/project/GestureGAN.html

License: Other

MATLAB 9.70% Lua 6.01% Shell 2.04% Python 82.25%
pytorch acmmm2018 computer-vision gans generative-model generative-adversarial-network deep-learning image-generation image-translation image-manipulation

gesturegan's Introduction

  • πŸ‘― We are looking self-motivated researcher to join/visit our Group.

GitHub stats

Top Langs

Hao Tang

[Homepage] [Google Scholar] [Twitter]

I am currently a postdoctoral researcher at Computer Vision Lab, ETH Zurich, Switzerland.

⚑ News

We released the code of XingVTON and CIT for virtual try-on, the code of TransDA for source-free domain adaptation using Transformer, the code of IEPGAN for 3D pose transfer, the code of TransDepth for monocular depth prediction using Transformer, the code GLANet for unpaired image-to-image translation, the code MHFormer for 3D human pose estimation.

🌱 My Repositories

3D-Aware Image/Video Generation

3D Human Pose Estimation

Text-to-Image Synthesis

3D Objection Generation

Monocular Depth Prediction

Face Anonymisation

Person Image Generation

Scene Image Generation

Unsupervised Image Translation

Deep Dictionary Learning

Virtual Try-On

Hand Gesture Recognition

Source-Free Domain Adaptation

gesturegan's People

Contributors

ha0tang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

gesturegan's Issues

L2 loss channel-wise missing?

Hello!

I've been reading the GestureGAN paper and the way in which calculating the L2 loss in the generator channel-wise avoids "channel pollution". However, I can not find it implemented in the code. Only the L1 loss is calculated channel-wise, which the paper states that is not necessary. Is this an error en in the code, and when calculating L1 loss it was meant to calculate L2?

Thanks!

Filter failure cases manually from train set?

I am interested to understand the working of gestureGAN_twocycle model. I downloaded the senz3d dataset and prepared training and test data as indicated. When I saw the training output, I noticed that there are many images for which the pose is wrongly identified. Did you manually delete them from the 135,504 training samples? Can you please provide a .txt file with the correct sample's filenames ?
Thanks and best regards,
jysa01

How/Where is the skeleton image embedding implemented?

Hello @Ha0Tang ,
I read through the GestureGAN paper and noticed under Experimental setup that the skeleton images are embedded by passing through an encoder. I cannot identify where this has been implemented in the code.
The dataloader seggregates the incoming input image into four 256x256 blocks - RealA, RealB, RealC, RealD. This is then fed to the model using set_input function. Am I missing something here? Kindly help

Thanks,
jysa01

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.