Giter Site home page Giter Site logo

guohengkai / region-ensemble-network Goto Github PK

View Code? Open in Web Editor NEW
113.0 113.0 36.0 63.34 MB

Repository for Region Ensemble Network based Hand Pose Estimation

License: GNU General Public License v2.0

Python 100.00%
convolutional-neural-networks deep-learning hand-pose-estimation region-ensemble-network

region-ensemble-network's People

Contributors

guohengkai avatar xinghaochen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

region-ensemble-network's Issues

Install pyrealsense

@guohengkai

Thanks for sharing your code. There is some errors when installed the pyrealsense, can you give me the operations in details?

regards,
weiguo

Windows specifics: set environment variable PYRS_INCLUDES to the rs.h directory location and environment variable PYRS_LIBS to the librealsense binary location. You might also need to have stdint.h available in your path.

Speed and GPU memory

Nice work!
i wonder how fast is the algorithm?
And how many GPU memories does this algorithm occupy?

Training codes using REN

Hi,
Thank you very much for sharing codes! I'm a new beginner for the framework of caffe, can you give more details on the training processing? I
regards,
weiguo

Can GAP Layer replace FC layer?

Hi thans yours work. I'm interest to your work. There is a question for me. Fc Layer param is so much. Can GAP Layer replace FC layer in your network?

Can you release the training code?

Thank you for your awesome research. I want to use your method to train my own model but I find that there is only the prediction code in the repo. Can you release the training code together? I will be very grateful for that.

Data augmentation

Hi, Thanks for your research and paper. I am trying to implement it in pytorch.

I would like to ask about the data augmentation (translation, scaling, and rotation).
If i am understanding it correctly, first you get the cube (150x150), then resize to 96x96, then normalize to -1,1. This is all done using _crop_image.

How did you apply the transformations after _crop_image?

Thanks for your help.

data preprocessing

HI~
Thank you for your great work!
Could you please explain how do you get the hand center for me?
I see you have given the hand center coordinate in /labels/*_center.txt

Thank you very much!

solver.prototxt

Dear hengkai,
Can you share the solver.prototxt file that used to train the networks? I want to run the architecture described in the paper.
regards,
weiguo

Visualization part lack argumentparser

Hello,
Thank you for sharing! There may be needed more argumentparsers in the visualization part,e,g,dataset_model.

$ python evaluation/show_result.py icvl --in_file=results/icvl_ren_4x6x6.txt

regards,
weiguo

hand_model.detect_image() error

Hi,

Thank you for your nice code! I want to reproduce the results, Since the pyrs.start() doesn't work, then I am using pysr.service instead. There is some following errors

Traceback (most recent call last):
File "demo/show_cv2.py", line 75, in
results = hand_model.detect_image(d)
File "/home/alex/PhD/PoseEstimation/REN/region-ensemble-network-master/evaluation/hand_model.py", line 42, in detect_image
return self.detect_images([img])[0,...]
TypeError: 'NoneType' object has no attribute 'getitem'

Can you give me some advice?

Regards,
weiguo

About the loss function

hi:
excellent job!!
I want to retrain the model , is it convenient for you to tell me the loss function.
I did not see the info about loss function neither in the paper nor in the model folder(only deploy prototxt)
thank you !

trianing code about ITOP dataset

hi, thx for you work. Can you release the training code about ITOP dataset? I'm dong some research on this
dataset and looking forward to it.

label normalization

thx a lot for your paper!
Here comes the question,how you normalized the joints according to the cropping?
thx again and hope for your reply.

parse model error.

感谢分享~
我在执行: python evaluation/run_model.py icvl ren_4x6x6 your/path/to/output/file your/path/to/ICVL/images/test 的时候,
读入模型报了下面的错误:
Can't parse message of type "caffe.NetParameter" because it is missing required fields: layer[53].psroi_pooling_param.output_dim, layer[53].psroi_pooling_param.group_size

Annotation error on NYU dataset

Hi, thank you for your code. And it works well on ICVL dataset. However, I found there is some error on NYU dataset. Hand joint position is correct, but the line is error with the start point. Can you give the more information on annotation of 14 hand pose joints of the NYU dataset which you used to train?

def get_sketch_setting(dataset):
    if dataset == 'icvl':
        return [(0, 1), (1, 2), (2, 3), (0, 4), (4, 5), (5, 6),
                (0, 7), (7, 8), (8, 9), (0, 10), (10, 11), (11, 12),
                (0, 13), (13, 14), (14, 15)]
    elif dataset == 'nyu':
        return [(0, 1), (0, 2), (0, 5), (3, 4), (4, 5), (0, 7), (6, 7),
                (0, 9), (8, 9), (0, 11), (10, 11), (0, 13), (12, 13)]
    elif dataset == 'msra':
        return [(0, 1), (1, 2), (2, 3), (3, 4), (0, 5), (5, 6), (6, 7), (7, 8),
                (0, 9), (9, 10), (10, 11), (11, 12), (0, 13), (13, 14), (14, 15), (15, 16),
                (0, 17), (17, 18), (18, 19), (19, 20)]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.