Giter Site home page Giter Site logo

Comments (5)

gemcollector avatar gemcollector commented on August 29, 2024 1

Thx for your checking! Could you provide me with your email? (or you can send me an email) I can give you the results, model, data etc..

from rl-vigen.

gemcollector avatar gemcollector commented on August 29, 2024

Hi, there! ~ Thanks for your question. Yes, you should use the other branch to run the dexterous hand environment. The dexterous hand environment based on VRL3 algorithm, and it should use demonstrations to learn these tasks. For simplicity and reproducibility, I create a new branch for this environment. :)

from rl-vigen.

coderlemon17 avatar coderlemon17 commented on August 29, 2024

Hi, thanks for your reply, I have a few more questions.

  1. Since V3RL is a 3-stage training framework, so do all algorithms reported in Fig. 6 (i.e. the aggregated generalization score of dexterous manipulation) utilize this framework (and additional data) or just the VRL3?
  2. I noticed that the Robosuite also has the different-visual-vppearance setting (Appendix D.1.1), but there are no experiments reported on it. I wonder do you happen to have the generalization results of different baselines in this setting too?
  3. For Appendix E.2 Wall Time, I'm not sure how the FPS is calculated. Is it total_frame_collected / total training time or frame_in_a_batch / time_to_calculate_a_batch? Also, shouldn't this metric heavily depend on the hardware that the algorithms are evaluated with? (i.e. Different CPU, GPU might result in significantly different results.)

from rl-vigen.

gemcollector avatar gemcollector commented on August 29, 2024

Thanks for your question!

  1. As mentioned in Appendix, we only use the stage 3 for training, you can check the code. The original paper and our experiments find that it is enough for achieving a good policy for our tasks.
  2. Yea, we also show the results of different baselines (Figure 19) in Appendix .
  3. You can check the logger.py to know the details of how to calculate the FPS, and the FPS indeed relies on your hardware. I test them on Tesla A40.

from rl-vigen.

coderlemon17 avatar coderlemon17 commented on August 29, 2024

Thank you for your reply.

  1. Sorry, I missed Fig. 19., it's indeed what I want. However, if it is possible, could you kindly provide the results for each separate task, and maybe the data for plotting? We would like to cite it in our paper.

  2. Also about the evaluation pipeline regarding robosuite in Fig. 19, I haven't found a script for it. Based on my understanding:

  • First, you train the agent in one fix scene in train mode.
  • Then you evaluate the generalization performance for 100 trials. (10 trails in each scene under eval-$DIFFICULTY.
  1. I think there's a small typo here, where eval_easy -> eval-easy

from rl-vigen.

Related Issues (18)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.