Giter Site home page Giter Site logo

tonghanwang / eiti-edti Goto Github PK

View Code? Open in Web Editor NEW
28.0 28.0 6.0 4.13 MB

Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)

Home Page: https://sites.google.com/view/influence-based-mae

License: MIT License

Dockerfile 0.05% Python 99.95%

eiti-edti's People

Contributors

tonghanwang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

eiti-edti's Issues

EITI 中 VAE 具体怎么计算及使用?

image

你好,请问这个公式里,log 里面的两个概率怎么计算?虽然论文说用 vae,但是 vae 不是用来压缩及采样的吗?怎么得到概率?对应代码在哪里?

能提供一下island相关的运行命令吗

你好,我仿照pass任务上运行的command(只修改了size=10)在task island 上运行了ppo2. 但是会出现如下维数相关的问题。我认为这可能是缺少必须的参数设置。请问我如果想在island上成功运行代码都需要在命令上修改哪些参数?
Traceback (most recent call last): File "./baselines/run.py", line 302, in <module> main(sys.argv) File "./baselines/run.py", line 244, in main model, env = train(args, extra_args) File "./baselines/run.py", line 95, in train **alg_kwargs File "/home/sunrifr/EITI-EDTI/baselines/ppo2/ppo2.py", line 188, in learn ext_rewards_tv_n, int_rewards_tv_n, ext_rewards_v_n, int_rewards_v_n = runner.run() # pylint: disable=E0632 File "/home/sunrifr/EITI-EDTI/baselines/ppo2/runner.py", line 153, in run self.obs_n, t_rewards, self.dones, infos = self.env.step(actions) File "/home/sunrifr/EITI-EDTI/baselines/common/vec_env/subproc_vec_env.py", line 513, in step self.key_points.update([self.pre_state_n[j], actions[j]], [state_n[j], None]) File "/home/sunrifr/EITI-EDTI/baselines/Curiosity.py", line 1468, in update self.p_c_1.add(c_num_1) File "/home/sunrifr/EITI-EDTI/baselines/Curiosity.py", line 103, in add self.count_1.add(num[:-self.n_1]) File "/home/sunrifr/EITI-EDTI/baselines/Curiosity.py", line 64, in add self.count_1.add(num) File "/home/sunrifr/EITI-EDTI/baselines/Curiosity.py", line 39, in add f = f[num[i]] IndexError: index 4 is out of bounds for axis 0 with size 4

期待你的回复,谢谢!

模型的测试

你好,可以提供一个训练好的模型吗?方便我测试

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.