Training environment reward question about marl-algorithms (closed)

starry-sky6688 commented on May 18, 2024

Training environment reward question

from marl-algorithms.

Comments (2)

starry-sky6688 commented on May 18, 2024

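I trained with the default reward and have not tried the sparse reward. I expect sparse-reward training to be hard, especially when several agents have to cooperate: early on there is no positive experience to learn from, so winning is unlikely, and without wins positive experience stays hard to collect. These three algorithms are all basic learning algorithms, so if you want to train under the sparse reward, the experience side is a good place to start.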
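One way to read "start from the experience side" is to make sure the rare winning episodes are replayed more often than they occur. The snippet below is only a minimal sketch of that idea and is not code from marl-algorithms: the class `SuccessBiasedEpisodeBuffer`, its parameters `capacity` and `success_fraction`, and the `won` flag are all hypothetical names chosen for illustration.

```python
import random
from collections import deque


class SuccessBiasedEpisodeBuffer:
    """Hypothetical episode buffer that oversamples winning episodes.

    Winning and non-winning episodes live in separate pools so that a
    handful of early wins is not drowned out by thousands of losses.
    """

    def __init__(self, capacity=5000, success_fraction=0.5, seed=None):
        self.success = deque(maxlen=capacity)  # episodes that ended in a win
        self.other = deque(maxlen=capacity)    # all remaining episodes
        self.success_fraction = success_fraction
        self.rng = random.Random(seed)

    def add(self, episode, won):
        """Store one whole episode; `won` is assumed to come from the env."""
        (self.success if won else self.other).append(episode)

    def __len__(self):
        return len(self.success) + len(self.other)

    def sample(self, batch_size):
        """Draw a batch, aiming for `success_fraction` winning episodes."""
        batch_size = min(batch_size, len(self))
        n_success = min(len(self.success),
                        round(batch_size * self.success_fraction))
        n_other = min(len(self.other), batch_size - n_success)
        # If the non-winning pool is too small, top up from the winning pool.
        n_success = min(len(self.success), batch_size - n_other)
        batch = (self.rng.sample(list(self.success), n_success)
                 + self.rng.sample(list(self.other), n_other))
        self.rng.shuffle(batch)
        return batch
```

With one winning episode among fifty losses, `sample(4)` returns that win every time instead of roughly one draw in thirteen. Oversampling changes the training distribution, so the fraction is a knob to tune rather than a fixed recommendation.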


goodbyeearth commented on May 18, 2024

Thanks for sharing!


