Giter Site home page Giter Site logo

sigmabm / starcraft Goto Github PK

View Code? Open in Web Editor NEW

This project forked from starry-sky6688/marl-algorithms

0.0 0.0 0.0 26.87 MB

Implementations of QMIX, VDN, COMA, QTRAN, CommNet, DyMA-CL, G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 100.00%

starcraft's Introduction

StarCraft

This is a pytorch implementation of the multi-agent reinforcement learning algorithms, including QMIX, VDN, COMA, QTRAN(both QTRAN-base and QTRAN-alt), and DyMA-CL, which are the state of art MARL algorithms. In addition, we implemented CommNet and combined it with coma, which we called CommNet_COMA. We trained these algorithms on SMAC, the decentralised micromanagement scenario of StarCraft II.

Corresponding Papers

Requirements

Acknowledgement

Quick Start

$ python main.py --evaluate_epoch=100 --map=3m --alg=qmix

Directly run the main.py, then the algorithm will be tested on map '3m' for 100 episodes, using the pretrained model.

The running of DyMA-CL is independent from others beacuse it requires different environment settings, you should open it as a new project, for more details, please read DyMA-CL documentation.

Result

We independently train these algorithms for 8 times and take the mean of the 8 independent results. In order to make the curves smoother, we also take the mean of every five points in the horizontal direction. In each independent training process, we train these algorithms for 5000 epochs and evaluate them for every 5 epochs. From the figure 1 we can see that our results is not the same as in the papers, maybe there are some small bugs, we are pleasure that you pull request to improve this project. Furthermore, as show in figure 2, we compare the best result we think in the 8 independent results. All of the results are saved in ./result .

1. Mean Win Rate of 8 Independent Runs on '3m'

2. Best Win Rate of 8 Independent Runs on '3m'

starcraft's People

Contributors

starry-sky6688 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.