Attempt at Implementing DeepNash

This repository is adapted from baskuit/R-NaD: Experimentation with Regularized Nash Dynamics on a GPU accelerated game (github.com), in particular its vtrace.py file.

This repository contains experimental implementations of DeepNash for two scenarios: Lasertag and the NIM game.

Lasertag

Lasertag is a two-player zero-sum game played in a grid environment. Each agent can move and fire a laser. When a laser hits the opponent, the shooting agent receives a reward and the hit agent receives a penalty of equal magnitude.

Both agents act simultaneously in this scenario. The Laser_tag gym environment in this repository is modified from younggyoseo/lasertag-v0 (github.com), an implementation of DeepMind's LaserTag-v0 game from "A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning" (2017). Install it with the following commands:

cd lasertag-v0
pip install -e .

NIM Game

NIM GAME-Wiki

The NIM game is played with n piles of items, each pile containing an arbitrary number of items. Two players take turns; on each turn a player removes at least one and at most all of the items from a single pile. Passing and taking from more than one pile are not allowed. The player who removes the last item, leaving the opponent with no legal move, wins. For any starting position, exactly one of the two players has a winning strategy.
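The rules above can be sketched in a few lines of Python. This is only an illustrative sketch, not the environment code used in this repository; the names `legal_moves`, `apply_move`, and `is_terminal` are hypothetical helpers introduced here.

```python
from typing import List, Tuple

# A move is (pile index, number of items to remove). Hypothetical encoding.
Move = Tuple[int, int]


def legal_moves(piles: List[int]) -> List[Move]:
    """All legal moves: take 1..size items from exactly one pile."""
    return [(i, k) for i, size in enumerate(piles) for k in range(1, size + 1)]


def apply_move(piles: List[int], move: Move) -> List[int]:
    """Return the new pile sizes after a move; reject illegal moves."""
    i, k = move
    if not 0 <= i < len(piles) or not 1 <= k <= piles[i]:
        raise ValueError(f"illegal move {move} for piles {piles}")
    new_piles = list(piles)
    new_piles[i] -= k
    return new_piles


def is_terminal(piles: List[int]) -> bool:
    """The game is over when every pile is empty; the player to move has lost."""
    return all(size == 0 for size in piles)
```

For example, `legal_moves([1, 2])` lists one move on the first pile and two on the second, and a player who faces `is_terminal(piles) == True` on their turn has no legal move.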

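Suppose there are n piles of items, with 1 item in the first pile, 2 items in the second pile, and so on, up to n items in the n-th pile. Under the rules above, the first player has a winning strategy exactly when the nim-sum (bitwise XOR) of the pile sizes is nonzero. For piles 1, 2, ..., n the nim-sum is zero only when n leaves remainder 3 when divided by 4 (n = 3, 7, 11, ...), so the second player has a winning strategy in those cases and the first player has one for every other n. For example, with n = 1 the first player simply takes the single item and wins.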
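The winner of any starting position can be checked with the nim-sum, the bitwise XOR of all pile sizes, which is the standard result for Nim when the player who takes the last item wins. The sketch below is not taken from this repository; `nim_sum`, `first_player_wins`, and `winning_move` are hypothetical helper names introduced for illustration.

```python
from functools import reduce
from operator import xor
from typing import List, Optional, Tuple


def nim_sum(piles: List[int]) -> int:
    """Bitwise XOR of all pile sizes."""
    return reduce(xor, piles, 0)


def first_player_wins(piles: List[int]) -> bool:
    """The player to move wins with perfect play iff the nim-sum is nonzero."""
    return nim_sum(piles) != 0


def winning_move(piles: List[int]) -> Optional[Tuple[int, int]]:
    """Return (pile index, items to remove) that leaves nim-sum 0, or None."""
    s = nim_sum(piles)
    if s == 0:
        return None  # every move hands the opponent a winning position
    for i, size in enumerate(piles):
        target = size ^ s  # pile size that would make the nim-sum zero
        if target < size:
            return (i, size - target)
    return None


if __name__ == "__main__":
    # Piles 1, 2, ..., n: report which player wins for small n.
    for n in range(1, 9):
        piles = list(range(1, n + 1))
        winner = "first" if first_player_wins(piles) else "second"
        print(f"n={n}: nim-sum={nim_sum(piles)}, {winner} player wins")
```

A trained policy for the NIM scenario could be compared against `winning_move` as a reference, since it gives an optimal move whenever one exists.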

Contributors

deligentfool
