Giter Site home page Giter Site logo

alphago's Introduction

알파고 제로 + 오목

구조

Game State

state

  • 학습 또는 네트워크 계산에 필요한 Input State
  • 게임 보드 N X N의 matrix를 중첩한 구조
    • 현재 플레이어
    • White의 최근 K 개의 상태
    • Black의 최근 K 개의 상태

Agent

  • Collecting Game Data
    • Self-Play를 한 결과로 Game State, Winner, MCTS-Propability를 모음
    • Augmenting data : 회전, 뒤집기 적용
    • Data queue : Augmenting한 data를 저장
  • Training
    • 새로운 데이터 5120 개가 생겨날 때 마다 학습
    • Batch size : 256
    • Training Rate : 0.001
    • Training Loop : 5회
  • Validation Test
    • Training이 끝나고 Best Model과 Current Model간의 대결 테스트

Model

model

  • Input Layer:

    • 64개의 채널의 3x3 커널크기를 가진 Convolution Layer
    • Batch Normalization
    • A rectifier linear unit
  • Residual Block:

    • 64개의 채널의 3x3 커널크기를 가진 Convolution Layer
    • Batch Normalization
    • A rectifier linear unit
    • 64개의 채널의 3x3 커널크기를 가진 Convolution Layer
    • Batch Normalization
    • Residual Layer의 Input 추가
    • A rectifier linear unit
  • Policy Head

    • 2개의 채널의 1x1 커널크기를 가진 Convolution Layer
    • Batch Normalization
    • A rectifier linear unit
    • 보드의 너비 X 높이 개의 output을 가지는 Fully Connected Layer
  • Value Head

    • 1개의 채널의 1x1 커널크기를 가진 Convolution Layer
    • Batch Normalization
    • A rectifier linear unit
    • 64 개의 output을 가지는 Fully Connected Layer
    • A rectifier linear unit
    • [-1, 1]의 결과를 output을 가지는 tanh
  • Loss Function loss

프로젝트 일지

여기로

6x6 4목 2번째 시도

66_2

Reference

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.