Giter Site home page Giter Site logo

Training of human-like models about katago HOT 8 OPEN

BigBoxxx avatar BigBoxxx commented on September 7, 2024
Training of human-like models

from katago.

Comments (8)

lightvector avatar lightvector commented on September 7, 2024

Maybe. If you could find a way to reliably categorize the styles of different players across tens of thousands of games, then you could add some parameters to the human sgf metadata encoder to label the style and train it. But to start off, you would have to find a way to categorize styles for the games without having a model trained to do it, because you wouldn't have a model yet.

from katago.

mooy8899 avatar mooy8899 commented on September 7, 2024

define "human-like"

from katago.

BigBoxxx avatar BigBoxxx commented on September 7, 2024

@lightvector Thank you for your answer. We are trying to move in this direction.

Another question is, can we use your b18c384nbt-humanv0.bin.gz model and fine-tune it with some specific human game data? This would be to adapt it to specific player styles or differences in rating definitions across various regions.

from katago.

jopdorp avatar jopdorp commented on September 7, 2024

It would also be great to make smaller models, for example b10 human style

from katago.

BigBoxxx avatar BigBoxxx commented on September 7, 2024

@jopdorp Good idea~ so which part of the documentation can I refer to in order to implement the training process?🙏

from katago.

jopdorp avatar jopdorp commented on September 7, 2024

@BigBoxxx I think thre are two general approaches here, that is to train a new supervised model from scratch, similar to how @lightvector trained the 18b humansl model, or :

You could take the 18b human sl model, input randomly generated data into it, and the 10b model to be trained, then update the parameters of the 10b model to get closer to the same outputs as the 18b model, this way you would not need any data, and you can get closer to the capabilities of the 18b model.

As a side note, this second approach could also be applied to make more powerful 10b and 20b normal models (non humansl)

from katago.

BigBoxxx avatar BigBoxxx commented on September 7, 2024

@jopdorp Thanks jopdorp, what documents can I refer to for training a new supervised model from scratch, similar to the 18b humansl model?

from katago.

jopdorp avatar jopdorp commented on September 7, 2024

@BigBoxxx I think there is just the source code in the python directory

from katago.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.