[This goes beyond Leela 0. I am not sure where to post it, please feel free to move it

There are two types of suggestions here: Adding

Idea: Pose additional questions to the net in order to make it smarter about lczero.org HOT 6 OPEN

leelachesszero commented on July 30, 2024

Idea: Pose additional questions to the net in order to make it smarter

from lczero.org.

Comments (6)

killerducky commented on July 30, 2024

It sounds interesting, but there is a high cost. I will use Suggestion 1 as an example. You talked about only doing it for random positions, but in the introduction you mentioned the main point is to combine everything into a single NN with multiple outputs. I think that means you need to calculate what the net should output for those cases for all positions, not just some. So you need self-play games to do a full 800 node search for both sides for every position. And you need to double the number of outputs.

Combining value and policy heads is essentially free because we were already computing them. So the question is will it be worth it to generate games at half the rate, and have your NN evals be slower? I can't even guess. :)

from lczero.org.

DaghN commented on July 30, 2024

I agree killerducky, getting policy for both sides should involve twice as much calculation, or half as many games, not to mention the added net structure cost of a bigger input and output.

With regard to games, I think the eventual ceiling of the net is the important thing, as things seem to be heading now.

With regard to the cost/benefit of having better but slower NN evals, I believe that aiming for better is always good, it is what drives the improvement and makes Leela viable at all. The hope is that adding more information/structure will lead to a quantum leap in strength. (As opposed to now, where we simply make the net bigger until it's not worth it anymore because it becomes too slow.)

from lczero.org.

DaghN commented on July 30, 2024

There are two types of suggestions here:

Adding policy for the other side.
Trying to get the net to think more in terms of moves, the logic of the position, deeper comprehension.

After thinking more about it, the second part seems to be more involved and not readily facilitated by the current leela structure. It is not clear that Leela is thinking much at all in terms of tactical logic or move logic (if I move this piece, what will then be possible in the position), as evidenced by the problem with discovered checks. Instead, maybe it is working more on very finetuned/balanced but shallow pattern recognition, to try and put some simple words on the difference.

from lczero.org.

Ishinoshita commented on July 30, 2024

The idea of making more information flow into the net by predicting the next k moves has been explored in go game by Tian et al. (FB, Darkforest bot) in supervised learning mode, achieving better prediction accuracy:
https://arxiv.org/abs/1511.06410. Could work for RL as well. And readily applicable to chess, a priori.
Another paper Multi-Labelled Value Networks for Computer Go (Wu et al., https://arxiv.org/abs/1705.10701) also report training a value head to output position value for different komi (compensation points for second side to move), which de facto amounts to injecting more information into the network. An additional board value (BV) head, sharing the same network front-end with the value head, is trained to output the status of stones/intersections at the end of the game. Yet another implementation of the same idea. Although these two last (multi-komi value head and BV head) are specific to the game of go.

from lczero.org.

mooskagh commented on July 30, 2024

While I don't have any better suggestions of a better place where to post suggestions like this (dev forum?), it would be nice github issues to be more actionable and task oriented, so that we can mark them "done" sometimes.
Keeping this open for now, but we need a better place for non-actionable ideas.. Probably.

from lczero.org.

mooskagh commented on July 30, 2024

I think good place for writeups like those would be a section in our lczero.org website (even if it's already implemented or not relevant anymore).
So I'm moving issue there in order not to forget to migrate it, afterwards it can be closed.

from lczero.org.

Idea: Pose additional questions to the net in order to make it smarter about lczero.org HOT 6 OPEN

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent