
How Many Hidden Layers to Use in Leela Chess Zero? #1887

Answered by mooskagh
ginkgo20 asked this question in Q&A

Initially, the topology of the Lc0 network was exactly the same as that of AlphaZero (a0).

Over time, however, several changes were introduced. In particular:

  • Number of residual blocks changed over time, and the most recent resnet-based nets have 15 blocks (of 2 convolutional layers each).
  • Lc0 uses Squeeze-and-Excitation blocks instead of pure resnet.
  • Policy head is convolutional now rather than fully connected.
  • Value head returns W,D,L rather than a single V value.
  • Moves-left-head was not present in the a0 paper at all.
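
To make two of the points above concrete, here is a rough sketch, not Lc0's actual code. It counts the convolutional layers in a 15-block tower (heads and any input layer are left out) and shows how a W,D,L output can be collapsed back into a single V. The function names are made up for illustration. The softmax normalization and the V = W - L conversion are assumptions of this sketch, not something stated in the answer.

```python
import math


def tower_conv_layers(residual_blocks: int, convs_per_block: int = 2) -> int:
    """Convolutional layers inside the residual tower only (heads excluded)."""
    return residual_blocks * convs_per_block


def wdl_from_logits(logits):
    """Turn three raw outputs into (win, draw, loss) probabilities.

    Assumption: the three outputs are normalized with a softmax.
    """
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return tuple(e / total for e in exps)


def scalar_value(wdl):
    """Collapse (W, D, L) into one value in [-1, 1], assumed here to be W - L."""
    win, _draw, loss = wdl
    return win - loss


print(tower_conv_layers(15))  # 30

# Hypothetical raw outputs, chosen only for illustration.
wdl = wdl_from_logits([1.0, 0.5, -1.0])
print(wdl, scalar_value(wdl))
```

So "15 blocks" means 30 convolutional layers in the tower alone. The W,D,L head also carries strictly more information than a single V, since the draw probability cannot be recovered from V.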

Moreover, the most recent and strongest networks are not resnet-based at all; they are transformer-based.

Answer selected by ginkgo20