KataGo's final (for now) neural nets available

hexahedron · June 21, 2020, 10:46pm

Just for those people who hang out more here than reddit or L19:

KataGo today has uploaded the final neural nets from its latest run, and which are a big step up from any previous nets due to a learning rate drop. They can be found here: https://github.com/lightvector/KataGo/releases/tag/v1.4.5

See the above link for the release and the new nets and executables and/or see this reddit post for more details.

For anyone interested in seeing some games by this latest neural net, the recent series of games from kata-bot-test in the last day or so are the new 40x256 net duking it out against RoyalZeroSlow while limited to about 5000 visits.

kata-bot has also been updated with it as well, and the recent games between it and JBXKataBot2 from 2020-06-19 to 2020-06-20 are also the new net stomping on its former older version (but still losing a few games!) as JBXKataBot2 had only been using the prior released network. Note that JBXKataBot2 also sometimes interleaved this with games against other bots too, so you might have to skip through the history at points.

Although it could definitely go longer and improve more, the run has reached a reasonable stopping point and so I’m stopping it for now due to the cost of continuing. It’s been a great run, and I hope people continue to enjoy it. Thanks to everyone who gave their enthusiasm and support, and for those who contributed or helped test things out.

And if you’re curious about where KataGo might be headed in the future after this, I did include some thoughts in that reddit post above so feel free to check that out. In particular, - in the future maybe look out for a community-distributed run, and/or message me if you have solid web dev experience that might help make that possible. But other than that, I do plan to continue working on KataGo.

Yay.

baduk17 · June 21, 2020, 10:56pm

How much did it cost to keep it improving?

hexahedron · June 22, 2020, 7:42pm

A pretty hefty amount. Most of it was covered by a generous sponsorship that I was able to obtain. Towards the end after reaching the end of that, I did also cover some myself to run a little longer, but paying for a few dozen GPUs for any extended time as an individual wasn’t really practical.

Anyways, that’s the point of going distributed. A lot easier to pool things together when lots of people can each contribute a little - which is what made Leela Zero (and in related domains, projects like Stockfish) so successful.