It’s because the bot isn’t really 23k; it’s a weird combination of much better and much worse. Depending on whether opponents know how to take advantage of the much worse moves, the bot either wins easily or loses regularly, so big rating swings in either direction are more likely than with a more evenly balanced human player.
Sorry to revive an old thread, but I’ve basically stopped playing against bots because of this issue, and I wonder if there are improvements that could be made. I understand the bot ranks are all over the place because the bots are very inconsistent, but none of that is presented on the initial “Play” screen: it just shows a bot with a single rank, and it’s reasonable to assume the bot will play at approximately that rank.
If a range was shown instead, it would make it clearer that the game you get might land anywhere between those values, and it would also make it clearer how inconsistent each bot is (so you could pick one with a narrower range, or a range that sits on the side of your rank that you’d prefer).
Starting a game with a bot that says 18k and then getting a 23k game is rather annoying. If it had instead said “18k - 25k”, maybe I’d have picked a different bot.
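For what it’s worth, a range like that could probably be derived straight from the rating deviation the server already tracks. A minimal sketch, assuming OGS-style Glicko-2 ratings and, if I remember it right, OGS’s published rating-to-rank curve (treat the constants as an assumption; `rank_label` and `displayed_range` are made-up helpers):

```python
import math

def rating_to_rank(rating):
    # OGS's documented Glicko-2 rating-to-rank curve (constants assumed).
    return math.log(rating / 850.0) * 31.25

def rank_label(rank):
    # Ranks below 30 are kyu ranks, 30 and above are dan ranks.
    return f"{30 - int(rank)}k" if rank < 30 else f"{int(rank) - 29}d"

def displayed_range(rating, deviation):
    # rating +/- 2 deviations covers roughly 95% of where the bot's
    # strength might show up in any given game.
    lo = rating_to_rank(rating - 2 * deviation)
    hi = rating_to_rank(rating + 2 * deviation)
    return f"{rank_label(lo)} - {rank_label(hi)}"

# A nominally ~18k bot with a big deviation:
print(displayed_range(1250, 150))  # -> "27k - 12k"
```

Even a crude version of this would at least warn you that the single number on the Play screen is the middle of a wide band.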
I’m currently messing around with the amybot model and exploration parameters, forcing it to play until boards are settled, and I have semi-accidentally produced quite big swings in strength doing so. So maybe that’s what they mean.
If it is, then I should hopefully be able to stabilise their ranks soon, unless I accidentally cause more breakage, of course.
Some of the weaker bots would be more consistent if they made sure not to blunder very basic life and death at the end of a game.
If you’ve written a bot that is intentionally weaker by choosing the second- or third-best move at random, or just generally picking a random move at certain points, then it can throw won games for no reason against almost any rank of player.
Let’s say it’s only throwing about 3 in 50 games completely at random, so p = 0.06 that it loses to anyone. It will still lose a lot of rating to 25 kyus, maybe 50-60 rating points a game, while only gaining about 6 rating points when it wins. I think that works out to a very slight rating gain on average.
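Spelling out that arithmetic with the rough numbers above (the exact point swings depend on the rating system):

```python
# Expected rating change per game, using the numbers from the post:
# the bot throws ~3 in 50 games at random, loses ~55 rating points
# when a 25 kyu beats it, and gains ~6 points when it wins.
p_throw = 3 / 50          # 0.06
loss_points = -55         # midpoint of the 50-60 range
win_points = +6

expected = p_throw * loss_points + (1 - p_throw) * win_points
print(expected)  # ~ +2.3 per game: a very slight gain on average
```

So the bot’s average rating can sit roughly still while individual games swing it wildly, which is exactly the volatility being described.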
Works for easy positions, but may be weird again for very complicated ones, a big fight for example, or life and death on a human group that’s very hard to spot. And the bot is like: “How can this stupid human make a -15 move, they’re 5k stronger than me. God, please fix them so they don’t ignore this ‘obvious’ life and death” xp
Have you looked at the examples I showed above? I’ve given screenshots.
I think even if the bot looked one move ahead and saw that it was about to lose 17 stones because they’re in atari, it could reasonably guess it was about to lose 30 points.
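A one-move-lookahead check like that doesn’t need any deep reading. Here’s a rough sketch, assuming a hypothetical Board API with `groups()` and `play()` helpers (none of this is amybot’s actual code):

```python
def atari_loss(board, color, points_per_stone=2):
    # A group with exactly one liberty is in atari and can be captured
    # on the opponent's next move. Losing n stones costs roughly 2*n
    # points (the stones themselves plus the territory they turn into),
    # so 17 stones lands in the ballpark of the ~30 points above.
    loss = 0
    for group in board.groups(color):         # hypothetical helper
        if len(group.liberties) == 1:
            loss += points_per_stone * len(group.stones)
    return loss

def veto_blunders(board, color, candidate_moves, threshold=10):
    # Drop candidate moves that leave a big group capturable in atari.
    safe = []
    for move in candidate_moves:
        after = board.play(move, color)        # hypothetical helper
        if atari_loss(after, color) < threshold:
            safe.append(move)
    return safe or candidate_moves             # never return nothing
```

It’s only a heuristic (it ignores snapbacks, ko, and whether the capture is actually profitable), but it would catch exactly the late-game atari blunders being discussed.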
Yea, but does a bot know what an easy position and a hard one is? I’m not too deep into the topic, but I’d assume that, for the bot, a situation that’s super complicated for humans can be as obvious as a simple atari.
I don’t even know anything about this specific bot, but when a bot can play well enough to reach like a high ddk rank, it must be doing something correctly.
At that stage, it shouldn’t randomly be ignoring ataris, especially in the late stages of games, unless you’re ok with the bot being very volatile. But the whole point is to question why the bots have large swings in ranking.
If you imagine there are two ways to program a weak bot:

1. Your bot just is that strength naturally: you coded up a bunch of rules, tree search, whatever, and it just plays at about that level, maybe stronger or weaker depending on how far it can look ahead.
2. You actually have a strong bot but you want to make it weaker, so you introduce conditions that make it lose points intentionally.
In case 1, you can maybe manually patch flaws to stop it losing to certain tricks.
If it’s case 2 and you’re wondering why your bot isn’t at a stable rank, maybe you need to adjust when, and by how much, it decides to intentionally play a bad move (see the sketch below).
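In case 2, the usual failure mode is that “play a deliberately worse move” is applied uniformly, with no cap on how much the move actually gives up. A minimal sketch of a score-aware version (the function, parameter names, and the candidates format are all made up for illustration):

```python
import random

def pick_weakened_move(candidates, blunder_rate=0.1, n_alt=2, max_loss=5.0):
    # candidates: (move, score) pairs sorted best-first, as a strong
    # engine would rank them.
    best_move, best_score = candidates[0]
    if random.random() < blunder_rate:
        # Deviate to one of the next n_alt moves, but only if it gives
        # up at most max_loss points; without this cap an "intentional"
        # bad move can occasionally throw a completely won game.
        pool = [(m, s) for m, s in candidates[1:1 + n_alt]
                if best_score - s <= max_loss]
        if pool:
            return random.choice(pool)[0]
    return best_move
```

Tightening `max_loss` in the endgame, where a single ignored atari is decisive, would address exactly the blunders described above.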
My feeling, based on playing amybot-ddk, is that it’s based on a stronger bot.
It plays quite well overall in the opening and middle game, and it’s fine if it can only read 5 or 6 moves ahead to capture stones; it’s supposed to be ddk.
But then it ignores random ataris in the endgame (even though it responds to ataris otherwise), as in the examples above.
The 19x19 amybot-ddk is currently playing just about as well as it can so all the blunders are in fact the neural network getting it wrong. I looked at what it was thinking for the first game you pointed out:
The engine does realize that F13 is the best move after a little bit of exploration of the MCTS tree, but it still also thinks E7 is a fine move, and it hasn’t explored black A3 as a response at all!
And in general, I’m super grateful to @shinuito for looking through games and pointing out these blunders! I could/should have done the same myself, but you really do highlight a major problem, and I think your wider explanation of it is also spot on.
Fixing this does require training a better network, though. I think I have better training data available now than when I first trained the amybot model, and the first test networks seem to be less prone to big blunders.
It’s interesting, I was curious how the bot could play so well in the opening and middle game and then make random blunders like that.
I suppose that can happen with neural nets alright if they have some kind of blind spot.
I wonder if it’s similar to, but much simpler than, the issue where KataGo was struggling with the liberties of groups that form circle/ring shapes, like when the liberties are too far away from where the move is being played.
I wonder whether making the liberties of groups explicit, so they factor directly into the net’s/AI’s decision, would be an improvement, like an extra input feature for the algorithm.
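Concretely, that could look like one extra input plane fed to the net alongside the usual stone-position planes. A minimal sketch of the idea (just an illustration, not how amybot or KataGo actually encode their inputs):

```python
import numpy as np
from collections import deque

def liberty_plane(board, clip=8):
    # board: 2D array, 0 = empty, 1 = black, 2 = white.
    # Returns a plane where each stone's cell holds its group's liberty
    # count (clipped), which the net could consume directly instead of
    # having to infer liberties from stone positions alone.
    h, w = board.shape
    plane = np.zeros((h, w), dtype=np.float32)
    seen = np.zeros((h, w), dtype=bool)
    for y in range(h):
        for x in range(w):
            if board[y, x] == 0 or seen[y, x]:
                continue
            color = board[y, x]
            group, liberties = [], set()
            queue = deque([(y, x)])
            seen[y, x] = True
            while queue:                      # flood-fill the group
                cy, cx = queue.popleft()
                group.append((cy, cx))
                for ny, nx in ((cy-1, cx), (cy+1, cx), (cy, cx-1), (cy, cx+1)):
                    if 0 <= ny < h and 0 <= nx < w:
                        if board[ny, nx] == 0:
                            liberties.add((ny, nx))
                        elif board[ny, nx] == color and not seen[ny, nx]:
                            seen[ny, nx] = True
                            queue.append((ny, nx))
            for gy, gx in group:
                plane[gy, gx] = min(len(liberties), clip)
    return plane
```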
But sorry, I don’t mean to sound negative about the bot, I just wanted to point out one factor I think contributes a lot to rank instability.
The amybots are very cool overall
The same would be true for Agapanthus, Bergamot, etc., where they might randomly play a worse move mixed in with otherwise KataGo-like play. They’ll be playing well and then just break, play dumpling-looking moves, and lose the game.
Technically this kind of move is bad, and it might throw the game, but not when the opponent goes on to play moves that let stones be captured, and the bot definitely knows how to capture.
So this might be a different kind of volatility: not from a net failing to predict the right move, but from initially picking a bad move, and then your opponents (it’s meant to be like a 25-kyu bot) not being able to capitalise on it, so you go back to destroying them.
There are games where it will properly throw the game with bad moves.