Honestly my main problem with it is the wording and how it is being misinterpreted by people who don’t know Go. Some people are putting “99% win rate” and “superhuman AI” in the same sentence, when the adversarial agent only gets that sort of win rate against the raw policy, which isn’t superhuman. They also mostly don’t seem to know WHY the adversarial agent wins (early passing by KataGo, not because it outplays KataGo in any way; no wonder it loses to amateurs). And the fact that KataGo wins once you give it realistic amounts of playouts also seems to be missed by most articles talking about this.
For example, whoever wrote this article doesn’t seem to realize that 64-playout KataGo isn’t really full-strength KataGo, and genuinely seems to think KataGo is “only” near-superhuman instead of outright superhuman because the paper calls it “professional-level”.
I cannot blame them. At this point I wouldn’t even call it “misinterpreting” so much as “believing a deliberate lie”.
Someone genuinely asked the question “Is this paper simply wrong?” and got a response from the authors of the paper:
We used Tromp-Taylor rules, modified to remove opponent stones from within groups that can be shown to be unconditionally alive via Benson’s algorithm.
Scoring then proceeds with regular Tromp-Taylor rules.
This seems like the essence of the problem. How is it valid at all to change the rules after passing?
White wins under the “modified” rules it was supposedly playing with.
Edit: I think I was confused about how these modified rules work. Still seems strange to try to apply these rules in a situation with no unconditionally alive groups.
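For what it’s worth, plain Tromp-Taylor area scoring (before any Benson-style modification) is mechanical: a player’s area is their stones plus the empty regions that border only their stones, and anything left on the board counts as alive. A rough Python sketch (the board encoding and function name are mine, not from the paper):

```python
from collections import deque

def tromp_taylor_score(board):
    """Plain Tromp-Taylor area scoring.

    board: list of strings, one char per point:
      'B' black stone, 'W' white stone, '.' empty.
    Returns (black_area, white_area): a player's area is their
    stones plus the empty regions bordered only by their color.
    """
    rows, cols = len(board), len(board[0])
    score = {'B': 0, 'W': 0}
    seen = set()
    for r in range(rows):
        for c in range(cols):
            p = board[r][c]
            if p in score:
                score[p] += 1
            elif (r, c) not in seen:
                # Flood-fill this empty region and record which
                # colors border it.
                region, borders = [], set()
                queue = deque([(r, c)])
                seen.add((r, c))
                while queue:
                    rr, cc = queue.popleft()
                    region.append((rr, cc))
                    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
                        nr, nc = rr + dr, cc + dc
                        if 0 <= nr < rows and 0 <= nc < cols:
                            q = board[nr][nc]
                            if q == '.' and (nr, nc) not in seen:
                                seen.add((nr, nc))
                                queue.append((nr, nc))
                            elif q in score:
                                borders.add(q)
                # An empty region only counts if it touches one color.
                if len(borders) == 1:
                    score[borders.pop()] += len(region)
    return score['B'], score['W']
```

Which is exactly why passing early is so costly under these rules: the “dead” stones the adversary leaves behind still count unless they are actually captured.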
I think they should have the adversarial net play against KataGo with equal hardware and time instead of an arbitrary low fixed visit count. Right now they are giving the adversarial net 600 playouts while KataGo only gets a single one. KataGo’s policy isn’t even how KataGo selects moves in the first place; that’s normally the job of the search and the value head. I don’t care how strong the policy is supposed to be by itself (they claim top-1000 professional level), equal time on equal hardware is the only fair way to compare two different nets.
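To illustrate the gap between the two modes of play: the raw policy just takes the network’s prior over moves, while the full engine runs a visit-based search in which the value head can overrule a bad prior. A toy one-ply PUCT-style sketch (not KataGo’s actual search; the move names, constant, and stand-in evaluator are all made up):

```python
import math

def pick_move_raw_policy(priors):
    """The 'victim' in the low-visit setting: just the argmax of the
    policy net's prior, with no search and no value head."""
    return max(priors, key=priors.get)

def pick_move_with_search(priors, evaluate, n_playouts=600):
    """Toy one-ply PUCT search: visits concentrate on moves whose
    value estimate holds up, so a bad prior favorite gets corrected.
    `evaluate(move)` stands in for the value head."""
    visits = {m: 0 for m in priors}
    value_sum = {m: 0.0 for m in priors}
    c_puct = 1.5
    for _ in range(n_playouts):
        total = sum(visits.values())
        def score(m):
            # Exploitation term: mean value of this move so far.
            q = value_sum[m] / visits[m] if visits[m] else 0.0
            # Exploration term: prior-weighted, shrinks with visits.
            u = c_puct * priors[m] * math.sqrt(total + 1) / (1 + visits[m])
            return q + u
        m = max(priors, key=score)
        visits[m] += 1
        value_sum[m] += evaluate(m)
    # Final move choice: most-visited, as in AlphaZero-style engines.
    return max(visits, key=visits.get)
```

With priors like `{'pass': 0.6, 'defend': 0.4}` and a value estimate that favors `'defend'`, the raw policy plays `'pass'` while even a modest search budget lands on `'defend'`.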
It’s frankly surprising that it got as much attention as it did, considering it is mind-numbingly boring from both the go and academic perspective.
I mean, it has a great combination of topics – the game of Go, fancy deep learning techniques (adversarial networks), authors from MIT. If we don’t go under the surface, it looks interesting.
It is also unclear which stones they remove, as “within” is not defined in the rules. If it is defined with respect to the edges of the board, there can be a pass-alive group within another pass-alive group, but it would be absurd to remove the contained group.
(Actually when I first came across Go in an encyclopedia I thought that was how it worked and played several games on that basis with fellow-pupils. To be alive was to be connected to the edge of the board)
A new version of the article in question is out!
First updated version:
We hardcode a defense for KataGo by making the victim not pass until it has no more legal moves outside its territory. With more training, we are able to find another attack against the victim, achieving a win rate of 99.8%. The adversary gets the victim to form a circular structure and then kills it.
Second updated version:
With 2048 visits, KataGo’s latest network plays at a superhuman level, but our adversary still achieves a 72.6% win rate.
Looks pretty damn weird, huh.
Someone, analyze these new games with OGS Kata
Can you help upload them? I can run the level IV analysis, but I’m on my phone right now, so annoying to manage the SGF files.
Works on my home Kata. It thinks white is winning and can play wherever, even though once black retakes the ko, white is dead. Do I see that correctly?
2048 visits Kata vs Adversary, 6 games where Kata lost
Thanks for uploading the games. Analysis started and most are still churning through, but it looks like our analysis engine is also similarly affected by these attacks. The score estimation graphs have some stunning jumps, when the engine finally realizes (much too late) that the game is not what it seems. Or maybe it is just showing a stunning blunder on the part of the victim in some cases. Need to check the analysis more carefully to understand.
This does look like a satisfying adversarial attack. Anyone wanna try it against katago themselves? I wonder if the arrangement is very specific or a human can learn it.
OGS uses different network so obviously it’s not gonna match perfectly. But you can find the positions where it’s mistaken too.
Like here it appears to say the best move is bottom-right but once black retakes ko white can’t do anything.
Seems this ko on the inside that’s too big is the thing that does it, huh.
This game (#4 in the series above) is quite weird:
The adversary manages to create two chances to capture black’s dragon. On the first chance, the adversary blunders by playing elsewhere, but then a few moves later gets another chance to capture, which it finally takes.
On the second chance, here’s the OGS Kata analysis of the position, which does not seem to make much sense:
The AI analysis does recognize that the played move Q6 is a massive blunder, since it does not resolve the massive ko situation in the top-left to save the dragon. However, it thinks that R8 is a reasonable move, even though (I think) that should also be considered a massive blunder for letting the dragon die in ko. I guess this disparity might be explained by the analysis engine not spending enough playouts on side variations, as opposed to the actual game line? It does not even seem to suggest any reasonable moves, like A14, to save the dragon.
Would be interesting to see what would happen now if they froze the adversary and allowed KataGo to learn how to counter the strategy. It seems like the strategy still revolves around giving KataGo a massive lead, making its value head almost useless since all moves have the exact same winrate (100%). When that happens, KataGo is unable to differentiate between moves and considers a ton of candidates at once, each only getting a couple dozen playouts at best. I wonder if adjusting komi internally after every move to make the game seem even would help.
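If anyone wants to play with that komi idea, the core of it is just a root search for the komi at which the value head would report roughly 50%. A hypothetical sketch (the sigmoid winrate model below is made up for illustration, not KataGo’s value head):

```python
import math

def internal_komi(winrate_at_komi, lo=-150.0, hi=150.0, target=0.5):
    """Bisect for the komi at which the winrate (from Black's view,
    monotonically decreasing in komi) crosses `target`.
    `winrate_at_komi` stands in for querying the value head."""
    for _ in range(60):
        mid = (lo + hi) / 2
        if winrate_at_komi(mid) > target:
            lo = mid  # Black still favored: raise komi.
        else:
            hi = mid  # White favored: lower komi.
    return (lo + hi) / 2

# Toy stand-in for the value head: winrate as a sigmoid of
# (Black's point lead minus komi); purely illustrative numbers.
def toy_winrate(komi, lead=80.0, scale=10.0):
    return 1.0 / (1.0 + math.exp(-(lead - komi) / scale))
```

Under this toy model, a game where Black leads by 80 points gets neutralized by an internal komi of about 80, which would restore contrast between candidate moves instead of everything reading as a 100% win.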
Very nice, it seems the adversary is always building multiple locally dead groups inside a huge territory and then KataGo neglects the capturing race. I wonder if it would pass if it were allowed to (but I suspect it wouldn’t).
And it’s a nice example of how science works. Their previous work was flawed in some ways (few visits, ruleset discussions), they took the criticism to heart and came up with something better (as far as I can tell).