I’ve been noticing this in more and more of my 9x9 games specifically (although the issue might also exist in 19x19). Here you will see an in-game estimator showing that black is up by 17.5.
Basically the old estimator (in game) could be ok in some situations, but if it’s dealing with anything complicated or uncertain, it’ll depend on how many playouts it does, and possibly even which playouts.
The post game one is Katago so it knows in some extent quite a lot of variations that could happen and will likely tell the status of stones that potentially even the players don’t know. It was deemed (probably correctly) to be too good to be allowed in game.
The score estimator is deliberately weak, as explained in older threads. It is an estimator. If you click it several times consecutively, you will get different results each time, at least late in the middle game of a 19x19 game.
I don’t think he is referring to the autoscore, which scores the game. He is talking about using the SE after the game is over (i.e., going back some moves). His second diagram is not the autoscore, because it shows territory scored where the boundary is not closed. The autoscore does not do this.
I know, but the estimator post game, in demo boards, and for spectators (not from the same IP the linked post says) of an ongoing game, is supposed to be katago from what I’m aware.
See the tiny squares in the second image as compared to the first? The katago estimator does that to reflect how certain it is on who controls which point, the old estimator doesn’t do that
So then, Katago is in two different configurations? One for the autoscore, where if follows the rule about boundaries, and another for the post-game SE, where it ignores the rule about boundaries? I didn’t know that.
My vague impression is that autoscore asks Katago to estimate the final ownership of each intersection and simply sums that up.
The strong score estimator seems to just ask Katago to estimate the score, which could of course be very different from autoscore and the weak in-game estimator.
In no case is any “rule about boundaries” followed as far as I know.