Making my own simple bot: Euroton

JohnnieDarko · October 8, 2025, 5:58pm

Awesome! Looking forward to seeing that, and eventually having our bots fight it out. Let me know if I can be of any help.

JohnnieDarko · October 14, 2025, 11:27am

I added a couple of big things:

Inluence map, semi-transparent stones and territory estimate
Every stone now emits influence, tapering off with distance. The bot emits positive influence, the opponent emits negative. At every point on the board, the sum is taken, leading to a positive or negative value. This is usable as a rough territory estimate, although it doesn’t take into account wether stones are dead.

Stones “hidden” behind others also emit some influence, though only a fraction of what they’d had with direct line of sight. This is called semi-transparent stones.

Example: There’s empty point A1 and a stone on C2. The stone is 3 steps away from A1 so it emits strong influence on A1. If the shortest path gets completely blocked by two stones on B1 and B2, then it’s influence over A1 will be less. However, if a is only on either B1 or B2, then the influence of the C2 stone on A1 remains unchanged, as the C2 stone can still find an unblocked shortest patch to A1.

image1484×1461 103 KB
```
[ownership grid with solid stones] (+Black -White)
-0.69 ##### ##### +0.37 
-0.71 ##### -0.35 -0.22 
```
```
[ownership grid with 50% transparent stones] (+Black -White)
-0.64 ##### ##### +0.24 
-0.69 ##### -0.35 -0.22 
```
Influence delta (the big moves)
Building on the previous step, the total board sum is taken over the influence values per position. Basically one number who has the most influence / rough territory. Then on each empty position a move is simulated, and this inluence sum is recalculated. The move that changes the influence the most (the largest delta), recieves the highest score.

Stones in open or unsettled positions change the influence more than in strongly settled ones. So this step identifies the positions that are most effective, or that’s the idea.

The downside is that it can take unfortunately up to 2 minutes on 19x19 on a regular laptop per move. To speed things up this calculation is limited to a certain radius around the virtual stone (now set to 6), which brings it down to an acceptable couple of seconds. This does trade in some precision. With full vision, Euroton (black) will play C6 here, but with the smaller range it finds F3 just as good.

The transparency of stones have an effect in the bottom right below. The bot likes S3, because it increases liberties the most, and it also flips T1 strongly in favor of “black territory”. The rest of the board is blocked off by the white wall, so it doesn’t care about that. However…

… setting all stones to 50% transparent, the black stones in the corner now gain influence over the rest of the board, so there’s no reason to shy away from a wall. It probably would’ve done this move as well without the influence_delta at all, but this way the influence delta is not getting in the way of other good moves.

It does require tweaking to see how this slots in with the existing rules.

Btw, new iron pillar fuseki dropped.

Further added is a very strong “don’t needlessly connect a diagonal connection”-exception to the Connectivity rule. Many moves were wasted on this, but now it’ll only connect if the diagonal is under risk of being cut from an opponents stone (ie there’s a stone in the diagonal).

Now I’m coding a defense against monkey jumps / slides. GnuGo does relentless monkey slides, and the influence delta rule doesn’t identify it as a huge enough risk to defend it. Without hardcoding a block against monkey jumps it’s hard to keep any form of territory or even have a representative demo game.

richyfourtytwo · October 14, 2025, 1:42pm

My current best bot is named Hubert. Hubert knows how to capture and has some knowlegde about good/bad patterns. He has no idea how the game is counted and passes randomly unless there is an attractive move (as per his pattern preferences) left. (I’m using Tromp-Taylor rules.) In other words: Hubert is still incredibly stupid, yet about 2350 Elo points stronger than Rob, who plays truly randomly.

Edit: let me know if posting this in your thread annoys you!

JohnnieDarko · October 14, 2025, 1:45pm

I enjoy you posting here!

How did you make it recognise patterns?

richyfourtytwo · October 14, 2025, 2:24pm

I (vibe-) coded a pattern ‘engine’, so I can define patterns in small txt files. Here is the one for empty triangles:

MM
Xo

X represents the newly played stone, M stands for My stone and o for not opponents stone. I have a few other categories like empty/non empty/board edge.

My ‘players’ (or bots) are ini files in which I can assign float values to patterns. So empty triangle would get a small negative value.

I don’t have much besides these patterns yet. The most important non-pattern feature is detecting if a move captures. But this too gives a (configurable) value per captured stone.

I’ll try with sticking to pure move evals for as long as posssible, as this means I don’t have to copy board states and execute moves. But I’m pretty sure that eventually I’ll have to give up on that.

Here’s an example game of Hubert against Julia, who is about 400 Elo weaker. She’s lacking a few patterns and more often plays a move other than the top candidate (higher randomness): hubert vs. julia

JohnnieDarko · October 14, 2025, 2:57pm

That’s awesome, that looks like a lot of fun to make. I think you can get quite far with not executing moves, although you’ll have the same “if it looks like a ladder then just avoid it entirely” until you have enough patterns to cover your bases. How do you measure elo?

So added the Monkey Jump Defense rule, and it works perfectly, but only in the case with adjacent stones, where the opponent has a stone on the 2nd line diagonally under my 3rd line stone. GnuGo isn’t that easily defeated…

Game 33 - Euroton (B) vs. GnuGo lvl 1 (W)

This corner looks great, the sides are blocked off, finally some points for Euroton.

Oh no..

Game 33b - GnuGo lvl 1 (B) vs Euroton (W) - B+200pt

A great territory on the side with no opponent stones nearby…. gone in 3 stones.

>>

So I do still have to make something, preferably something that uses territory information that we have in an effective way.

richyfourtytwo · October 14, 2025, 3:10pm

I have 24 bots by now and they played lots of games against each others. Average ELO distance to neighbors about 100.

JohnnieDarko · October 23, 2025, 1:01pm

I added a debug grid with pretty gradients to see what’s going on in the bot. The top move gets a score of +100 (cyan), and then the rest is scaled from there.

Bot (Black) picks E3, with D4 as #2 move

Bot (white) picks C4

Bot (white) picks F3

The bot has the tendency to make freestanding iron pillars (two adjacent friendly stones), especially when it can “headbutt” into opponent stones, see white below.

This is because this move double-dips in rules; it’s both extending liberties of a friendly group, and reducing opponent liberties.

added: a simple multiplier that reduces the reward of extending liberties of stones or groups that are free-standing, not touching an opponent

>>

As a result, it again plays a little better than before, getting close to losing by “only” 100pts.

Game 40 - GnuGo (B) vs Euroton (W) - result: B+155pt

JohnnieDarko · October 23, 2025, 2:00pm

I started testing the bot on Beta OGS. The bot renamed to Oroton for easier pronunciation. Feel free to do a game against it.

Note: It doesn’t know when to pass or resign, (it will never stop playing until it runs out of legal moves), but it passes when you do.

I am out of the house for the next few hours, so if it crashes, that’s unfortunately it. But so far it seems to be able to play multiple games at once. Enjoy

richyfourtytwo · October 26, 2025, 3:26pm

First battle of the bots:

That was Soraya, one of my stronger bots atm. (Yeah, I know, hard to believe!)

Why did she not play C4 instead of passing? Because she’s far too strong for that and knows to avoid empty triangles!!!

Feijoa · October 26, 2025, 4:39pm

If you want to add that the easy way, gtp2ogs can delegate pass and resign to another bot like Gnugo. You can see my config for random-move-nixbot here for example.

ending_bot: {
     command: ["${gnugo}/bin/gnugo", "--mode", "gtp"],
     moves_to_allow_before_checking_ratio: 0.2,
     allowed_resigns: 3,
     send_chats: false
},

richyfourtytwo · October 26, 2025, 4:51pm

Ha, even my random player called Bob knows how to pass.

Guess how he decides!

hoctaph · October 26, 2025, 5:39pm

What is allowed_resigns ?
(I thought a player couldn’t resign more than once per game. )

JohnnieDarko · October 26, 2025, 6:40pm

added: territory edge detector to defend our own territory.
A candidate move is now determined to be the edge of our potential territory if it satisfies all conditions:
- The position is relatively neutral, not fully owned by either player.
- The position is at the edge of our territory. Positions in a radius around the move are checked, and the sum of our ownership values is taken. This makes it weigh bigger territories as more important.
- Has an adjacent opponent stone, or an open path to strong opponent territory (max distance 3).

Example:

Black has a strong influence / potential territory on the upper half (green, and yellow), and white has it on the lower half (red):

The territory edge detector finds the cells that allow a path into it, as well as the open paths on the side:

On 19x19 it identifies O9 as a hole in the lower right black territory.

There are still many situations where it doesn´t detect well so it needs more work.

Awesome, I was going to ask you to give it a try. Well it just goes to show that there are so many things we take for granted when we play the game that just ‘make sense’.

Brilliant, thanks for sharing! I saw the option in gtp2ogs indeed.

Feijoa · October 26, 2025, 7:29pm

IIRC it means your bot will wait until Gnugo has resigned 3 times before actually resigning.

I think that could be messing it up sometimes, like when after the first resign it can’t find a legal move.

JohnnieDarko · November 3, 2025, 2:26am

After many updates, the bot is highly improved in its defensive abilities. Subjectively I think it plays more human-like and blunders less. It is also stronger against GnuGo. On 9x9, the “old” Oct 23 bot (currently online) needed four handicap stones to win; the new version wins with three. Given GnuGo’s estimated 6–7 kyu strength and the handicap scaling of 5-6 ranks per stone on 9x9, it lands around 21-25 kyu. This is just a very rough estimate, take it with a spoonful of salt.

However, much to my chagrin, the Oct 23 version (which is currently online), beats the new version convincingly on 19x19, as white and as black… The new defensive abilities make it play more safe, but also too slow.

Example: New (white) vs old (black). White spends too many stones making territory in the bottom left that by the time it want to move to the center it’s already occupied by black. It also seems like white makes way too many eyes, but those were in fact all captures. Captures that didn’t need to be done, but the bot doesn’t know that yet.

>

I’m very annoyed that the new bot which can do everything the old bot can do and more, somehow loses. But realistically, it is simply a matter of tuning/balancing the new functions with the existing ones.

To improve sealing off its own territory, I rewrote most of how influence / potential territory was calculated and saved. That is mostly about fixing edge cases, such as:

A big improvement is that the Eye-maker rule now understands connected-eyes (probably not the right term). It wouldn’t play A18 below, rather prefering to connect at B18. This is because the old Eye-maker didn’t judge an empty diagonal around an eye as a safe location. But if that empty diagonal is another eye, it should be seen as a safe connection, and now it is.

As you can hopefully tell from grid below, it neatly identifies where to play (white) to create two eyes (the cyan +100 locations). But, it is a overly cautious in the bottom left corner, and in fact it lucks out there. It doesn’t like B3 and C3 because it splits eye space in two, which is what you’d expect. No, the algorithm sees the eye at B1 as an existing eye, and because that eye already exists, the creation of a single eye at B3 or C3 triggers the “this makes two eyes” rule for the group at A1. I can predict that this needs more work in the future..

Some 1st line patterns are hardcoded. In the situation below, there wasn’t really a strong incentive for black to connect on B15 or B11, so that pattern is now codified and always recognised and given a high value. I don’t want to make a pattern matching library, but in this case it’s either pattern matching, or playing out multiple moves to see that it is bad for territory, so I chose the simpler approach.

>>

Also added is a rule to prevent the opponent from connecting underneath on 1st or 2nd line.
In this case, the bot (black) will cut white stones to prevent them from sneaking into black territory. It triggers if there are opponent stones on both sides, and 1st or 2nd line is treated as an OR statement.

The downside of this pattern matching approach is that these moves are now also played if there is no territory to defend. This adds to the bot having become slower. But that’s preferable over a bot that has easy exploits.

Next up is to try to tune all the new stuff (I left out more than half) to work together.

anoek · November 11, 2025, 2:53pm

This is really cool @JohnnieDarko, I played Oroton on beta and it felt very natural and, importantly, weak which is great. I encourage you to take snapshots or versionify what you’re doing throughout the process, having a variety of different feeling weak and weakish bots would be a huge boon to the community.

JohnnieDarko · November 12, 2025, 1:30pm

@anoek Thanks for the positive feedback, glad you like it!

I do have snapshots of each update of the bot. They run as separate bots already, and with the way the rules are tuned, it’s also relatively easy to remove or emphasize certain behaviors. My idea was to have multiple versions of the bot running on OGS as well, so it’s great to hear that’s something you’d like as well.

Update version 25-11-11 (Now on OGS beta: Oroton ).

Added: Bot Arena and Weight Annealer

I like this idea to speed things up. I added a Bot Arena that pits two versions of the bot, A (baseline) and B (candidate), against each other using separate weight configs. Then the bots play 8 games against each other, each 4 times as black.

Each of the four games starts with one of the four best 9x9 openings (E5, D5, C5, D6) (as per @mark5000 ‘s new book), then proceeds normally. When finished, the Arena reports the winners, point margins, and captures, and saves each game as an SGF file.

Optionally the Annealer does a small random adjustment to bot B’s weights. Then they play another 8 games. If the winrate improves, these weights are the new candidate. If the winrate goes down, these weights are rejected. Then the anealler does a small adjustment to the weights, another 8 games are played, and so on. The idea is that eventually optimal weights for bot B are found (8 out of 8 wins).

However..
During many runs, it turns out it’s not too difficult, given enough run, to find weights that win all games. Yet those differ only very slightly from weights that lose all games. This is initially strange, but looking at the games themselves, they all show some strange moves. Most 9x9 games hinge on one capture, so it’s the random move here and there that then determines the outcome.

A little self-serving perhaps, but my manually tuned weights feel more natural and predicatable, using the rules as intended. So annealing isn’t yet very useful, but the Arena is great for spotting major blunders and coding errors.

Liberty Attack rule has become smarter.

Don’t attack weakly.
Added: Light penalty (tunable) on placing stones with only two liberties, to dissuade moves as:

image553×558 12.5 KB
Attack big groups with few liberties.

The rule now takes the opponent group size into account. It already took liberty-count in effect, so now attaching to a large enemy group with very few liberties would get the biggest bonus, and a small group with many liberties gets the least.

Also added into account is the size of adjacent groups. If a opponent group is one liberty away from another opponent group, both groups are taken into account for the size parameter. Don’t know if this works out yet.
Don’t waste moves on doomed groups
It isn’t always necessary to capture. If an opponent is in atari, but even if it adds another stone it’s still in atari, then there’s no need to capture it now.
*Bonus for Captures that simultaneously rescue a friendly group. *
In the example below, white has a group in atari (A4), but since it can’t find an escape, it gives up and plays B6. Pretty silly. With the new rule, it sees E4 not as just a capture, but a capture+rescue.

image526×679 42.9 KB
>>
image499×665 40.3 KB

The Connectivity rule also got an attacking upgrade:

Split opponent groups.
The Liberty Attack rule already rewards cuts that touch, this is about breaking connections in open areas. It detects and rewards moves that cut short paths (up to 3 intersections) between enemy groups. The more connections it breaks, the higher the score. It does overvalues ‘cutting’ moves in the middle of good shape, like P5, so it will need some tuning. At the same time, those are fun moves.

image1510×1495 113 KB

Added: Forced Sequence playout (proper ladder checker)

I discovered the ladder checker wasn’t working, because in this game black doesn’t save its stone (H8 and B8) twice because it thinks its an inescapable ladder. Turns out I never made the ladder checker I thought I had, it only looked one move ahead.

The new version simulates forced atari sequences step by step. If any possible escape still ends in capture, the move is rejected; (as the opponent can force you to take this losing path). If all paths lead to safety (3 liberties or a counter-atari), it’s allowed.

This pushes the limit of what I expect from a beginner bot. But it’s still a simple deterministic readout for a specific case, not a Monte Carlo simulation, and important for teaching good habits

added: Vital Points (pattern matching)

I’ve avoided large-scale pattern matching because it’s more memorization than understanding, but added a few core shapes like straight-three, bent-three, square-four, and bulky-five. Example: bulky five, even with a friendly stone already present, finds the vital point on A2.

This new version (25-11-11) of the bot is now online at OGS beta: Oroton

The previous version has run stably for two weeks. The next step is fine-tuning and letting it play in the real world.

hoctaph · November 12, 2025, 3:37pm

“During many runs, it turns out … weights that that win all games.
Yet those differ only very slightly from weights that lose all games.”

Does this still happen if you randomize the opening more? For example,
the first some number moves are chosen
uniformly at random from among legal board plays.

(I would start with 9, to match the 9x9 board. )

Myrsilos · November 12, 2025, 8:37pm

Very interesting! Seems to imply that how the bot plays (at least as a human would understand it) isn’t “continuous” as a function of the weights. Otherwise very slight changes in the weights wouldn’t result in a switch from 8-of-8 win to 8-of-8 loss. Is it possible that it’s results from an interaction effect between the weights? Or maybe specific weakness of particular rival candidate weightings?