Actually, convolution performs exactly the same pattern search; it just trades memory for processing every board location in parallel instead of sequentially. Mathematically they are identical, except that CNN filters are trained rather than hand-designed from patterns a human can understand. In fact I didn't even use these inside the model itself: they were used for generating questionnaires out of a player's game records, finding the patterns that appear most frequently to ask about (otherwise the questionnaires would be way too huge to finish, with thousands of combinations within local patterns). The player only needs to answer which patterns they have often played in the past, and discard those that don't fit their style. By default, the trained model stays neutral on uncertain patterns.

One of the perceptions I did try was how much influence, or how territorial, a move is. But in isolation that is not an easy concept to train, since human perceptions vary a lot and depend heavily on context. The height of a move is easy to detect, but not combinations of moves. Sometimes a lot of high positions are just a bunch of disconnected stones, not a group, and what kyu players of different levels think of as influence is "questionable" at best. Dan-level players are very picky about context, and they often don't agree with each other. The question is what constitutes a "moyo": do a few stones and a large jump constitute a territory? The early fuseki is easier, but that is just a few moves which can easily be split out, and some don't even consider those moves territorial at all, just "normal moves".
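To make the first point concrete, here is a minimal sketch of convolution as pattern search: a hand-designed 3x3 filter "searching" a board the same way a trained CNN filter would, just with human-chosen weights. The board fragment and the pattern are made up for illustration; they are not taken from the questionnaire system described above.

```python
import numpy as np

# Tiny 5x5 board fragment: 1 = black stone, 0 = empty (hypothetical position)
board = np.array([
    [0, 0, 0, 0, 0],
    [0, 1, 0, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
])

# Hand-designed pattern: two stones diagonally flanking a center stone
# (an arbitrary local shape, chosen only to show the mechanism)
pattern = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [0, 0, 0],
])

def match_scores(board, pattern):
    """Slide the pattern over the board and count matching stones at each
    offset. This is plain cross-correlation, i.e. what a conv layer computes,
    only with fixed weights instead of trained ones."""
    ph, pw = pattern.shape
    bh, bw = board.shape
    out = np.zeros((bh - ph + 1, bw - pw + 1), dtype=int)
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = int(np.sum(board[i:i + ph, j:j + pw] * pattern))
    return out

scores = match_scores(board, pattern)
print(scores.max())  # -> 3, a full 3-stone match at the window centered on (2, 2)
```

A trained CNN filter is the same operation; the difference is only that gradient descent fills in the weights, so the learned "pattern" is usually not something you can read off and name.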
The fact is, we can actually use territory estimation as a guide to where territory might be, but the more complex joseki and life-and-death situations often mess it up really badly. High-level concepts we think of as concrete turn out to be pretty fuzzy once you try to build a training corpus for them in practice (you need far more training data of all kinds for supervised learning to pick up the variations and separate the "noise" from nearby stones; they are not easily separated inside a CNN). And reinforcement learning, say with PPO, would need heavy human-in-the-loop checking of the generated model output to stay stable. (From the very preliminary tests I've done, without a very sensitive discriminator model for PPO, the model only works fine on what it has seen before; it doesn't generalize well and hallucinates a lot about what its "territory moves" are, since it doesn't actually differentiate "side territory" from "territory in the air", and often just applies the side patterns to the air.)
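For readers unfamiliar with the kind of territory-estimation guide mentioned above, here is a naive influence-map sketch of my own (not the method used in the experiments described): each stone radiates influence that decays with distance, and the sign of the summed field hints at whose territory a point might lean toward. As noted, joseki and life-and-death will happily break a heuristic this simple.

```python
import numpy as np

def influence_map(black, white, size=9, decay=0.5, radius=3):
    """Sum decaying influence from each stone within `radius` (Manhattan
    distance); positive values lean black, negative lean white.
    All parameters here are arbitrary illustration choices."""
    field = np.zeros((size, size))
    for stones, sign in ((black, 1.0), (white, -1.0)):
        for (r, c) in stones:
            for i in range(size):
                for j in range(size):
                    d = abs(i - r) + abs(j - c)
                    if d <= radius:
                        field[i, j] += sign * decay ** d
    return field

# Toy 9x9 position: two black stones near one corner, one white stone opposite
black = [(2, 2), (2, 4)]
white = [(6, 6)]
field = influence_map(black, white)
print(field[2, 3])  # between the black stones: positive (black-leaning)
print(field[6, 5])  # next to the white stone: negative (white-leaning)
```

This illustrates why such maps are only a guide: the field has no notion of life, death, or connection, so a dead group radiates "influence" just as happily as a living one.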
The meta-content slips into the generation when keywords (black, white, move, liberty, 3-4, 4-4) are added, tricking it into thinking the text is about playing a Go game, even if the prompt specifically states it is not.
But sometimes some Go terminology like "just one liberty remaining" still slips in, which someone who has never played Go wouldn't even understand.
Maybe you can also try other open-source LLMs (like a task-fine-tuned Vicuna); they should keep getting stronger given some time (if GPT-4 only provides a slight improvement, open-source models probably aren't far behind).
It should go off-topic a bit less, but it could be interesting to try to make it go off-topic and share the results here. For example, I know you can get it to go off-topic by starting with a Go-related question and then asking about another topic in the same question, e.g. "what is sente, and what's the capital of the Netherlands?"