A couple of questions about handicap and rating

With komi 0.0 and no compensation for handicap stones, yes, by definition in my opinion

Finally lost provisional, and am 4k, so one rank below my Main Account. It remains to be seen if I can climb higher

So back to the original question:

If someone has significantly different ratings for live and correspondence, why are handicaps being determined by the overall rating, and not by the appropriate rating for the time control? It looks like we have a system thatā€™s designed to avoid the need for multiple accounts, but isnā€™t actually doing its job. And the various chess sites have demonstrated that it can work very well this way.

4 Likes

I forget which threads, but I remember reading that this is because the
overall ratings are more accurate, due to accounting for more games.

2 Likes

Yep, thatā€™s the party line, but even granting thatā€™s true, I maintain that there are compelling reasons to compromise on that in order to provide a better overall experience

1 Like

Um, what does ā€œmore accurateā€ mean in this context? Is there data to prove that people donā€™t actually play at different strengths in different time limits, and that differences in the ranks are some kind of illusion? Iā€™m sceptical, but willing to be convincedā€¦ In any case, if creating multiple accounts is the recommended option, then it doesnā€™t look like the system is working.

It means the overall rating was better at predicting the outcome of games.

2 Likes

OK, thatā€™s surprising and interesting! Can anyone easily dig out a link to the threads where this was discussed?

Allegedly. They neither published the data nor did they make it available via the API. So there is no way for me to verify this. This topic has come up before.

1 Like

Ratings information exists via the public API, otherwise it could not be displayed on the profile page. Itā€™s possible to see what APIs are being called if you go to Browser Developer Tools > Network tab.

I can share a route once I get to a computer, but I figured ā€œteaching to fishā€ might be more useful anyway.

3 Likes

Youā€™re right in theory the data has to come from somewhere. But I did exactly that a year ago and there was some issue.

But Iā€™m looking forward to be taught how to fish. Maybe you can figure it out!

1 Like

Okay, just checked. There are a few XHRs from that page, but I think the one youā€™re looking for is /termination-api/player/{PLAYER_ID}

Screenshot 2024-05-01 at 10.21.47 AM

The ratings field has the full breakdown from the ratings table:

"ratings": {
    "9x9": {
      "rating": 1388.705030253521,
      "deviation": 68.04753746552261,
      "volatility": 0.060000229890117066
    },
    "live": {...},
    "13x13": {...},
    "19x19": {...},
    "overall": {...},
    "version": 5,
    "live-13x13": {...},
    "live-19x19": {...},
    "correspondence": {...},
    "correspondence-9x9": {...},
    "correspondence-13x13": {...},
    "correspondence-19x19": {...}
  },
1 Like

I donā€™t really understand the scepticism.

I feel like anoek has been fairly open to discussing any aspect of the data analysis

and previously

and you can even download a big batch of games and the rating system code

You can do your own experiments with that.

But specifically

Etc.

5 Likes

Ah, now I remember the issue I had a year ago when I did this. Look what I need for this analysis is all the game records of the players plus the different rating information at the time of playing. Through the API Iā€™m able to get the game records and Iā€™m able to get the current rating information, but I was not able to retrieve all of the playerā€™s category rating information at the time of playing. That is the route Iā€™m looking for.

Sceptic by nature, nothing personal.

I havenā€™t read through them again but iirc the first two links you posted only go into the result of the analysis and do not provide the source data. The goratings-repo is new to me, maybe thatā€™s exactly what I was looking for. Thanks for the link. I will check it out when Iā€™ve got time.

3 Likes

About this data, Iā€™m not very convinced.

  1. The total count of games doesnā€™t match: the tables for overall rating have more games in all handicaps. As far as I understand, the same games should be used for predicting with overall rating vs rating for different sizes/speeds.
  2. Even if this data is correct and the overall rating works well for a majority of players, that doesnā€™t rule out that there is a problem with an specific type of players which is significantly stronger in some specific size/speed. ĀæDoes this type of player exist in reality or not? Thatā€™s what we should investigate. If it exists, maybe the solution would be to let the player decide if the system should use their overall or specific rating.
3 Likes

if someone always play 9x9, reached high level and never played 19x19, of course they would be some ranks weaker on 19x19 for a while. Its rare but possible.

1 Like

current system may be better for most players, but of course it inevitably would cause problems to those rare cases

1 Like

100% separate ranks may be better for those rare cases, but it may cause problems to most players

1 Like

This ^

Iā€™ve seen a few players (including myself) with one or more outlier. But often it comes down to ā€œthis player hasnā€™t played this setting very much recentlyā€. Iā€™d be interested to know to what extent this specialized players exist.

Still, I think anoekā€™s finding holds pretty well overall. I havenā€™t seen strong counterexamples.

1 Like