Hi, this sounds like what I’m doing for my Master thesis!
You might be interested in this dataset, if you do not have it already:
The ranks in these SGFs are inaccurate, unfortunately. So one option is to use the official open-source implementation of OGS’ rating system here and run it on your dataset.