or the model could include time windows of histories of pairings. (distinguishing IDs from rating, to define how much of the maximal pool has actually been effectively sampled... many games of same rating but different individuals, or many games with same individuals.).
also, what if we want to play the same person for many games, and ratings be damned....
should rating only serve to put eveyone on some ladder? or just be about expected game difficulty given entrant ratings..I guess that depends whether chess is only happening in tournaments or has a background murmur of activity and single games across the pool do happen without tournament wrapping, or within an effective pool, subset of the maximal pool.
this brings back to what do people want the rating to help with.. then mitigate that with what systematic measure systems at the population scale can do.. not what we might abuse them to be.
the idea of true rating or true strength might be wanting too much. and is not defined in any rating system anyway.
I am not my rating.. never assumed that.. and i only want my online rating to give other players that don,t know me some sense of difficulty of my play if they were to accept playing with me... i would not want to be either too much underrated or too much overrated. but i care mostly about having a good game... so I do need ratings... But I am not talking about tournaments. We have seen that they have own constraints (some essential to their physical possibility).
Actually the idea of looking at history time window for relative pool "isotropic" sampling was a bit coarse. For the glicko** rating system convergence results (pool size?, or game events size? not sure) to hold, I am not sure that the pairings of each individuals at some current rating (or during a time window, also not sure) need to sample all possible ratings, perhaps there is only the need, given a fuzzyness or confidence level at population ordering level (ladder focus), that there be enough connectivity between strates (i said given a confidence level, as i would bet it would affect the size of the bands of ratings around any rating that need to be connected to other bands.
I think in tournament (from afar and outside), the tiering obligation would also be dependent on that, although I think the purpose of the tournament is to clarify from those band approximations of ordering, through a limited entry pool and duration of whole event a clearer ordering... There might be population size and time windows criticality in any context... anyway.
** (and or that kind of rating system based on individual estimate assumptions, not whole population)
or the model could include time windows of histories of pairings. (distinguishing IDs from rating, to define how much of the maximal pool has actually been effectively sampled... many games of same rating but different individuals, or many games with same individuals.).
also, what if we want to play the same person for many games, and ratings be damned....
should rating only serve to put eveyone on some ladder? or just be about expected game difficulty given entrant ratings..I guess that depends whether chess is only happening in tournaments or has a background murmur of activity and single games across the pool do happen without tournament wrapping, or within an effective pool, subset of the maximal pool.
this brings back to what do people want the rating to help with.. then mitigate that with what systematic measure systems at the population scale can do.. not what we might abuse them to be.
the idea of true rating or true strength might be wanting too much. and is not defined in any rating system anyway.
I am not my rating.. never assumed that.. and i only want my online rating to give other players that don,t know me some sense of difficulty of my play if they were to accept playing with me... i would not want to be either too much underrated or too much overrated. but i care mostly about having a good game... so I do need ratings... But I am not talking about tournaments. We have seen that they have own constraints (some essential to their physical possibility).
Actually the idea of looking at history time window for relative pool "isotropic" sampling was a bit coarse. For the glicko** rating system convergence results (pool size?, or game events size? not sure) to hold, I am not sure that the pairings of each individuals at some current rating (or during a time window, also not sure) need to sample all possible ratings, perhaps there is only the need, given a fuzzyness or confidence level at population ordering level (ladder focus), that there be enough connectivity between strates (i said given a confidence level, as i would bet it would affect the size of the bands of ratings around any rating that need to be connected to other bands.
I think in tournament (from afar and outside), the tiering obligation would also be dependent on that, although I think the purpose of the tournament is to clarify from those band approximations of ordering, through a limited entry pool and duration of whole event a clearer ordering... There might be population size and time windows criticality in any context... anyway.
** (and or that kind of rating system based on individual estimate assumptions, not whole population)