@Toscani said in #5:
Thanks for the interesting link.
storage.googleapis.com/uncertainty-over-space/alphachess/index.html
I should have looked that before jumping on the paper and its appendices....
That link states that they did feature extraction first (which i need to read more closely), and then looked for correspondance with known concepts (using SF somewhere from version 8).
I find interesting the historical human population evolution of opening preferences (not sure how that is quantified or defined, yet), and that of A0 self-play epochs. I mean the question is worth wondering, although I am not sure that A0 self-play is to be expect to follow population of learning trajectories that might be behind the human population statistics. or i am not yet getting the meaning of those graphs..
here is my guess (not having read paper for equivalent figures yet). Epoch might be about self-play batches, where the previous pair of self instances, with one not learning and the other learning are changed for a new learning epoch of self play starting with the same learned previous instance on each side.. That is for the horizontal axis.
For humans (lower curves), we have human years. from 1978 to 2018.
Hovering over one figure through the various connected curve we see that the 2 figures are connected for their actual move sequence (on the right some display of which sequence with a color code).
The color code is redundant hint about which band is which initial sequence of moves.. so the prevalence would be the area between bounding curves. same color same sequence..
Q: is A0 epoch during some number of self play independent runs or just one run and the many games played. That last possible misunderstanding of mine may be was is itching me in that comparison. The human games being played by many pairs in a population of possibly high level players..
So not sure how to interpret this parallel display. Would we have expected the human population to learn over years how to play chess. One may be population knowledge evolution and spread of news and rumors and likes being displayed, while the other is more about an individual learning with many many games.
I thought i would give some legend to the figures there.. hopefully not that off-track. if so, i would be glad to be corrected or opinionated otherwise.
@Toscani said in #5:
> Thanks for the interesting link.
> storage.googleapis.com/uncertainty-over-space/alphachess/index.html
I should have looked that before jumping on the paper and its appendices....
That link states that they did feature extraction first (which i need to read more closely), and then looked for correspondance with known concepts (using SF somewhere from version 8).
I find interesting the historical human population evolution of opening preferences (not sure how that is quantified or defined, yet), and that of A0 self-play epochs. I mean the question is worth wondering, although I am not sure that A0 self-play is to be expect to follow population of learning trajectories that might be behind the human population statistics. or i am not yet getting the meaning of those graphs..
here is my guess (not having read paper for equivalent figures yet). Epoch might be about self-play batches, where the previous pair of self instances, with one not learning and the other learning are changed for a new learning epoch of self play starting with the same learned previous instance on each side.. That is for the horizontal axis.
For humans (lower curves), we have human years. from 1978 to 2018.
Hovering over one figure through the various connected curve we see that the 2 figures are connected for their actual move sequence (on the right some display of which sequence with a color code).
The color code is redundant hint about which band is which initial sequence of moves.. so the prevalence would be the area between bounding curves. same color same sequence..
Q: is A0 epoch during some number of self play independent runs or just one run and the many games played. That last possible misunderstanding of mine may be was is itching me in that comparison. The human games being played by many pairs in a population of possibly high level players..
So not sure how to interpret this parallel display. Would we have expected the human population to learn over years how to play chess. One may be population knowledge evolution and spread of news and rumors and likes being displayed, while the other is more about an individual learning with many many games.
I thought i would give some legend to the figures there.. hopefully not that off-track. if so, i would be glad to be corrected or opinionated otherwise.