In case you haven't been following, the PS server issues mean I can't run my stats scripts on the server, and copying them off is slow as balls.
If we wait to get all the logs in a place where I can process them, we're looking at a delivery date for the March stats some time in May, even if I just do the usage-based tiers.
There is a shitty alternative though: running stats over a subsample of the month's battles, maybe 10-100k battles for each tier?
That might only take a few days to copy over and should process in a few hours.
The primary issue is going to be figuring out a way to have the subsample be at least somewhat random, but I think that's just a matter of researching functionality in rsync.
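For what it's worth, one way to get a random subsample without relying on anything clever in rsync: pull down just the list of log filenames, pick a random subset locally, and hand that subset to rsync via its `--files-from` option. A rough sketch, assuming the logs are one file per battle and that a listing of relative paths is available (the function names and file names here are made up for illustration):

```python
import random


def pick_random_logs(paths, k, seed=None):
    """Return k paths chosen uniformly at random, without replacement.

    If k is at least the number of paths, every path is returned.
    The result is sorted so the transfer order is predictable.
    """
    paths = list(paths)
    if k >= len(paths):
        return sorted(paths)
    rng = random.Random(seed)  # a fixed seed makes the sample reproducible
    return sorted(rng.sample(paths, k))


def write_file_list(paths, out_path):
    """Write one relative path per line, the format rsync --files-from reads."""
    with open(out_path, "w") as f:
        for p in paths:
            f.write(p + "\n")


if __name__ == "__main__":
    # Hypothetical listing; in practice this would come from the server.
    all_logs = [f"ou/battle-{i}.log" for i in range(1_000_000)]
    sample = pick_random_logs(all_logs, 50_000, seed=2015)
    write_file_list(sample, "sample.txt")
```

The resulting file could then be used along the lines of `rsync -av --files-from=sample.txt user@host:/path/to/logs/ ./logs/`, where the host and paths are placeholders. Sampling per tier would just mean running this once per tier's listing.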
Thoughts?
Usage stats usually aren't within a tenth of a percentage point of the cutoff, so someone who's taken Stats 101 should be able to tell me what sample size we'd need to have high confidence in the outcome.
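A back-of-the-envelope way to answer the sample-size question is the usual normal approximation for a proportion: n ≥ z² · p(1 − p) / d², where p is the usage fraction near the cutoff, d is the margin you want to resolve, and z is the z-score for the confidence level. The sketch below assumes each sampled team is an independent draw, and the 3.4% cutoff in the example is just a placeholder, not necessarily the real number:

```python
import math


def required_sample_size(p, margin, z=1.96):
    """Smallest n for which a proportion near p is estimated to within
    +/- margin at the confidence level implied by z (1.96 ~ 95%).

    Uses the normal approximation to the binomial, so it is only a
    rough guide and assumes independent samples.
    """
    if not 0 < p < 1:
        raise ValueError("p must be strictly between 0 and 1")
    if margin <= 0:
        raise ValueError("margin must be positive")
    return math.ceil(z * z * p * (1 - p) / (margin * margin))


if __name__ == "__main__":
    # Placeholder cutoff of 3.4% usage, resolved to 0.1 percentage points.
    print(required_sample_size(p=0.034, margin=0.001))
```

With those placeholder inputs this comes out to roughly 126k teams, so somewhere around 63k battles if each battle contributes two teams, which is inside the 10-100k range floated above. Anything sitting closer to the cutoff than the chosen margin would still be a coin flip.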