Jonathan212 said:
If we want to assess a single draw, how extreme a single draw is, given K for this draw, what's the proper way to do it?
(50 choose 6)/K must be ok as a factor for small K's, but it can't be right for K=(50 choose 6) even if people choose with a random number generator because even the random number generator will produce duplicates.
If you only know how many tickets have been sold, but not how widely the tickets are distributed, then there is no way to predict the frequency of a lottery win. But, the total number of winners -over a potentially long time - should be more predictable.
Take an example of a lottery with 100 tickets and 50 players. If, for whatever reason, they all have different numbers, then you'll get one win every two weeks on average; and, only ever one winner.
At the other extreme, if they all have the same numbe, then you will only get one win every 100 weeks, but 50 winners every time.
And, if there is something between the two, with perhaps 40 different numbers, then you will get a win less than once every two weeks but sometimes more than one winner.
The common factor is the total number of winners, which relates only to the total number of tickets sold.
In the real lottery, out of 6.1 tickets sold, you might have only 3.2 million different numbers. Most of these would be held by only a few players: perhaps 1-5. But, some special "lucky" numbers might be held by thousands of different players. This could result in the pattern from your data. Most weeks there are a small numbers of winners, but if the lottery is played long enough, eventually one of the commonly held numbers will turn up and you'll get hundreds or thousands of winners.
In this case, it may take a long time for the number of winners to average out to match the ticket sales.
In the meantime, there is no definite, immediate way to know for sure why there are so few winners - given the number of ticket sales.