Probability of misprints on each page

  • Context: MHB 
  • Thread starter Thread starter WMDhamnekar
  • Start date Start date
  • Tags Tags
    Probability
Click For Summary
SUMMARY

The discussion focuses on calculating the probability of misprints on each page of a book containing 500 pages and 50 misprints. The probability that pages 1 through 500 contain specific numbers of misprints, denoted as r1, r2, ..., r500, is given by the formula $$\frac{\binom{N}{r_1}\binom{N}{r_2}...\binom{N}{r_n}}{\binom{nN}{r}}$$. For large N, this probability can be approximated by $$\frac{r!}{r_1! r_2! ... r_n!} n^{-r}$$, indicating that misprints are distributed randomly across the pages. The discussion also emphasizes the application of Fermi-Dirac statistics in this context.

PREREQUISITES
  • Understanding of combinatorial mathematics, specifically binomial coefficients.
  • Familiarity with Fermi-Dirac statistics and its application in probability theory.
  • Basic knowledge of probability distributions and random variables.
  • Ability to manipulate mathematical expressions involving factorials and combinations.
NEXT STEPS
  • Study the principles of Fermi-Dirac statistics in greater detail.
  • Learn about advanced combinatorial techniques and their applications in probability.
  • Explore the implications of random distributions in statistical mechanics.
  • Investigate the relationship between binomial distributions and real-world phenomena.
USEFUL FOR

Mathematicians, statisticians, and researchers interested in probability theory, particularly those exploring the distribution of errors or misprints in large datasets.

WMDhamnekar
MHB
Messages
378
Reaction score
30
Hi,
Each page of a book contains N symbols, possibly misprints. The book contains n =500 pages and r =50 misprints. Show that (a) the probability that pages number 1, 2, . . . , n contain, respectively , $r_1, r_2 , . . . , r_n $ misprints equals $$\frac{\binom{N}{r_1}\binom{N}{r_2}. . .\binom{N}{r_n}}{\binom{nN}{r}};$$

(b) Show that for large N, this probability may be approximated by $$\frac{r!}{r_1! r_2! . . . r_n!} n^{-r}$$

Conclude that the r misprints are distributed in the n pages approximately in accordance with a random distribution of r balls in n cells.

(Note: The distribution of r misprints among the N available places follows the Fermi-Dirac statistics. Our assertion may be restated as a general limiting property of Fermi-Dirac statistics.)

My attempt to answer (a)
1651060074138.png

Now, in our case, n =500 pages and r= 50 misprints. So, there is only 10% chance that any single page has one misprint. ⇒ It is impossible for two or more misprints to be on the same page. So, first condition of Fermi_Dirac statistics is satisfied. All distinguishable arrangements satisfying the first condition have equal probabilities. In our case, It is $\binom{nN}{r}^{-1}$. So, the given answer to (a) is correct.

Now, how to prove (b)?
 
Last edited:
Physics news on Phys.org
WMDhamnekar said:
Now, in our case, n =500 pages and r= 50 misprints. So, there is only 10% chance that any single page has one misprint. ⇒ It is impossible for two or more misprints to be on the same page.
This is nonsense. Of course it is possible for two or more misprints to be on the same page, what is not possible is for two misprints to be in the same symbol. If you are still interested in answering this question I suggest you start again at the beginning.
 
  • Like
  • Informative
Likes   Reactions: malawi_glenn and WMDhamnekar
My answer to (a) Yes. N= Total symbols on each page. r= 50 misprints. So, what is impossible is two or more misprints occurring in the same symbol. So, first condition of Fermi-Dirac statistics is satisfied. All distinguishable arrangements satisfying the first condition have equal probabilities. In our case, It is ## \binom{nN}{r}^{-1}##.


So, we showed that the probability the pages number 1,2,... ,500 contain, respectively, ## r_1, r_2, ...,r_{500} ## misprints equals ## \displaystyle\frac{\binom{N}{r_1} \binom{N}{r_2} ... \binom{N}{r_n}}{\binom{nN}{r}}## because the probability that page number 1,2 ,..., 500 contain ##r_1 , r_2, ..., r_{500}## misprints, respectively, (where ##r_1 + r_2 + ... + r_{500} =50##) equals ##\binom{500N}{50}^{-1}## under Fermi-Dirac statistics provided each rj equals 0 or 1.

My answer to (b): We know that r1 + r2 + ... + r500= 50. The number of placements of 50 misprints on 500 pages resulting in the occupancy numbers r1 ,r2 , ... ,r500 is given by ##\binom{500}{50}##

Assuming that all 50050 possible placements are equally probable, the probability to obtain the given occupancy numbers r1, ... , r500 equals ##\displaystyle\frac{50!}{r_1!r_2!...r_{500}!}##500-50
 
Last edited:
WMDhamnekar said:
Now, in our case, n =500 pages and r= 50 misprints. So, there is only 10% chance that any single page has one misprint. ⇒ It is impossible for two or more misprints to be on the same page.
I don't agree wiht that.
 
  • Like
Likes   Reactions: malawi_glenn
What is ##50_{500}!##?
 
malawi_glenn said:
What is 50500!

malawi_glenn said:
50500= r500 where rk stands for number of misprints in the kth page. r= 50 misprints. n=500 pages
 
Last edited:
malawi_glenn said:
@WMDhamnekar why should a number have an index?
I edited my post #3. Please Recheck it.
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 100 ·
4
Replies
100
Views
12K
  • · Replies 97 ·
4
Replies
97
Views
23K
  • · Replies 29 ·
Replies
29
Views
3K
  • · Replies 2 ·
Replies
2
Views
10K
  • · Replies 96 ·
4
Replies
96
Views
12K