Picking an appropriate distribution

  • Context: Graduate 
  • Thread starter Thread starter aaaa202
  • Start date Start date
  • Tags Tags
    Distribution
Click For Summary
SUMMARY

This discussion focuses on modeling the probability of mutation accumulation in a biological system of approximately 10,000 cells using statistical distributions. The initial approach utilizes the binomial distribution to express the probability of a cell acquiring mutations, defined by the formula p(k mutations on n tries) = K(n,k) * (p/N)^k * (1-p)^(n-k). The user seeks to simplify the model by considering the variability of mutation probability (p) across different cells and inquires about the feasibility of approximating the distribution with a Poisson distribution to facilitate calculations regarding the mean deviation of mutations.

PREREQUISITES
  • Understanding of binomial distribution and its applications in probability theory
  • Familiarity with Poisson distribution and its use in modeling rare events
  • Basic knowledge of biological mutation processes and cell biology
  • Proficiency in mathematical modeling and statistical analysis
NEXT STEPS
  • Explore the derivation and applications of the Poisson distribution in biological contexts
  • Investigate methods for estimating parameters in models with heterogeneous probabilities
  • Learn about advanced statistical techniques for analyzing mutation rates in populations
  • Review case studies where binomial and Poisson distributions have been applied to genetic mutation research
USEFUL FOR

Researchers in genetics, biostatisticians, and anyone involved in modeling biological systems with a focus on mutation probabilities and statistical distributions.

aaaa202
Messages
1,144
Reaction score
2
I am studying a biological system comprised of roughly 10000 cells. My model studies the probability that a cell accumulates four independent mutations and thus transform into a vicious cancer cell.
Starting from basic theory of the binomial distribution it is easy to write an expression for the probability that a particular cell acquires k mutations after n timesteps. Calling the probability that an arbitrary cell acquires a mutation for p we have for a single cell:
pcell = p/N
And thus:

p(k mutations on n tries) = K(n,k) * (p/N)^k * (1-p)^(n-k)

And summing all these up should give us the total probability that one cell has acquires k mutations. Now multiplying by N wouldn't actually work since p is actually specific to each cell (I assumed it to be the same for simplicity).

Now my question is: This expression becomes quite nasty when we add the fact that p differs from cell to cell. Is it possibly to make some estimations to make the expression more easy to work with. As N is pretty big (we could make it a lot bigger) would it be possible to model the distribution as a poisson distribution? And would that then make cell dependence of p easier to work with, or could we at least then find a straightforward expression for the deviation from the mean amount of mutations?
 
Physics news on Phys.org
Could you explain your model a little more clearly? First what exactly is N and "a mutation for p"?
 

Similar threads

  • · Replies 29 ·
Replies
29
Views
6K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 17 ·
Replies
17
Views
2K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 15 ·
Replies
15
Views
3K
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K