Understanding Probability Distributions for Generating Random Data

EliteLegend · Jul 14, 2009

I have recently come across the 80/20 rule... I am using the Pareto Distribution's pdf to generate some dataset that I wanted... Now if I have a set of 50 items and I need to generate 40% of these items 60% of the times, how am I supposed to go about doing this? I know how to select items with certain probabilities but this task is confusing me... Anyone has some inputs for me please?

EnumaElish · Aug 6, 2009

One way is to create duplicate (multiplicate) items until you reach the desired proportion, then use a uniform rule to select.

For example, if I had 2 items {x, y} and wanted to obtain x 67% of the time, I'd duplicate x once, and make draws from the set {x, x, y}.

mXSCNT · Aug 6, 2009

To select one of m items, from a total of n items, a proportion y of the time, you need to select each of the m items with probability y/m, and each of the remaining n-m items with probability (1-y)/(n-m)

Understanding Probability Distributions for Generating Random Data

SUMMARY

PREREQUISITES

NEXT STEPS

USEFUL FOR

Similar threads

Graduate Expected numbers of cards of a last color remaining

Undergrad The problem of points

Graduate Probability puzzle

Undergrad How does axiom of foundation prevent infinite sequence of elements?

Undergrad Understanding permutations and combinations in a coin toss experiment

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect