How Do I Determine the Probability Distribution from Time and Size Data?

  • Thread starter Thread starter serdayne
  • Start date Start date
  • Tags Tags
    Data
AI Thread Summary
To determine the probability distribution from time and size data, one approach is to calculate the average time for each size category. The average time can then be plotted against the size, creating a visual representation of how time varies with size. While the initial plot using NORMDIST shows the distribution of times, it does not incorporate size, which is crucial for a comprehensive analysis. A combined plot that reflects the average probability of a given time for each size can help illustrate the relationship more effectively. This method, while not perfect, provides a condensed view of the data on a two-axis plot.
serdayne
Messages
9
Reaction score
0
I have a program for which I am trying to analyze its performance. My adviser recommended that I find the probability distribution for the data I have. However, I am not quite sure how to do this.

The data is something like:

Code:
[U]Time[/U]      [U]Size[/U]
2.10 ms     2  
2.30 ms     2 
2.90 ms     3
3.10 ms     2
3.30 ms     4
4.10 ms     4
4.30 ms     4
5.30 ms     5
5.50 ms     6

..etc

He suggested I find the probability distribution between the times and the size. I am not really sure what he means by this.

What I've tried: I found the average and the standard deviation of the times. I then, in Excel, used the function:

Code:
NORMDIST(Time[x], avg, std dev, true)

Where x is a Time on the above list. I do this for every single time.

I then plot the distributions (on the Y) vs. the Times (on the X). With this, I get a plot that resembles the one I've attached to this post.

The question is: is the plot of distributions for each value vs. Time a meaningful plot?

Also, I do not have Size factored in. What plot would allow me to compare Time vs. Size?

Thank you.
 

Attachments

  • example.jpg
    example.jpg
    20.2 KB · Views: 460
Physics news on Phys.org
Welcome to PF :P

My interpretation is that he wants you to tell him what is the chance that a given time belongs to a given size.
 
martix said:
Welcome to PF :P

My interpretation is that he wants you to tell him what is the chance that a given time belongs to a given size.

Thanks!

However, how can I show that? The above chart does not factor in Size at all. It is NORMDIST(TIME) vs. TIME.

Thanks.
 
Also, I should mention, that for me, the best way to interpret this data would be to figure out the average Time and measure that against the size. That way, for every sample that is Size 2 I'd have an Average Time, for Size 3 and Average Time, etc.

I would plot the Average Time for Each Size vs. Each Size. That way I'd know how long, on average, each Size sample took.

I'm not sure where the probability comes into play here.
 
Yes indeed.
Since size is a discrete variable and assuming a normal distribution, you can do a plot of the probability distribution for each size and then combine these to form the average probability of a given time offset from the average of the time for a particular size belonging to that size. In other words on Y you have average prob, and on X in the center you have average time for given size(with different sizes you put different average times there).
It may not be the most accurate approach, but it does condense all the information you have in on 2-axis plot.
 
I was reading a Bachelor thesis on Peano Arithmetic (PA). PA has the following axioms (not including the induction schema): $$\begin{align} & (A1) ~~~~ \forall x \neg (x + 1 = 0) \nonumber \\ & (A2) ~~~~ \forall xy (x + 1 =y + 1 \to x = y) \nonumber \\ & (A3) ~~~~ \forall x (x + 0 = x) \nonumber \\ & (A4) ~~~~ \forall xy (x + (y +1) = (x + y ) + 1) \nonumber \\ & (A5) ~~~~ \forall x (x \cdot 0 = 0) \nonumber \\ & (A6) ~~~~ \forall xy (x \cdot (y + 1) = (x \cdot y) + x) \nonumber...
Back
Top