MHB -aug.16.data list from table intervals

AI Thread Summary
The discussion centers on calculating statistical measures such as mean, median, mode, variance, and standard deviation from a frequency table of height intervals. Participants clarify that midpoints of intervals should be used for calculations, leading to the correct mean and median values. The mode is identified as the height corresponding to the highest frequency, which is 13. Variance and standard deviation formulas are discussed, with emphasis on using midpoints rather than frequencies in calculations. The conversation concludes with a focus on interquartile range and the correct interpretation of quartiles within the context of interval data.
karush
Gold Member
MHB
Messages
3,240
Reaction score
5
in creating a list of numbers to find mean, median, mode, range and some other questions with this table

height in meters| frequency
$$8 \le h < 10\ \ \ \ \ \ \ \ \ \ 6$$
$$10 \le h < 12\ \ \ \ \ \ \ \ 5$$
$$12 \le h < 14\ \ \ \ \ \ \ \ 7 $$
$$14 \le h < 16\ \ \ \ \ \ \ \ 4$$

how do we make a list when you have intervals? or do you just use the number in between like $$8 \le h < 10$$ would be $$\{9,9,9,9,9,9\}$$
 
Mathematics news on Phys.org
let me ask a different question would the mean of this table be
$$\frac{4+5+6+7}{4} = \frac{11}{2} $$
 
Hi karush! :)

karush said:
in creating a list of numbers to find mean, median, mode, range and some other questions with this table

height in meters| frequency
$$8 \le h < 10\ \ \ \ \ \ \ \ \ \ 6$$
$$10 \le h < 12\ \ \ \ \ \ \ \ 5$$
$$12 \le h < 14\ \ \ \ \ \ \ \ 7 $$
$$14 \le h < 16\ \ \ \ \ \ \ \ 4$$

how do we make a list when you have intervals? or do you just use the number in between like $$8 \le h < 10$$ would be $$\{9,9,9,9,9,9\}$$

Yes. You would use the number in the middle as you suggest.
karush said:
let me ask a different question would the mean of this table be
$$\frac{4+5+6+7}{4} = \frac{11}{2} $$

So no.
The mean would be $$\frac{6\cdot 9 + 5 \cdot 11 + 7 \cdot 13 + 4 \cdot 15}{6+5+7+4}$$.
 
Last edited:
The mean would be $$\frac{6\cdot 9 + 5 \cdot 11 + 7 \cdot 13 + 4 \cdot 15}{6+5+7+4}$$.

so would the median of this be from

$$\{9,11,13,15\} = 12$$

and the mode be $$7$$ since it has the highest frequency

do I have to start a new OP if I continue to ask more Q on this table?
 
karush said:
...
do I have to start a new OP if I continue to ask more Q on this table?

As long as your additional questions pertain to the data already provided, it is best to ask further questions regarding it here in this topic. :D
 
karush said:
so would the median of this be from

$$\{9,11,13,15\} = 12$$

The median is the height where half is smaller and the other half is taller.
At height 12, you have 6+5=11 people smaller, and 7+4=11 people taller.
So indeed the median is 12.

and the mode be $$7$$ since it has the highest frequency

The mode is the height that occurs most... but no one has height 7. ;)
 
I like Serena said:
The mode is the height that occurs most... but no one has height 7.

so the most frequent is $$12\leq h < 14$$ or 13 for mode.

my next question is standard deviation and variance
from Wikipedia
In statistics and probability theory, standard deviation (represented by the symbol sigma, $$\sigma$$) shows how much variation or dispersion exists from the average (mean), or expected value.

So I would presume $$\sigma$$ here is 2 since that is the size of the intervals

I read variance in Wikipedia but not sure if it applies to this table.
so how is variance derived?
 
karush said:
so the most frequent is $$12\leq h < 14$$ or 13 for mode.

Right. :)
my next question is standard deviation and variance
from Wikipedia
In statistics and probability theory, standard deviation (represented by the symbol sigma, $$\sigma$$) shows how much variation or dispersion exists from the average (mean), or expected value.

So I would presume $$\sigma$$ here is 2 since that is the size of the intervals

I read variance in Wikipedia but not sure if it applies to this table.
so how is variance derived?

Not quite.

Let's start with variance.
In your case the formula is:
$$\text{Variance} = \frac{\sum n_i \times (x_i - \text{mean})^2}{\sum n_i}$$
where $n_i$ is the frequency of each category, $x_i$ is the mid value of each height interval, and $\text{mean}$ is the value you already found.

Variance is often denoted as $\sigma^2$.
Standard deviation (denoted as $\sigma$) is the square root of the variance.
 
I like Serena said:
Let's start with variance.
In your case the formula is:
$$\text{Variance} = \frac{\sum n_i \times (x_i - \text{mean})^2}{\sum n_i}$$
where $n_i$ is the frequency of each category, $x_i$ is the mid value of each height interval, and $\text{mean}$ is the value you already found.

Variance is often denoted as $\sigma^2$.
Standard deviation (denoted as $\sigma$) is the square root of the variance.

by $$\sum n_i $$ would this mean $$6+5+7+4=22$$

If so then
$$\frac{6 \times (6-11.82)^2 +5 \times (5-11.82)^2 +7 \times (7-11.82)^2 +4 \times (4-11.82)^2}{22}=variance$$
or is this composed wrong?
 
  • #10
karush said:
by $$\sum n_i $$ would this mean $$6+5+7+4=22$$

Yes!

If so then
$$\frac{6 \times (6-11.82)^2 +5 \times (5-11.82)^2 +7 \times (7-11.82)^2 +4 \times (4-11.82)^2}{22}=variance$$
or is this composed wrong?

Almost.
But you have substituted the frequencies instead of the mid interval values for $x_i$.
 
  • #11
I like Serena said:
Almost.
But you have substituted the frequencies instead of the mid interval values for $x_i$.
how this?$$\frac{9 \times (9-11.82)^2 +11 \times (11-11.82)^2 +13 \times (13-11.82)^2 +15 \times (15-11.82)^2}{22}=11.307$$ or $$ \sigma^2$$

thus standard deviation would be $$\sqrt{11.307}=3.3626$$
 
  • #12
karush said:
how this?$$\frac{9 \times (9-11.82)^2 +11 \times (11-11.82)^2 +13 \times (13-11.82)^2 +15 \times (15-11.82)^2}{22}=11.307$$ or $$ \sigma^2$$

thus standard deviation would be $$\sqrt{11.307}=3.3626$$

Hold on.
Now you have substituted the mid interval values for the freqencies $n_i$.
Check where it says $n_i$ and where it says $x_i$.

Btw, the meaning of variance is the average of the squared deviations from the mean.
 
  • #13
I like Serena said:
Hold on.
Now you have substituted the mid interval values for the freqencies $n_i$.
Check where it says $n_i$ and where it says $x_i$.

Btw, the meaning of variance is the average of the squared deviations from the mean.

$$\frac{6 \times (9-11.82)^2 +5 \times (11-11.82)^2 +7 \times (13-11.82)^2 +4 \times (15-11.82)^2}{22}=4.60331 $$ or $$ \sigma^2$$

so if correct then $$\sqrt{4.60331} = 2.14553$$ or $$\sigma$$
 
  • #14
Yep. That looks right.
 
  • #15
there's still more ??

Number of Data
$$= 22$$ assume sum of frequenciesInterquartile range
assume we could go off the intervals

so $$Q_1=10 \ \ Q_2=12 \ \ Q_3=14$$

then $$14-10=4$$

range
$$16-8=8$$
 
  • #16
karush said:
there's still more ??

Number of Data
$$= 22$$ assume sum of frequencies

Correct.
Interquartile range
assume we could go off the intervals

so $$Q_1=10 \ \ Q_2=12 \ \ Q_3=14$$

then $$14-10=4$$

You're not supposed to work from the intervals.
$Q_1$ is the height such that 25 percent is below.
Since 25% of 22 persons is 5.5, the $Q_1$ height is somewhere in the interval 8-10, which contains 6 persons.
There can be some discussion where that height actually is when talking about intervals, but let's keep it simple and say that $Q_1=9$, which is the middle of the lowest interval.
Similarly $Q_3$ is the height with 75% below.
Keeping it simple that is the middle of the third interval. So $Q_1=13$.

Anyway, your interquartile range comes out the same.
range
$$16-8=8$$

Right.
 
  • #17
thanks everyone for your help. it was a new topic for me
sure I'll be back with more
 
Back
Top