Calculating statistical values from given data

  • Context: MHB 
  • Thread starter Thread starter mathmari
  • Start date Start date
  • Tags Tags
    Data Statistical
Click For Summary
SUMMARY

This discussion focuses on calculating statistical values from stock performance data over ten years. Key calculations include the arithmetic mean of 10.1%, a median of 8%, a mode of 8%, a standard deviation of approximately 0.0686, a coefficient of variability of 67.92%, and a coefficient of asymmetry of approximately 0.68696. The discussion also addresses the identification of extreme values using a cutoff limit of z = ±2 and the verification of a claim regarding an expected yield of at least 7.5% through hypothesis testing.

PREREQUISITES
  • Understanding of basic statistical concepts such as mean, median, mode, and standard deviation.
  • Familiarity with the coefficient of variability and coefficient of asymmetry.
  • Knowledge of hypothesis testing and normal distribution assumptions.
  • Ability to perform calculations involving z-scores and standard deviations.
NEXT STEPS
  • Learn about hypothesis testing and t-tests in statistics.
  • Explore the implications of the coefficient of variability in financial analysis.
  • Study the significance of extreme values and outlier detection methods.
  • Investigate the differences between population and sample standard deviation calculations.
USEFUL FOR

Data analysts, statisticians, financial analysts, and anyone involved in quantitative analysis of stock performance data will benefit from this discussion.

mathmari
Gold Member
MHB
Messages
4,984
Reaction score
7
Hey! :giggle:

Analyst has collected the following data on the performance of the $X$ stock for $10$ different years.

1631077089551.png


a) Calculate the arithmetic mean, the median, the mode, the standard deviation, the coefficient of variability and of asymmetry. You interpreted your results.

b) Does the data have extreme values (consider the cut-off limit $z = \pm 2$ or $z = \pm 3$).

c) The analyst claims that the average yield next year will be at least $7.5\%$. Is the claim correct? Do you think you would have an increased risk if you bought the $X$ share?
I have done the follwoing :

a) The arithmetic mean is $$\frac{5\%+3\%+8\%+12\%+2\%+18\%+5\%+8\%+24\%+16\%}{10}=\frac{101\%}{10}=10.1\%$$
For the median, we write the percentages in an increasing order then we calculate the half of the sum of $5$-th and $6$-th percentage.

Therefore the median is equal to $$\frac{8\%+8\%}{2}=8\%$$

The modeis the value that appears most often, so $8\%$.

For the standard deviation wesubtract from eachgiven value the mean, square the result and take the sum, divide that by the number $10$ and take the square root, right?

So we get $$\sigma=\sqrt{\frac{1}{10}\left [(0.05-0.101)^2+(0.03-0.101)^2+(0.08-0.101)^2+(0.12-0.101)^2+(0.02-0.101)^2+(0.180-0.101)^2+(0.05-0.101)^2+(0.08-0.101)^2+(0.24-0.101)^2+(0.16-0.101)^2\right ]}=\sqrt{\frac{1}{10}\cdot 0.04709}\approx 0.0686$$

The coefficient of variability is $$CV=\frac{\sigma}{\mu}=\frac{0.0686}{0.101}\approx 67.92\%$$ The coefficient of asymmetry is \begin{align*}\beta_1&=\frac{1}{10}\cdot \sum_{i=1}^{10}\left (\frac{x_i-\mu}{\sigma}\right )^3\\ & =\frac{1}{10\cdot \sigma^3}\cdot \left [\left (x_i-\mu\right )^3\right ]\\ & =\frac{1}{10\cdot 0.0686^3}\cdot \left [(0.05-0.101)^3+(0.03-0.101)^3+(0.08-0.101)^3+(0.12-0.101)^3+(0.02-0.101)^3+(0.180-0.101)^3+(0.05-0.101)^3+(0.08-0.101)^3+(0.24-0.101)^3+(0.16-0.101)^3\right ]\\ & =\frac{1}{0.00322828856}\cdot 0.00221772 \\ & \approx 0.68696\end{align*}

Is everything correct so far? How can we interpret the results?

b) Could you give me a hint for that?

c) Do we have to check what distribution we have in this case? Or how can we know that?
 
Physics news on Phys.org
mathmari said:
For the standard deviation wesubtract from eachgiven value the mean, square the result and take the sum, divide that by the number $10$ and take the square root, right?

So we get $$\sigma=...$$

Is everything correct so far? How can we interpret the results?

First the symbols, we use the Greek symbols $\mu$ and $\sigma$ to denote the so called population mean and standard variation.
They only apply if we are measuring the whole population, and as such they are usually unknown.
We use the Latin symbols $\bar x$ and $s$ to denote the mean and standard deviation of a sample of the population. 🧐

More specifically, the formula for the standard deviation of a sample is different from the one of the entire population.
That is because we "lose a degree of freedom" since we don't know the population mean $\mu$ and can only estimate it as $\bar x$ through the same sample.

In this case it means we must divide by $n-1=10-1=9$ to find the standard deviation $s$.
So $s=\sqrt{\frac 1{10-1}\cdot0.04709}$. 🤔

It also means we have $CV=\frac{s}{\bar x}$ and $b_1 = \frac 1{n}\sum\left(\frac{x_i-\bar x}{s}\right)^3$. 🤔

mathmari said:
b) Could you give me a hint for that?

The cut-off limit $z=\pm 2$ means that we look at 2 standard deviations from the mean ($\bar x\pm 2s$) and we check if we have values further away from the mean than that. 🤔

mathmari said:
c) Do we have to check what distribution we have in this case? Or how can we know that?

To check the claim we first formulate a null hypothesis and an alternative hypothesis, which we then try to verify.
The usual way to verify it, is by assuming that the "entire population" has a normal distribution, and that we have an independent sample of that population.
Questions (a) and (b) seem to be intended to verify if it is "reasonable" to assume a normal distribution.
It's also why question (a) asks to "interpret the results".
Assuming it is reasonable, we can do a t-test. 🤔
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 18 ·
Replies
18
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K