Determining a confidence interval for data.

Click For Summary

Homework Help Overview

The discussion revolves around determining a 95% confidence interval for the mean surface roughness of oil pipes based on a sample of 20 data points. The original poster notes that the data is negatively skewed and not normally distributed, raising questions about the appropriateness of using the t-table for their calculations.

Discussion Character

  • Assumption checking, Exploratory, Conceptual clarification

Approaches and Questions Raised

  • The original poster attempts to apply the formula for the confidence interval using the t-value, standard deviation, and sample mean. Some participants discuss the implications of sample size on the use of t-intervals, particularly in relation to the normality of the data distribution.

Discussion Status

Participants are exploring the validity of using the t-distribution with a sample size of 20, considering the skewness of the data. There is a recognition that while the common rule of thumb suggests a sample size of 30 for normality, some argue that a sample size of 20 may still be acceptable depending on the degree of skewness.

Contextual Notes

There is a discussion about the implications of sample size on statistical methods, particularly in relation to the normality of the data distribution and the potential for using t-intervals despite the sample size being less than 30.

NewtonianAlch
Messages
453
Reaction score
0

Homework Statement


I have 20 data points of the surface roughness of oil pipes, the mean, and the std of the sample.

I need to get a 95% confidence interval of the mean surface roughness of these pipes. I did a boxplot, and found that it's negatively skewed, and therefore not normally distributed:

32234.jpg


Would using the t-table be OK? I'm usually used to doing this for data when it is normally distributed, I'm not too sure what to do when it's not normal.

This is what I did:

x ± t-val(s/√n)

Where the t-val is t19 and I looked up the t-table for the value. s = std and n = 20 and x is of course the mean.
 
Physics news on Phys.org
when the sample size is greater than 30 you can use the t-confidence interval even if the population is not normal.
 
several of my professors have said n=30 is a number that's been around for a while and is used as a rule of thumb, but you don't have to follow it super strictly. How skewed is the data? If it's not horribly skewed, if it's even vaguely bell-shaped, I'd imagine n=20 is acceptable. But please correct me if I'm wrong, I'm still new at this.
 
I suppose n=20 is reasonable for using t-intervals
 

Similar threads

  • · Replies 21 ·
Replies
21
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
7K
  • · Replies 10 ·
Replies
10
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
2
Views
2K
  • · Replies 12 ·
Replies
12
Views
2K