When does difference in sample size become an issue?

Click For Summary
Sample size can significantly influence the detection of differences between groups, as seen in the comparison of two groups with n=1600 and n=700, where significant differences were found. Randomly selecting equal sample sizes resulted in similar significance levels, suggesting that the initial sample size disparity may not be the sole factor. Bootstrapping within the restricted sets yielded consistent results, indicating robustness in findings. Additionally, plotting data can help assess the distribution type, which is essential for validating assumptions in statistical analysis. Overall, ensuring normality in data distribution is crucial for accurate interpretation of results.
80past2
Messages
39
Reaction score
0
I was comparing two different groups, and in one, my n was 1600, and the other n was around 700. I found pretty much all significant differences, but is that maybe due to sample size. I tried doing a random selection making the sample sizes equal (both around 700) and got more or less the same numbers and significance every time. Should I do anything else, or is this fine?
I also bootstrapped within one of these restricted sets and got about the same numbers.
 
Physics news on Phys.org
Do you need help checking distribution type by plotting it?

80past2 said:
I was comparing two different groups, and in one, my n was 1600, and the other n was around 700. I found pretty much all significant differences, but is that maybe due to sample size. I tried doing a random selection making the sample sizes equal (both around 700) and got more or less the same numbers and significance every time. Should I do anything else, or is this fine?
I also bootstrapped within one of these restricted sets and got about the same numbers.

I am doing a plotting program to look at data-sets and check for normal-ness,
if you'd like, & your data is "near" normal -- you can attach a text file with the data (or a scaled version of it...to obscure what it is) that just lists the data values. eg:
7
3.5
11.0

etc;
And I could plot the data into 1% or 0.05% quantiles; like this:
converting binomial/normal distribution into quantiles and comparing against normal
and then I could post the graphs for you... :smile:
It will show skewness, and some information that could help identify what type of distribution it really is, but it's mostly to check Gaussian data...
 
Last edited:
First trick I learned this one a long time ago and have used it to entertain and amuse young kids. Ask your friend to write down a three-digit number without showing it to you. Then ask him or her to rearrange the digits to form a new three-digit number. After that, write whichever is the larger number above the other number, and then subtract the smaller from the larger, making sure that you don't see any of the numbers. Then ask the young "victim" to tell you any two of the digits of the...

Similar threads

  • · Replies 31 ·
2
Replies
31
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K