Comparing Approaches for Determining Punctuality

  • Context: Undergrad 
  • Thread starter Thread starter musicgold
  • Start date Start date
Click For Summary

Discussion Overview

The discussion revolves around determining the best statistical approach for assessing punctuality based on recorded arrival times at work. Participants explore different methods for analyzing a dataset of 500 time entries to evaluate whether the individual typically arrives on time or early.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant suggests using option 3, which involves treating the 500 data points as a sample and performing a hypothesis test, arguing it provides the most accurate result.
  • Another participant questions the reasoning behind the preference for option 3 and requests further clarification.
  • A different participant critiques option 1 for being influenced by outliers and explains that option 2 involves hypothesis testing based on a smaller sample size, while option 3 uses a larger sample, potentially yielding more accurate results.
  • This participant also mentions the importance of the central limit theorem in justifying the use of larger samples to achieve results that resemble a normal distribution.
  • Concerns are raised about the unknown distribution of the arrival times, which could affect the validity of the statistical methods applied.

Areas of Agreement / Disagreement

Participants express differing opinions on which statistical approach is best, with no consensus reached on a single method. The discussion includes both support for and critique of the proposed options.

Contextual Notes

Participants note the potential influence of outliers on the mean and the necessity of understanding the distribution of the data before applying certain statistical tests. There is also mention of the complexity involved in calculating variance and conducting hypothesis tests.

musicgold
Messages
303
Reaction score
19
Hi,

I have the electronic record of my time sheets at my work, for a two year period. Every day when I reach and leave the factory, I swipe my identity card through a machine which records the time.

I generally reach work at 8.00 am, sometimes early, sometimes late. I wish to know if I am a punctual person, statistically – on average, do I reach the office at 8.00 am or before?

I have about 500 data points. I am thinking of the following three approaches. I am not sure which one is the best.

1. Take a simple average of all the 500 arrival times. The average will tell me if I am punctual.

2. Take a sample of 50 data points, calculate the sample mean, calculate the sample variance and estimate the population variance. Assume that the population mean is 8.00 am. And see if the sample mean is within 3 standard deviations (population’s SD) from the population mean.

3. Treat the 500 data points as a sample and follow the steps outlined in the #2 approach.

Which one do you think is the best way and why?

Thanks,

MG.
 
Physics news on Phys.org
hmm I'd go with option 3. It will give you the most acurrate result.
 
Do you mind explaning a bit more? Thanks.
 
well let's look at option 1. you are finding the mean but this is largely affected by outliers in the data. Then option 2 you are taking, essentially, a sample size of 50 and doing a hypothesis test based on that sample and well in option 3 you are doing the same thing but just taking a larger sample. Either way the hypothesis test is more accurate than just finding the mean, however, because we do not know the whether your times are normally distributed or not we have to apply the central limit theorem. (if you do not know about this I suggest you read up about it somewhere, it's a simple concept) The theorem then implies that the larger the sample the closer your data resembles a normally distributed curve. Hence if you take a larger sample the more accurate your result will be. Of course it will be more time consuming to find your variance etc. but if accuracy is your aim then by all means.
( I apologize if my explanation seems a little haywire but my english isn't the best.. I'll clear up whatever.. just ask)
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 22 ·
Replies
22
Views
4K