Can a Paired Two Sample T Test Reveal Significant Changes in Flux over 8 Hours?

  • Context: Undergrad 
  • Thread starter Thread starter leonmate
  • Start date Start date
  • Tags Tags
    Test
Click For Summary
SUMMARY

This discussion focuses on conducting a paired two-sample t-test to analyze flux measurements from the Herschel space telescope taken at two time intervals approximately 8 hours apart. The user, Leon, seeks guidance on implementing this statistical test using data that includes error measurements for each flux value. Key formulas provided include the calculation of pooled standard error and the t-statistic, which are essential for determining significant changes in flux. The conversation emphasizes the importance of considering error terms in the analysis and suggests averaging errors when they are similar.

PREREQUISITES
  • Understanding of paired two-sample t-tests
  • Familiarity with statistical error analysis
  • Proficiency in Python or Excel for data analysis
  • Basic knowledge of flux measurements in astrophysics
NEXT STEPS
  • Learn how to implement a paired two-sample t-test in Python using libraries such as SciPy
  • Study the concept of pooled standard error and its application in hypothesis testing
  • Explore methods for handling error terms in statistical analyses
  • Investigate alternative statistical tests for comparing multiple observations
USEFUL FOR

Physics undergraduates, data analysts, and researchers in astrophysics looking to perform statistical analysis on time-series flux data from telescopes.

leonmate
Messages
81
Reaction score
1
Hi, physics undergrad here with bad statistics skills!

I'm working on my 3rd year project and I need to do some statistical analysis. First, I'll paint the scene:

We have a number of tables, each of 10,000+ sources taken from the space telescope Herschel. They have had their flux measured at two time intervals ~ 8 hours apart. For each source I had flux1, flux2 and each data point has an error. So my table looks like:

| Flux_1 | Err_1 | Flux_2 | Err_2 |
(lots of data)

I need to run a paired two sample t test (I believe) on this data to see if any sources have significant changes in flux over the 8 hour period.

I've been looking at two sample t tests everywhere and I can't seem to find an example that has errors associated with each value taken.

I was hoping someone could shine some light on the situation that I'm in. And perhaps drop some hints on how I would implement running the tests on each data point. Excel or Python, maybe something easier?

Thanks,

Leon
 
Physics news on Phys.org
It sounds to me like you are looking to conduct 10,000+ independent comparisons.
H0 : Flux1_i-Flux2_i = 0
H1 : |Flux1_i - Flux2_i|>0
If the errors are similar, then it is best to average the error for your test. Assuming your error term is the standard error (S), and based on the same number of observations in the system, then you can use:
##S^2_p = \frac{S_1^2 + S_2^2}{2}##
##t_0 = \frac{Flux_1 - Flux_2}{\sqrt{S^2_p/2}}##

The other option would be to ignore the error term and just compare the observations. This might be prudent if the errors are relatively small compared to the observations.
Doing that would follow the standard process form paired tests, where you are testing the pairwise difference against the mean difference of the total sample.
 

Similar threads

  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
Replies
2
Views
4K
  • · Replies 19 ·
Replies
19
Views
3K
Replies
2
Views
3K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 1 ·
Replies
1
Views
4K