Question about data analysis and error

Click For Summary

Discussion Overview

The discussion revolves around the analysis of data involving a constant variable and a function dependent on another variable. Participants explore how to determine values of the variable that satisfy a specific condition regarding the difference between the function and the constant, while also considering the implications of error and distribution of values.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant presents a scenario involving a constant variable and a function, aiming to find values of the variable that yield a specific difference.
  • Another participant emphasizes the importance of ensuring that the errors associated with the constant and the function are uncorrelated.
  • Concerns are raised about the method of iterating through values of the variable, suggesting it may lead to incorrect results due to the distribution of values.
  • Participants discuss the distribution of the variable, with one noting that their printed values appeared normally distributed, while another questions whether this is due to luck or a dependency of the function on the variable.
  • There is a discussion about calculating the mean and standard deviation of the variable, with differing opinions on the appropriateness of these statistics given the distribution type.
  • One participant references the standard deviation for a uniform distribution and contrasts it with the properties of a Gaussian distribution, highlighting the differences in how standard deviations are calculated and interpreted.

Areas of Agreement / Disagreement

Participants express differing views on the implications of the distribution of values and the appropriate statistical measures to use. There is no consensus on the best approach to analyze the data or the nature of the distribution.

Contextual Notes

Participants acknowledge limitations in their assumptions about the distributions and correlations involved in the analysis, which may affect the validity of their conclusions.

Arman777
Insights Author
Gold Member
Messages
2,163
Reaction score
191
Let us suppose we have one constant variable ##b \pm \delta b = 20 \pm 1 ## and one function that depends on ##x ##, such as, ##a(x) \pm \delta a ##

The problem is I want the difference between ##a(x) ## and ##b ## to be ##0 ##. Let me denote this difference as ##c \pm \delta c ##. To obtain a difference which is ##0 ##, we can have only one condition ##c \leq \delta c ##.

We are also given a range of ##x ## values.

So the problem is something like this.

Take a range of ##x ## values (such as ##{0,2} ##)

For each value of ##x ## in this range;

1) Calculate ##a(x) \pm \delta a ##

2) Take the difference between ##a(x) ## and ##b ##;
##c \pm \delta c = (a(x) \pm \delta a) - (b \pm \delta b) ##

3) if ##c \leq \delta c ## (if the difference can be ##0 ##), add it to an array that stores the values of the ##x ##.

Now after this we have some values of ##x ## that satisfy ##c \leq \delta c ##. How can you find ##x ## and ##\delta x ## from this array
 
Last edited:
Physics news on Phys.org
Some caveats are in order:

1) You want to be sure ##\Delta a## and ##\Delta b## are uncorrelated
2) You mean ##\ | c | \leq \Delta c\ ## of course.

And the 'for each ##x##' can lead to a wrong result. Numerical example:

a = 400 ± 40, b = 360 ± 20, so c = 40 ± 45

And let a = 200 x ##\Rightarrow ## x = 2.0 ± 0.2

Your array of x is, for example, 1.78, 1.79, ... 2.22.
Then the average is 2.0 and the standard deviation 0.13

Reason: your ##x## has a flat distribution instead of a normal distribution##\ ##
 
BvU said:
1) You want to be sure Δa and Δb are uncorrelated
2) You mean |c|≤Δc of course.
Yes
BvU said:
Reason: your x has a flat distribution instead of a normal distribution
Yes I have noticed that...that seems like a trouble
BvU said:
And the 'for each x' can lead to a wrong result. Numerical example:

a = 400 ± 40, b = 360 ± 20, so c = 40 ± 45

And let a = 200 x ⇒ x = 2.0 ± 0.2

Your array of x is, for example, 1.78, 1.79, ... 2.22.
I mean a different way actually. Think like this,

You have a some range of ##x## values from ##0## to ##2##. Then you are doing a loop such that
for each value of ##x##:
$c \pm \delta c = (a(x) \pm \delta a) - (b \pm \delta b)$
if $|c|≤Δc$:
add x to an array
 
I have revised the problem
 
Last edited:
Arman777 said:
I have revised the problem
Makes my reply quasi-incomprehensible :cry: !

Even so: some more editing is in order :wink:
 
  • Like
Likes   Reactions: Arman777
BvU said:
Makes my reply quasi-incomprehensible :cry: !

Even so: some more editing is in order :wink:
Waiting
 
Waiting for ? Ah - ##\TeX## details adjusted.
Arman777 said:
Then you are doing a loop such that
That doesn't tell us what the probability distribution of ##x## is going to be.
 
Arman777 said:
I have revised the problem

Please don't do that. Because...

BvU said:
Makes my reply quasi-incomprehensible :cry: !
 
Vanadium 50 said:
Please don't do that. Because...
sorry about that
 
  • #10
BvU said:
Waiting for ? Ah - ##\TeX## details adjusted.
That doesn't tell us what the probability distribution of ##x## is going to be.
Well I have printed the values and they were normally distributed
 
  • #11
So how can that be ? Pure luck, or something else, like the dependency of ##a## on ##x## ?
Or some monte carlo principle ?
 
  • #12
BvU said:
So how can that be ? Pure luck, or something else, like the dependency of ##a## on ##x## ?
Or some monte carlo principle ?
Its actually has the similar idea as your. The relationship between ##x## and ##a## is linear. So as we increase ##x## at some point it hits ##-\delta c## and another point it hits ##+\delta c##. The problem how can I calculate ##x## and ##\delta x## from the given data list.

Figure_1.png


This is the histpgram of the ##x## values.
 
  • #13
Looks pretty uniform to me :smile:
 
  • #14
BvU said:
Looks pretty uniform to me :smile:
So mean(x_values) and std(x_values) are the correct solution ?
 
  • #15
According to the numerical example, mean would be ok, st dev would be too optimistic
 
  • #17
Yes, I know. In the example I started with 2##\pm##0.2. The flat distribution gave a sigma of 0.13, about a factor of ##\sqrt 3## lower. Your procedure cuts at ##\pm\sigma## -- whereas a gauss distribution has , what was it again, 68% within ##\pm\sigma## and the remainder outside, contributing happily to the standard deviation.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
8K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 13 ·
Replies
13
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 6 ·
Replies
6
Views
5K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K