Looking for empirical equation in experimental data

Click For Summary

Discussion Overview

The discussion revolves around finding an empirical equation to model experimental data in nuclear research, specifically focusing on a dependent variable G that is influenced by three independent variables A, B, and C. Participants explore various modeling approaches, including linear and polynomial forms, to better fit the experimental results.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant describes G as a function of three variables, suggesting that G is approximately linear with respect to each variable when others are held constant, but questions the validity of a simple additive model.
  • Another participant proposes using a linear model with interaction terms to account for non-linear relationships, suggesting that the model could include terms like z = a*x + b*y + c*x*y.
  • A different participant introduces a specific function f(A,B,C) = AB + AC + BC, noting that while it is linear in each variable, it does not conform to a simple additive form with constants.
  • There is a suggestion to apply least-squares fitting to find the best linear approximation to the data, with a detailed explanation of the matrix formulation involved.
  • One participant reports that a more complex model including higher-order terms improved the correlation between estimated and measured values, raising questions about the underlying significance of the additional terms used.

Areas of Agreement / Disagreement

Participants express differing views on the appropriate modeling approach, with some advocating for simpler linear models and others suggesting more complex polynomial forms. The discussion remains unresolved regarding the best method to accurately represent the data.

Contextual Notes

Participants mention issues with outliers in the data and the impact of removing certain terms on the model's accuracy, indicating that the choice of model may depend on specific data characteristics.

Curran919
Messages
2
Reaction score
0
I am an engineer student working in nuclear research. I am performing some experiments looking for an empirical equation to apply to results in a test section, but am having trouble making a mental leap. Here is the core of the problem with all of the engineering 'fat' trimmed off:

I have a variable with three dependents:
G = f(A,B,C)

I have shown that G is more or less linear WRT each variable for multiple values of the other variables (sorry, I'm an undergrad engineer, mathematic notation is lacking):
G=f(A) of O(1) for every B,C
G=f(B) of O(1) for every A,C
G=f(C) of O(1) for every A,B


I would like to say that because of this,
G = f(A)+g(B)+h(C)
or even,
G = aA+bB+cC+d where a,b,c,d are constants

but this would only be true if the slope of f(A) where constant regardless of B,C (and the same for f(B)/f(C)). Of course, it isn't. Is what I've said correct, and if so, is there an alternative conclusion I can make?

G = (A-a)(B-b)(C-c)?
 
Physics news on Phys.org
A common model for empirical work is the linear model. Where things don't fit so well, the experimenter can include interaction terms. Thus, the model might be (for two independent variables x and y)

z = a*x + b*y + c*x*y

The a, b, and c are constants are need to be fit to the data. You might check some books on the Design of Experiments, as such modeling is often done in that context.
 
Consider the function f(A,B,C) = AB+AC+BC. This is linear in each variable, but not globally approximate to anything on the form Aa+Bb+Cc for constants a, b and c. If you are only interested in local behavior, you should add the constraints of a, b and c. Then maybe you can get an approximate linear form.

If you know some linear algebra, you could find the linear expression that is "closest" to your set of data-points in the manner that the sum of squares of the differences from the data points and the values of a linear expression is minimized. If you have values for f(x,y,z) at [tex](x_1,y_1,z_1), (x_2,y_2,z_2),...,(x_n,y_n,z_n)[/tex], solve for the least-squares solution to [tex]Mx=b[/tex], where
[tex]M = \begin{bmatrix} x_1 & y_1 & z_1 & 1 \\ x_2 & y_2 & z_2 & 1 \\ \vdots & \vdots & \vdots & \vdots \\ x_n & y_n & z_n & 1 \end{bmatrix}[/tex]

and

[tex]b = \begin{bmatrix} f(x_1,y_1,z_1) \\ f(x_2, y_2, z_2) \\ \vdots \\ f(x_n , y_n , z_n) \end{bmatrix}[/tex].

I.e. solve for [tex]M^TMx = M^Tb[/tex].



Then one of your [tex]x = \begin{bmatrix} a \\ b \\ c \\ d \end{bmatrix}[/tex] will give an approximation [tex]f(x,y,z) \approx ax+by+cz+d[/tex] on these data-points. The more "linear" your function behaves the better the approximation.

If you suspect it to be on other forms, such as higher degree polynomials or linear combinations of entirely different functions this can also be done similarly. To do this: If you think the function is approximately a linear combination of the functions [tex]g_1(x,y,z),...,g_k(x,y,x)[/tex], substitute [tex]\begin{bmatrix} g_1(x_i,y_i,z_i) & \ldots & g_k(x_i,y_i,z_i) \end{bmatrix}[/tex] for the i'th row of the matrix M, and solve for some k-vector x, which will be the coefficients of the functions. The linear form corresponds to the case where [tex]g_1(x,y,z) = x, g_2(x,y,z) = y, g_3(x,y,z) = z[/tex], and [tex]g_4(x,y,z) = 1[/tex].

Often [tex]M^TM[/tex] will be invertible giving a unique solution [tex](M^TM)^{-1}M^Tb[/tex], and inverting will not be very difficult as the matrix M^TM is a k x k matrix where k is the number of functions you are considering. You should probably constrain your data-set so you can multiply the matrices without difficulty. Hope this helps, good luck.
 
Last edited:
Thank Jarle, very helpful.

Indeed using [tex] f(x,y,z) \approx ax+by+cz+d [/tex] gave a poor correlation between the estimated and the measured readings. I tried:

[tex] f(x,y,z) \approx axyz+bxy+cxz+dyz+ex+fy+gz+h [/tex]

and the correlation appears much better. I think I have some outliers in the measurement data, so I will remove a few instances and see what happens. Is there an underlying explanation to the terms that I used, or is it just a mathematical catch-all (or more terms [tex]\approx[/tex] less error)? I tried nixing the terms that seemed to have a low correlation, which was okay for [tex]axyz[/tex], but removing any of the second order terms introduced considerable error.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 27 ·
Replies
27
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 66 ·
3
Replies
66
Views
8K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K