How would I correlate many variables to a few coefficients?

  • Context: Graduate 
  • Thread starter Thread starter mcovalt
  • Start date Start date
  • Tags Tags
    Coefficients Variables
Click For Summary

Discussion Overview

The discussion revolves around the challenge of correlating multiple variables, specifically the concentrations of various substances in chemical compounds, with coefficients derived from fitted asymmetrical sigmoid curves representing strength as a function of time and temperature. The context includes experimental data and the desire to find a method for predicting or estimating these coefficients based on compound composition.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant describes having 550 asymmetrical sigmoid curves fitted with 4 varying coefficients, representing strength as a function of time and temperature for different compounds made from varying substances.
  • Another participant clarifies the scenario, emphasizing the need to predict the coefficients based on the concentrations of up to 7 substances from a total of 20 in each compound.
  • A participant expresses uncertainty about the use of the term "correlation" and states a goal to produce functions that estimate the coefficients based on the substances in a compound.
  • One participant suggests using the method of least squares to fit a system of multiple regression equations as a potential solution.
  • Another participant raises a concern about the appropriateness of least squares fitting, questioning whether the errors in the experimental curves are random or fixed, which could affect the validity of the method.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the best approach to correlate the variables with the coefficients. There are differing views on the use of least squares fitting and the nature of the errors in the data.

Contextual Notes

The discussion highlights uncertainties regarding the nature of measurement errors in the experimental data and the implications for using least squares fitting. There is also ambiguity in the terminology used, particularly around the concept of correlation versus prediction.

mcovalt
Messages
28
Reaction score
0
I have around 550 asymmetrical sigmoid curves fitted to a function with 4 varying coefficients. Each of these curves represent strength as a function of time and temperature for a different compound. Each compound is made up of varying substances at varying concentrations.

Overall, I have 550 curves of 550 different compounds, 20 substances, and each compound has no more than 7 of these 20 substances. I'm trying to correlate these substances and their concentrations with their coefficients.

If I had one substance at varying concentration, I know how I'd go about finding it's correlation to the coefficients. I'd find the trend of:

coefficienti(substance,concentration)

But I have no idea how I'd go beyond just one substance, and I've got a lot more than two! I'm hoping someone can point me in the right direction.
 
Physics news on Phys.org
mcovalt said:
I have around 550 asymmetrical sigmoid curves fitted to a function with 4 varying coefficients.

I'll suggest how to rewrite your question - see if the details are correct.

I'm trying to predict the "strength" vs "tempertature" and "time" curves of some chemical compounds as a function of the concentrations of their component substances.

I have 20 different substances. From these, I created 550 chemical compounds by combining up to 7 of the substances. Among the different compounds, the concentrations of the various substances varies..

For each compound, I experimentally measured strength as a function of temperature and time. I have selected a particular family of functions to fit the experimental data. Each member of this family is specified by specifying the values of 4 constants.

I would like to predict the value of the 4 constants as a function of compound's composition - i.e. the substances that are in it and their concentrations.

----

(Perhaps you don't want the predict he value of the 4 constants - perhaps you only want to find "correlations"? )
 
Thank you Stephen! You have a way with words! That's exactly the scenario and problem.

Each experimentally produced curve follows an asymmetrical "S" shape. These 550 experimentally produced curves have been fitted with an equation containing four constants to alter the function to fit each experimentally produced curve.

I may misuse the word "correlation". My end goal is to produce four functions to estimates each of these four constants when given the substances of an imaginary compound.

Since writing the post, I believe I have stumbled upon an answer. I would use the method of least squares to fit a system of multiple regression equations. I believe this ought to do it.
 
mcovalt said:
I would use the method of least squares to fit a system of multiple regression equations. I believe this ought to do it.

It might. Fitting equations to data using least squares is most often done when the variables being predicted had some sort of random error in their measurement. In your case, it isn't clear whether the error in fitting is mostly random error. For example, if your experimental curves are very precise then if a least squares fit produces a big error for the coeffiicent of one curve, it would always produces a big error for that curve, because there the errors in the coefficients aren't random, they are fixed.
 

Similar threads

  • · Replies 29 ·
Replies
29
Views
7K
  • · Replies 16 ·
Replies
16
Views
3K
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 4 ·
Replies
4
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 21 ·
Replies
21
Views
4K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K