Finding the Right Curve Fit: A Question About Curve Fitting

In summary: If anyone has a not too large data set (up to about 20 points) from some experiment I would be interested...In summary, the conversation discusses different approaches to curve fitting, including using linear or quadratic fits if the data appears to follow a certain trend, using splines or Bezier curves for a more artistic approach, and considering the nature of the data (deterministic or with a random component) before choosing a fitting method. The importance of understanding the data and using mathematical knowledge is also emphasized.
  • #1
fog37
1,568
108
Hello everyone,

A question about curve fitting. Hope you can share some helpful hints.

Given some data we plot a graph y versus x. How do we approach the problem from a curve fitting standpoint? For example, using Excel, if the data "looks" linear from the graph, we choose a linear fit and get a best fit line.
If the data looks "parabolic", we choose a quadratic fit, etc. But what if we have no idea of what fit may be the appropriate fit and the data distributes along a curve but the curve does not look like any familiar curve? What is the wise way to proceed?

Thank you.
Fog37
 
Mathematics news on Phys.org
  • #3
Thank you! I will read up on splines today.
 
  • #4
fog37 said:
Given some data we plot a graph y versus x. How do we approach the problem from a curve fitting standpoint?

There is no unique answer to that question. If the data represents some phenomena in the real world then you must consider what family of mathematical functions represent it using your knowledge of that phenomena. It isn't a question that can be answered just by considering pure mathematics.

If your goal is artistic (to create a pleasing curve) then, as Nidum suggests, splines of various types could be useful.
 
  • Like
Likes lavinia and symbolipoint
  • #5
It might be impossible to find a good fitting function by hand. A solution is to cut up the curve in a lot of pieces, where each piece can be approximated by a simple curve (a polynomial or something familiar).
 
  • #6
You may consider a linear fit and then look for the confidence interval for the coefficient m of x in y=mx+b. There is test for whether the coefficient is 0. Maybe you can do similar tests for coefficients in quadratic regressions. Usually, or at least often, lack of linearity has to see with interactions between the variables.
 
  • #7
Stephen Tashi said:
There is no unique answer to that question. If the data represents some phenomena in the real world then you must consider what family of mathematical functions represent it using your knowledge of that phenomena. It isn't a question that can be answered just by considering pure mathematics.

If your goal is artistic (to create a pleasing curve) then, as Nidum suggests, splines of various types could be useful.
That is why students at colleges seem to be required to earn credit in College Algebra. By getting through THAT course, student has enough sense to perform some basic curve-fitting; and he never knows exactly when he may find some data which he may like to fit to a function. One problem about this is that, as I best remember, curve-fitting exercises may be missing from College Algebra courses - but the basic idea is there.
 
  • #8
I've had some good results fitting polynomials where I not only require the curve to pass through the specified data points, but also specify slope and curvature at specific points. There are lots of option, but you have to apply what you know about the situation.
 
  • #9
Dr.D said:
I've had some good results fitting polynomials where I not only require the curve to pass through the specified data points, but also specify slope and curvature at specific points. There are lots of option, but you have to apply what you know about the situation.
What do you use, Lagrange interpolation?
 
  • #10
I have used the Lagrange technique, but I have used a wide variety of other methods, some of them original with me.
 
  • #11
A lot depends on if the data is deterministic or has a random component. If there is no random component, then splines of a particular order can allow you to fit the data with a piecewise defined function that has smooth derivatives up to an order. If there is a random component, then it is possible that the same x value will have multiple y values. That rules out a direct application of splines. Some sort of statistical pre-process would have to be done before a spline can be used.
 
Last edited:
  • #12
fog37 said:
Hello everyone,

A question about curve fitting. Hope you can share some helpful hints.

Given some data we plot a graph y versus x. How do we approach the problem from a curve fitting standpoint? For example, using Excel, if the data "looks" linear from the graph, we choose a linear fit and get a best fit line.
If the data looks "parabolic", we choose a quadratic fit, etc. But what if we have no idea of what fit may be the appropriate fit and the data distributes along a curve but the curve does not look like any familiar curve? What is the wise way to proceed?

Thank you.
Fog37
Like you say, GRAPH the (x,y) points and decide what you see? Can you identify a type of equation that might fit? Pick the form of equation and set up a system of equations and try to find the unknown coefficients or other unknown values. You would probably what some computer power to help, and you would expect nonwhole constants to finish finding the equation or function which fits your data.
 
  • #13
Splines don't necessarily have to go through all of the data points .

Bezier curves

Essentially they generate the best smoothed curve fit to a set of data points .
 
  • Like
Likes FactChecker
  • #14
Nidum said:
Splines don't necessarily have to go through all of the data points .

Bezier curves

Essentially they generate the best smoothed curve fit to a set of data points .
Interesting. It looks like the curves will not go through any points except the endpoints of each curve. So, for instance, the 3 intermediate points of a quartic Bezier curve are just control points, not points on the curve, regardless of how well behaved the points are. I don't know how acceptable that would be for the OP. I have no experience with Bezier curves.
CORRECTION: Looking at @Nidum post below, it looks like the Bezier curve does go through the points (or close enough).
 
Last edited:
  • #15
Splines.jpg
 

Attachments

  • Splines.jpg
    Splines.jpg
    11.6 KB · Views: 505
  • Like
Likes FactChecker
  • #16
If anyone has a not too large data set (up to about 20 points) from some experiment I would be interested to see what sort of Belzier spline my CAD would generate and how good a fit it would actually be .
 
  • #17
If you're interested going the B-spline interpolation route, "The NURBS Book" by Piegl and Tiller has a few algorithms that I've personally used. They work well.
 

What is curve fitting?

Curve fitting is a statistical method used to find a mathematical function that best represents a set of data points. This allows for the prediction and analysis of trends in the data.

Why is curve fitting important?

Curve fitting is important because it allows us to make predictions and understand patterns in data. It is commonly used in fields such as physics, economics, and engineering to model and analyze real-world phenomena.

What are the different types of curve fitting methods?

There are several types of curve fitting methods, including linear regression, polynomial regression, and non-linear regression. These methods differ in the type of mathematical function used to fit the data.

How do you choose the best curve fit for your data?

The best curve fit for your data depends on the type of data and the purpose of your analysis. It is important to choose a method that fits the data well and does not overfit or underfit the data. This can be determined through statistical measures such as the coefficient of determination (R-squared) and visual inspection of the fit.

What are some common challenges in curve fitting?

Common challenges in curve fitting include choosing an appropriate mathematical function, dealing with outliers and missing data, and avoiding overfitting the data. It is important to carefully consider the data and use appropriate techniques to address these challenges for a successful curve fitting analysis.

Similar threads

  • General Math
Replies
22
Views
3K
  • MATLAB, Maple, Mathematica, LaTeX
Replies
9
Views
1K
Replies
4
Views
1K
  • General Math
Replies
19
Views
1K
  • General Math
Replies
12
Views
949
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
853
  • STEM Educators and Teaching
Replies
5
Views
623
Back
Top