Two data sets and want to do a regression excel y = C(x^n)

Click For Summary

Discussion Overview

The discussion revolves around performing a regression analysis in Excel for two data sets, specifically seeking a model of the form y = C(x^n), where C and n are constants. Participants explore different approaches to achieve this and express concerns about the suitability of Excel for statistical analysis.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant inquires about how to perform a regression in Excel for the model y = C(x^n).
  • Another suggests using the regression tool from the data analysis add-in and provides a link to a guide.
  • A participant notes that the referenced worksheet deals with a different model, y = b e^{mx}, and highlights the differences in transformation when applying logarithms to both models.
  • It is pointed out that taking logarithms leads to different forms for the two models, which may affect the regression approach.
  • Concerns are raised regarding the reliability of Excel for regression analysis, citing issues with its statistical tools and algorithms.
  • Participants discuss the implications of treating exponential and polynomial regression as identical, emphasizing that this can lead to problems in analysis.
  • One participant requests clarification on the problems that arise from treating the two regression types as the same.

Areas of Agreement / Disagreement

Participants express differing views on the appropriateness of Excel for regression analysis and the validity of using logarithmic transformations for the models discussed. There is no consensus on the best approach or the reliability of Excel's regression capabilities.

Contextual Notes

Limitations include potential misunderstandings regarding the application of logarithmic transformations to different regression models and the unresolved concerns about Excel's statistical functionality.

engineer23
Messages
68
Reaction score
0
I have two data sets and want to do a regression so that the equation that relates them is of the form y = C(x^n), where C and n are constants.

How do I do this in Excel?
 
Physics news on Phys.org
engineer23 said:
I have two data sets and want to do a regression so that the equation that relates them is of the form y = C(x^n), where C and n are constants.

How do I do this in Excel?

Use the regression tool from the data analysis addin. Here is a guide that may help:

http://www.tcc.edu/faculty/webpages/pgordy/Excel/expstat.pdf

CS
 
Last edited by a moderator:


Note that the aforementioned worksheet deals with the model

[tex] y = b e^{mx}[/tex]

and not to

[tex] y = C x^n[/tex]
 


Take log on both sides. Then the model will reduce to the common one.
 


No, it does not. The model given in the reference

[tex] y = b e^{mx}[/tex]

reduces to

[tex] \ln y = \ln b + mx[/tex]

The other model reduces to

[tex] \ln y = \ln C + n \ln x[/tex]

With the obvious changes in notation the first is a simple linear regression model, in which the original [tex]x[/tex] values can be used. In the second both [tex]y[/tex] and [tex]x[/tex] must have their logarithm calculated. Blindly applying the first approach would miss this: that was my point.

Of course, there is the question of why Excel would be used for regression in the first place.
 


statdad said:
Of course, there is the question of why Excel would be used for regression in the first place.

Just qurious, why do you not like to use Excel for regression?

CS
 


Excel doesn't have a very good reputation with regards to statistics (I haven't used the most recent version much, but since the problems that existed in previous versions were never addressed, I would be surprised if they were fixed this time).

  1. The histogram algorithm is flawed. There is no check to make sure the final "bin" (usually labeled as other) is the width as the others
  2. The default rendering of the histogram leaves gaps between the rectangles. Easy to fix, but annoying
  3. The regression procedure has many problems. The residual plots are poorly done. The fit mechanism has a tendency to fit multiple-regression models even when there is a high amount of collinearity in the predictors
  4. (This may be considered minor by many, but it is an example of something that shouldn't occur) In the situations where you want to force a regression through the origin (i.e., no intercept term) Excel still reports a value for [tex]R^2[/tex], which is inappropriate
  5. Many plots that are commonly used aren't available (boxplot, dotplot, two of the simplest) without using an external Add-In (which typically means buying the Add-In)
  6. Support for non-parametic/robust methods in general, and regression in particular, seems to be non-existent. It would be a great improvement to have (at least) one-variable rank regression, even with Wilcoxon weighting, included

There are other problems, but many are not related to regression. I understand why the temptation to use Excel is so high: immense market penetration - it seems almost every school/workplace has it. It is fantastic for many purposes - I just don't think regression in particular, and statistics in general is one of those purposes.

I hope this hasn't sounded too much like an angry rant - I apologize if it has.
 


statdad said:
No, it does not.

?? Whatever the model (of the two you considered) may be, after taking log on both sides, what remains to convert the the model to the common one is to suitably rename the variable(s). OP mentioned ony one model. So, no question of blindly applying same trick to "both".
 
Last edited:


ssd said:
?? Whatever the model (of the two you considered) may be, after taking log on both sides, what remains to convert the the model to the common one is to suitably rename the variable(s). OP mentioned ony one model. So, no question of blindly applying same trick to "both".

No - exponential regression (the method you pointed out) and polynomial regression (the question posed by the OP) are not identical, and treating them as such causes problems.
 
  • #10


statdad said:
No - exponential regression (the method you pointed out) and polynomial regression (the question posed by the OP) are not identical, and treating them as such causes problems.
I don't get your point. Can you point out some of such problems? It will be a nice help.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 13 ·
Replies
13
Views
5K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 64 ·
3
Replies
64
Views
6K
  • · Replies 2 ·
Replies
2
Views
2K