How accurate is the correlation coefficient for log-log data?

  • Context: Graduate 
  • Thread starter Thread starter Old Guy
  • Start date Start date
  • Tags Tags
    Fit
Click For Summary
SUMMARY

The discussion centers on the validity of using the correlation coefficient for log-log transformed data in linear regression analysis. The user successfully calculated the slope of their data set after applying logarithmic transformations to both x and y values. However, a coworker raised concerns about the appropriateness of the correlation coefficient for assessing goodness of fit in this context. The consensus is that while the correlation coefficient may not be valid post-transformation, the R² statistic is suitable for evaluating the fit of the transformed data.

PREREQUISITES
  • Understanding of linear regression techniques
  • Familiarity with logarithmic transformations in data analysis
  • Knowledge of correlation coefficients and R² statistics
  • Experience with data visualization, particularly log-log plots
NEXT STEPS
  • Research the implications of data transformation on statistical measures
  • Learn about R² statistics in the context of transformed data sets
  • Explore alternative measures of goodness of fit for non-linear data
  • Investigate best practices for interpreting log-log plots in regression analysis
USEFUL FOR

Data analysts, statisticians, and researchers involved in regression analysis and data transformation techniques, particularly those working with log-log data sets.

Old Guy
Messages
101
Reaction score
1
I am familiar with linear regression and the correlation coefficient. My current problem involves a data set that is pretty linear on a log-log plot. I have calculated the slope by taking logs of all my x's and y's, and doing the linear regression on the transformed data set. The resulting slope appears to be correct, and I'm happy with that.

The question is, how good a fit do I have on the data? I planned to simply calculate the correlation coefficient on the transformed data, but a coworker challenged this - said that the transformations alter the measurements in a way that makes the typical correlation coefficient calculation invalid.

Is that correct? And if it is, what is the correct measure of goodness of fit for log-log data? Thanks.
 
Physics news on Phys.org
Old Guy said:
I am familiar with linear regression and the correlation coefficient. My current problem involves a data set that is pretty linear on a log-log plot. I have calculated the slope by taking logs of all my x's and y's, and doing the linear regression on the transformed data set. The resulting slope appears to be correct, and I'm happy with that.

The question is, how good a fit do I have on the data? I planned to simply calculate the correlation coefficient on the transformed data, but a coworker challenged this - said that the transformations alter the measurements in a way that makes the typical correlation coefficient calculation invalid.

Is that correct? And if it is, what is the correct measure of goodness of fit for log-log data? Thanks.

If it's linear with a log-log transform, then the correlation is based on a log-log data set with transformed expectations. As along as it's clear what the data is, I don't see a problem.

EDIT: I forgot to answer your second question. If you have a linear plot in a standard regression on transformed data, the [tex]R^2[/tex] statistic should be appropriate for the transformed data (but only for the transformed data.)
 
Last edited:

Similar threads

  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 3 ·
Replies
3
Views
12K
  • · Replies 5 ·
Replies
5
Views
9K
  • · Replies 7 ·
Replies
7
Views
8K