A Good Examples of Causation does not Imply Correlation

  • A
  • Thread starter Thread starter WWGD
  • Start date Start date
  • Tags Tags
    Correlation
AI Thread Summary
The discussion explores the concept that causation does not imply correlation, particularly when the relationship is non-linear. Examples like Hooke's law and the quadratic relationship between voltage and power are cited to illustrate scenarios where causation exists but correlation is zero. Participants emphasize the importance of distinguishing between general correlation and linear correlation, noting that certain relationships, such as temperature variations throughout the year, can also exhibit zero correlation despite underlying causative factors. The conversation also touches on the significance of mutual information as a measure of statistical association, which can highlight relationships even when traditional correlation metrics fail. Overall, the thread seeks to clarify the nuances of causation and correlation in statistical contexts.
  • #51
FactChecker said:
If you are going to teach an introductory class, I think you should be careful about these terms. Saying that A and B are uncorrelated implies that A and B are random variables. Saying that A causes B implies that A and B are events. The two implications are conflicting. It would be better to talk about random variables X and Y being correlated and about the event ##X \in A## implying (not causing) the event ##Y \in B##. (You could talk about events A and B being independent, but not uncorrelated).
Also, you should be careful to indicate that "causation" is a logic problem that depends on subject knowledge, not a statistical problem.
Well, this is part of the problem of trying to popularize not-so-simple topics. I have to do enough handwaving to start a tornado.
 
  • Haha
Likes Klystron
Physics news on Phys.org
  • #52
This is just a Stats 101 for incoming freshmen , with little to no background in Math/Philosophy. I was just asked to incorporate this topic to the existing curriculum. At any rate, there is still a lot of handwaving when introducing the CLT, Hypothesis Testing, etc. I will include the caveat of the necessary oversimplification and just direct the curious ones to more advanced sources.
 
  • #53
WWGD said:
Consider Hooke's law and the causal relation ( Causality is still somrwhat of a philosophical term at this stage, so I am settling for accepted Physical laws as describing /defining causality) with a shift, to## y=k(x-1)^2## . Then the samples at opposite sides of the above equalities described by it, as well as by other Physical laws may give rise to uncorrelated data sets. I cannot afford to enter or present serious background on causality in an intro-level class.

For the purposes of statistics, the main point is not the philsopical definition of causality, but rather the mathematical point that the correlation between A and B is ( as @FactChecker says) only defined for random varables A and B. If A and B are physical measurements, they are only random variables if some scheme is specified for taking random samples of the measurements. So a "Hookes Law" relation between A and B does not define A and B as random variables. To suggest to an introductory class that the names of two measurements (e.g. length, force or height, weight) implies the concept of a correlation or a lack of correlation between the measurements is incorrect. A fundamental problem in applications of statistics is how to design sampling methods. You cannot provide a coherent example of "measurement A is not correlated with measurement B" without including the sampling method.
 
  • #54
You do not need to dwell in class on the technicalities, but you can arm yourself with a few simple, intuitive, examples. I think that is what you want from this thread.
If you randomly select a person and measure the lengths of their arms, those lengths are random variables. The event that the selected right arm is more than 2 feet long is an event. The right arm lengths are highly correlated with left arm lengths, but the right arm being over 2 feet long does not cause the left arm to be over two feet long -- it just implies, it does not cause.
 
  • #55
Stephen Tashi said:
For the purposes of statistics, the main point is not the philsopical definition of causality, but rather the mathematical point that the correlation between A and B is ( as @FactChecker says) only defined for random varables A and B. If A and B are physical measurements, they are only random variables if some scheme is specified for taking random samples of the measurements. So a "Hookes Law" relation between A and B does not define A and B as random variables. To suggest to an introductory class that the names of two measurements (e.g. length, force or height, weight) implies the concept of a correlation or a lack of correlation between the measurements is incorrect. A fundamental problem in applications of statistics is how to design sampling methods. You cannot provide a coherent example of "measurement A is not correlated with measurement B" without including the sampling method.
My actual take on causation of B by A would be that in several independent experiments, variable A was controlled for, ( instances of it were) selected randomly so that the major non-error variation in B is explained through variation in A. But there is little room to delve into this, to specify how/where random variables or events come into place in this setting. And, yes, I was assuming a scheme to take random samples from each has been defined. Students I have had have trouble understanding what a probability distribution is, so delving into events and random variables is overkill, as I am only allotted a single class to go into this topic.
 
  • #56
WWGD said:
But there is little room to delve into this, to specify how/where random variables or events come into place in this setting.

I don't understand how an introductory course in statistics can have little room for discussing random variables!
 
  • #57
Stephen Tashi said:
I don't understand how an introductory course in statistics can have little room for discussing random variables!
Nursing and other humanities students , high school with hardly any/ very poor Math or Science background, I guess. Not a comfortable position for me to be in, for sure.
 
Last edited:
  • #58
WWGD said:
Consider Hooke's law and the causal relation ( Causality is still somrwhat of a philosophical term at this stage, so I am settling for accepted Physical laws as describing /defining causality)

But even in Hooke's law, does the displacement cause the force, or does the force cause the displacement?

Yes!
 
  • #59
WWGD said:
Ok, so if the causality relation between A,B is not linear, then it will go unnoticed by correlation, i.e., we may have A causing B but Corr(A, B)=0. I am trying to find good examples to illustrate this but not coming up with much. I can think of Hooke's law, where data pairs (x, kx^2) would have zero correlation. Is this an " effective" way of illustrating the point that causation does not imply ( nonzero) correlation? Any other examples?

Here's a nice figure with some examples illustrating your point:
1605632004715.png

https://janhove.github.io/teaching/2016/11/21/what-correlations-look-like
 
  • Like
Likes Klystron, Jarvis323, jim mcnamara and 1 other person
Back
Top