Discussion Overview
The discussion revolves around the use of correlation to predict values from multiple variables, particularly in the context of calculating expected values given correlated random variables. Participants explore the implications of joint distributions and the potential application of techniques like principal component analysis and multiple regression.
Discussion Character
- Exploratory, Technical explanation, Debate/contested
Main Points Raised
- One participant seeks to understand how to predict values from multiple correlated variables, specifically asking for the form of E(Y | A, B, C).
- Another participant questions whether the term "predict" is being used correctly, suggesting a distinction between predicting values and computing expected values based on distribution functions.
- A participant clarifies their intent to calculate the expected value of Y given A, B, C, and known correlations, as well as necessary variances.
- Concerns are raised about the application of the formula E(Y|X) to random variables, with a focus on whether the variables have a joint multinormal distribution.
- One participant proposes using principal component analysis to transform correlated variables into independent ones, suggesting this could facilitate multiple regression analysis.
- Another participant notes that while marginal distributions may be normal, this does not guarantee that the joint distribution is multivariate normal, indicating a need to verify this assumption.
Areas of Agreement / Disagreement
Participants express differing views on the interpretation of "predict" versus "compute expected value," and there is no consensus on the assumptions regarding the joint distribution of the variables.
Contextual Notes
Participants highlight the importance of understanding the assumptions behind the joint distribution of the variables, particularly in relation to normality and independence.