Discussion Overview
The discussion revolves around the implications of using a zero mean for inputs in the context of gradient descent and backpropagation algorithms in artificial neural networks (ANNs). Participants explore the relationship between the covariance matrix of inputs and the Hessian, as well as the significance of the mean in these calculations.
Discussion Character
- Technical explanation
- Conceptual clarification
- Debate/contested
Main Points Raised
- One participant notes that the error function in backpropagation is defined as E(X,W) = 0.5(target - W^T X)^2 and questions the necessity of a zero mean for inputs, suggesting that shifting the mean should not affect the covariance matrix.
- Another participant suggests that the author may be performing "whitening," which involves normalizing data to have a mean of zero and a covariance matrix equal to the identity, commonly used in image processing.
- A participant raises concerns about the implications of using a covariance matrix equal to the identity, questioning the uniqueness of eigenvalues and the logic behind using lambda*I - cov(X) instead of cov(X) inverse.
- There is mention of a "standard Fresnel representation for the determinant of a symmetric matrix," with one participant expressing confusion about its definition and relevance to Gaussian multivariate optimization.
- Another participant expresses gratitude for the mention of the Fresnel integral, indicating it clarifies their understanding of eigenvalue distributions derived from random matrices.
Areas of Agreement / Disagreement
Participants express differing views on the significance of a zero mean for inputs and the implications for the covariance matrix and eigenvalues. The discussion remains unresolved regarding the specific mathematical relationships and interpretations presented in the paper.
Contextual Notes
There are unresolved questions about the mathematical steps involved, particularly concerning the use of determinants and eigenvalues in the context of the covariance matrix and the implications of the mean being zero.