Discussion Overview
The discussion centers around the challenge of fitting a plane to sampling data represented by variables x and y, specifically in the context of approximating a probability distribution p(x,y) without resorting to data binning. Participants explore various methods for achieving this fit, including linear regression, maximum likelihood estimation, and projection techniques.
Discussion Character
- Exploratory
- Technical explanation
- Debate/contested
- Mathematical reasoning
Main Points Raised
- One participant suggests using multiple linear regression to find coefficients that minimize the error between the plane and the data.
- Another participant notes the necessity of ensuring that the area under the fitted plane integrates to 1 when fitting a probability distribution, raising questions about how to impose this constraint.
- Some participants express concern that linear regression typically involves binning data, which the original poster wishes to avoid, leading to the suggestion of maximum likelihood fitting as an alternative.
- There is a discussion about the nature of the data and its bounds, with one participant emphasizing the importance of understanding the range of x and y values to fit a probability distribution accurately.
- One participant proposes using non-linear bases for projection if the data does not conform to a linear distribution, suggesting the use of orthogonal polynomials.
- Another participant questions the meaning of "projecting a data point" onto a function that represents a probability density, highlighting the challenge of working with isolated measurements.
- There are suggestions for interpolation methods, including piecewise linear functions, to create a surface that could represent the data without binning.
Areas of Agreement / Disagreement
Participants express differing views on the appropriate methods for fitting a plane to the data, with no consensus reached on a single approach. The discussion remains unresolved regarding the best technique to use without binning the data.
Contextual Notes
Participants highlight limitations related to the assumptions about the data, the need for a suitable basis for projection, and the implications of not binning the data on the fitting process.