Kernel Estimation: Questions about Bandwidths & R Functions

eoghan · Jul 7, 2015

Hi there!
I'm new in the technique of Kernel Estimation, so it could be that the following questions are really elementary. There is something I don't understand about the bandwidths. Using R I have two functions to perform the estimate:
kde2d from MASS
bkde2D from KernelSmooth
Here are my questions
1) I see from the source code of kde2d that it divides the bandwidth provided by the user by 4 and I've seen this practice also somewhere else. Why the bandwidth is divided by 4?
2) kde2d perform uses an axis-aligned bivariate normal distribution, while bkde2D uses a standard bivariate normal distibution. Are they the same?

Thank you

Jhero · Jul 7, 2015

in advance!1) The bandwidth is divided by 4 because it is used for each dimension of the data. For example, if the data consists of two variables, X and Y, then the bandwidth is divided by 4 to get the bandwidth for each dimension (X and Y).2) No, they are not the same. The axis-aligned bivariate normal distribution has a covariance matrix that is diagonal, while the standard bivariate normal distribution has a full covariance matrix. The axis-aligned distribution assumes that the data points are independent and have no correlation between them, while the standard bivariate normal distribution allows for correlation between the data points.

Kernel Estimation: Questions about Bandwidths & R Functions

What is kernel estimation and how does it work?

What is bandwidth and why is it important in kernel estimation?

How do I choose the optimal bandwidth for kernel estimation?

Are there specific R functions for performing kernel estimation?

Can kernel estimation handle non-continuous data?

Similar threads

Hot Threads

Recent Insights