Watts
- 37
- 0
Assume I have a data set that I am trying to find a distribution that describes how the data is distributed. Assume I have found a function say f(x) = e^{ - x^2 } that describes the distribution of data. Statistics tells me that my first move in doing so is to normalize this function so that \int\limits_{ - \infty }^\infty {P(x)dx} = 1. The common approach is to integrate \int\limits_{ - \infty }^\infty {e^{ - x^2 } dx} = \sqrt \pi and multiply the function times the reciprocal of that integral \frac{1}{{\sqrt \pi }} \cdot \int\limits_{ - \infty }^\infty {e^{ - x^2 } dx} = 1. But what if I can normalize it a different way say integrate the function \int\limits_{ - \infty }^\infty {e^{ - x^2 } dx} = \sqrt \pi and place the result of that integral in the parentheses beside the variable P(x) = e^{ - (\sqrt \pi \cdot x)^2 } = e^{ - \pi \cdot x^2 } instead of in front of the function. If you now integrate the function \int\limits_{ - \infty }^\infty {e^{ - \pi \cdot x^2 } dx} = 1 it still is equal to one. So my question is which PDF do I use? I have normalized the same function two different ways. Any thoughts on this?