I am trying to create a simple implementation of the Bayes decision rule with minimum error criterion and I am running into a problem. Specifically, if I have a data set consisting of a number of feature vectors stored in rows, how can I generate a probability density function from this data?

Also, how can I do this if some of the data is discrete, some is continuous, and some is missing? For example, let us assume each feature vector, x, has three elements.

x = [ a, b, c]

where:

a is categorical data and will be an element of the set {0, 1, 2, 3}

b is continuous data and will be in the range [0,1]

c is also continuous data in the range [0,1], but may be missing for some feature vectors

I want to be able to calculate the likelihood of a feature vector, x, based on the total data set or given that x is from a subset, w, of the total data set.

p(x) = ? and p(x|w) = ?
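
To make the question concrete, here is a minimal sketch of one possible estimator, not a definitive answer. It assumes the three features are independent (conditionally independent given w, as in naive Bayes), which the data may not satisfy. The categorical feature a is estimated by Laplace-smoothed relative frequencies, and b and c by a hand-written Gaussian kernel density estimate. A missing c is stored as `None` and its factor is left out, which amounts to integrating c out. The function names, the bandwidth of 0.1, the toy data, and the subset labels "w1" and "w2" are all hypothetical. The sketch also assumes at least one row has an observed c, and it ignores the fact that a Gaussian kernel puts some mass outside [0,1].

```python
import math

def categorical_pmf(values, categories, alpha=1.0):
    """Laplace-smoothed relative frequencies for a categorical feature."""
    n, k = len(values), len(categories)
    return {c: (sum(1 for v in values if v == c) + alpha) / (n + alpha * k)
            for c in categories}

def gaussian_kde(samples, bandwidth):
    """Return a function t -> Gaussian kernel density estimate at t."""
    samples = list(samples)
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))
    def density(t):
        return norm * sum(math.exp(-0.5 * ((t - s) / bandwidth) ** 2)
                          for s in samples)
    return density

def fit_density(rows, categories=(0, 1, 2, 3), bandwidth=0.1):
    """Fit p(x) = P(a) * p(b) * p(c) from rows [a, b, c]; c may be None."""
    pmf_a = categorical_pmf([r[0] for r in rows], categories)
    kde_b = gaussian_kde([r[1] for r in rows], bandwidth)
    # Only rows with an observed c contribute to the estimate for c.
    kde_c = gaussian_kde([r[2] for r in rows if r[2] is not None], bandwidth)
    def p(x):
        a, b, c = x
        value = pmf_a[a] * kde_b(b)
        if c is not None:
            value *= kde_c(c)  # if c is missing, its factor is omitted
        return value
    return p

# Hypothetical toy data: rows are [a, b, c], with None marking a missing c.
data = [
    [0, 0.10, 0.80],
    [1, 0.35, None],
    [0, 0.20, 0.75],
    [3, 0.90, 0.10],
    [2, 0.85, None],
    [3, 0.95, 0.20],
]
labels = ["w1", "w1", "w1", "w2", "w2", "w2"]

p_x = fit_density(data)  # p(x) from the whole data set
p_x_given_w1 = fit_density([r for r, w in zip(data, labels) if w == "w1"])
p_x_given_w2 = fit_density([r for r, w in zip(data, labels) if w == "w2"])

x = [0, 0.15, None]
print(p_x(x), p_x_given_w1(x), p_x_given_w2(x))
```

When c is missing, the returned value is a density over (a, b) only. That is still usable in the decision rule, because the same x is scored under every subset w, but such values should not be compared with those of vectors where c is present.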

I have also posted this on Stack Exchange Mathematics, here:

http://math.stackexchange.com/quest...sity-function-from-a-set-of-multivariate-data

I would really appreciate it if someone could help me out or point me in the right direction!

**Physics Forums - The Fusion of Science and Community**


# Generating a probability density function


