pawelch
- 10
- 0
Hi all,
I am trying to devise a mathematical model for my project I am working at. Description is as follows:
we have a sample space
<br /> \Omega=\{w_1,w_2,\cdots, w_N\}<br />
It is very large. Suppose further, that we have some assumption of frequency of occurrence of each w_i , stored in probability vector \pi .
In general, suppose we observe occurrences of w_i , stored as a sequence \{S\} . Now, we would like to compare \{S\} against \pi .
One of the solution would be application of Kullback–Leibler divergence. However, the problem is that |S| < |\Omega| (by || I mean cardinality) and as a result, we will observe that for some w_i \in \Omega , we have corresponding w_s \in S that are 0 (it stems from the fact that \{S\} has not managed to explore \Omega throughout). In this case we have undefined element 0\cdot \log \frac{0}{s_i} = \infty .
In principal, I generate multiple \{S\} that are of two types, say, \{1,0\} . The underlying concept for my project is that \{S\}_1 of the first type will hit w_i that posses high frequency in \Omega , on the contrary \{S\}_2 of the second type would generate w_i that have low frequency. And, it is unlikely that |S| = |\Omega|
Thus, again, I thought I would compare \{S\}_1 against \pi and then \{S\}_2 against \pi and observe the differences. But, because of the assumption of 0\cdot \log \frac{0}{s_i} = \infty , I could not get it right. Thus, I thought that maybe I "normalise" (i.e. shrink) \Omega, so that the new \Omega contains only elements that have occurred in \{S\}. but I have been told it is not a good idea either.
So hmm.. well, the question is how should I compare both types \{S\}_1 and \{S\}_2 against \pi if their are of different length ?
Thank you for any suggestions, and accept my apology for poor mathematical language of this description,
cheers!
I am trying to devise a mathematical model for my project I am working at. Description is as follows:
we have a sample space
<br /> \Omega=\{w_1,w_2,\cdots, w_N\}<br />
It is very large. Suppose further, that we have some assumption of frequency of occurrence of each w_i , stored in probability vector \pi .
In general, suppose we observe occurrences of w_i , stored as a sequence \{S\} . Now, we would like to compare \{S\} against \pi .
One of the solution would be application of Kullback–Leibler divergence. However, the problem is that |S| < |\Omega| (by || I mean cardinality) and as a result, we will observe that for some w_i \in \Omega , we have corresponding w_s \in S that are 0 (it stems from the fact that \{S\} has not managed to explore \Omega throughout). In this case we have undefined element 0\cdot \log \frac{0}{s_i} = \infty .
In principal, I generate multiple \{S\} that are of two types, say, \{1,0\} . The underlying concept for my project is that \{S\}_1 of the first type will hit w_i that posses high frequency in \Omega , on the contrary \{S\}_2 of the second type would generate w_i that have low frequency. And, it is unlikely that |S| = |\Omega|
Thus, again, I thought I would compare \{S\}_1 against \pi and then \{S\}_2 against \pi and observe the differences. But, because of the assumption of 0\cdot \log \frac{0}{s_i} = \infty , I could not get it right. Thus, I thought that maybe I "normalise" (i.e. shrink) \Omega, so that the new \Omega contains only elements that have occurred in \{S\}. but I have been told it is not a good idea either.
So hmm.. well, the question is how should I compare both types \{S\}_1 and \{S\}_2 against \pi if their are of different length ?
Thank you for any suggestions, and accept my apology for poor mathematical language of this description,
cheers!