Basic notation (conditional probability delim in linear equation)

AI Thread Summary
The discussion centers on the use of conditional probability notation within linear functions, specifically in the context of Bayesian prior distributions over polynomial coefficients in curve fitting, as illustrated in Bishop's "Pattern Recognition and Machine Learning." Participants explore the use of different delimiters, such as semicolons and vertical bars, to clarify the distinction between variables and parameters in probability expressions. The conversation also touches on the Bayesian expression for conditional probability and its derivation, emphasizing the relationship between likelihood, prior, and marginal likelihood. Overall, the thread seeks to clarify the notation and its implications in Bayesian statistics. Understanding these notations is crucial for accurately interpreting Bayesian models in machine learning contexts.
dspiegel
Messages
3
Reaction score
0
Hey all.

Looking at "Pattern Recognition and Machine Learning" (Bishop, 2006) p28-31, the author appears to be using what would ordinarily be a delimiter for a conditional probability inside a linear function. See the first variable in normpdf as below. This is in the context of defining a Bayesian prior distribution over polynomial coefficients in a curve fitting problem.

p(\textbf{w} | \alpha) = NormPDF(\textbf{w} | \textbf{0}, \alpha^{-1}\textbf{I}) = \left(\frac{\alpha}{2\pi}\right)^{(M+1)/2} exp \left(-\frac{\alpha}{2}\textbf{w}^T\textbf{w}\right)

Can anybody shine some light on this for me please?

Many thanks.
 
Physics news on Phys.org
I don't know if this is precisely the case here, but sometimes delimiters other than comma are used in functions. I have mostly seen semicolons (;) and vertical bars (|).
Often this is done to separate arguments by meaning. For example, an author may write
Consider a normal distribution with mean \mu and standard deviation \sigma. We define the probability of finding a value between a and b as P(a, b \mid \mu, \sigma) as ...
You can just as well write P(x, \mu, \sigma). However, writing a separate delimiter hopefully makes it more clear to the reader that a and b are really the variables here and, though technically mu and sigma are variables as well, in this case they are more like parameters that have been previously fixed (some arbitrary values for some normal distribution we are interested in).
 
CompuChip said:
I don't know if this is precisely the case here, but sometimes delimiters other than comma are used in functions. I have mostly seen semicolons (;) and vertical bars (|).
Often this is done to separate arguments by meaning. For example, an author may write

You can just as well write P(x, \mu, \sigma). However, writing a separate delimiter hopefully makes it more clear to the reader that a and b are really the variables here and, though technically mu and sigma are variables as well, in this case they are more like parameters that have been previously fixed (some arbitrary values for some normal distribution we are interested in).

Thanks for your reply.

Although I am quite sure that's not the case in this particular instance, in general, I know non-variable parameters may be written after a semicolon.

I believe the case to be that it reads as, "the value of t_n evaluated for y(x_n,\textbf{w})" as described on http://en.wikipedia.org/wiki/Vertical_bar#Mathematics".

Elsewhere the likelihood of the parameters \{w,\beta\} is written for two i.i.d. variables \{\textbf{x,t}\} where the function y(x,w) computes the predicted value of t.

p(\textbf{t}|\textbf{x},w,\beta) = \prod_{n=1}^N NormPDF(t_n|y(x_n, \textbf{w}),\beta^{-1})

2sLvo.png


So it seams a reasonable interpretation in this context.
 
Last edited by a moderator:
dspiegel said:
Hey all.

This is in the context of defining a Bayesian prior distribution over polynomial coefficients in a curve fitting problem.

p(\textbf{w} | \alpha) = NormPDF(\textbf{w} | \textbf{0}, \alpha^{-1}\textbf{I}) = \left(\frac{\alpha}{2\pi}\right)^{(M+1)/2} exp \left(-\frac{\alpha}{2}\textbf{w}^T\textbf{w}\right)

Can anybody shine some light on this for me please?

Many thanks.

I don't know what this is. The Bayesian expression for the conditional probability p(w|a) is:

p(w|a)=p(a|w)p(w)/p(a).
 
SW VandeCarr said:
I don't know what this is. The Bayesian expression for the conditional probability p(w|a) is:

p(w|a)=p(a|w)p(w)/p(a).

Well there're a bit more to it. The formula you quoted is just for the prior.

The derivation is thus.

p(w|x,t,\alpha,\beta) = (likelihood * prior) / marginal likelihood

p(w|x,t,\alpha,\beta) \propto p(t|x,w,\beta) * p(w|\alpha)

\{\alpha,\beta\} are hyperparameters.
 
dspiegel said:
Well there're a bit more to it. The formula you quoted is just for the prior.

The derivation is thus.

p(w|x,t,\alpha,\beta) = (likelihood * prior) / marginal likelihood

p(w|x,t,\alpha,\beta) \propto p(t|x,w,\beta) * p(w|\alpha)

\{\alpha,\beta\} are hyperparameters.

OK. I was going by the original equation where the left side was simply p(w|\alpha)=
 
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...

Similar threads

Replies
1
Views
2K
Replies
4
Views
2K
Replies
42
Views
10K
3
Replies
114
Views
11K
Replies
2
Views
10K
Replies
3
Views
2K
Back
Top