Restricted Boltzmann machine uniqueness

Click For Summary

Discussion Overview

The discussion revolves around the uniqueness of configurations in restricted Boltzmann machines (RBMs) and whether different sets of parameters can yield the same probability distribution. Participants explore theoretical aspects of RBMs in the context of a final degree project, focusing on the mathematical formulation and implications of biases and weights.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant questions if a given RBM configuration can be matched by another configuration that produces the same distribution.
  • Another participant suggests that setting certain biases to zero may allow for the exchange of weights and visible neuron states to achieve the same distribution, although they express uncertainty about the validity of this approach within the framework of RBMs.
  • Technical details about the mathematical formulation of RBMs are provided, including the definitions of biases, weights, and the resulting probability distributions.
  • There are ongoing technical issues related to the forum's post editor affecting the rendering of mathematical expressions.

Areas of Agreement / Disagreement

Participants do not reach a consensus on whether different configurations can yield the same distribution, and the discussion includes both exploratory reasoning and technical challenges without a definitive resolution.

Contextual Notes

There are unresolved technical issues regarding the forum's Mathjax rendering, which may affect the clarity of mathematical expressions presented in the discussion.

Jufa
Messages
101
Reaction score
15
I am dealing with restricted Boltzmann machines to model distributuins in my final degree project and some question has come to my mind.
A restricted Boltzmann machine with v visible binary neurons and h hidden neurons models a distribution in the following manner:

## f_i= e^{ \sum_k b[k] \sigma^i[k] + \sum_s \log(c[ s ] + e^{\sum_k w[ s ][ k ] b[ k ] })} ##
## Z = \sum_i f_i ##
## p_i = f_i/Z ##
Where b[ k ] and c[ s ] are, respectively, the k-th and the s-th bias of, again respectively, the visible and hidden layer.
w[ s ][k] is the component s, k of the weight matrix of the network.
"i" here refers to a certain binary vector with components ##\sigma^i[k]##.
My question is:
Given a certain restricted Boltzmann machine (i.e. a certain set of biases and weights) that models a certain distribution ##p_i##, is it possible to find another configuration (i.e. a different set of parameters and weights) such that it gives the same distribution?
Thanks in advance.
 
Last edited by a moderator:
Physics news on Phys.org
If some moderator can explain why my text is strikethrough I would appreciate it.
 
We are attempting to fix the problem. I got rid of the strike through text but now the latex Mathjax isn’t rendering. We moved to a new version of forum software with a new post editor and perhaps it has injected or requires some special way to enter math expressions.
 
Testing mathjax ##e=mc^2## now And ##2+2=4## in base 10.
 
I think I have reset the corrected original version. Please refresh your browsers.

Hint: Do not use ,,,. If you do not "cheat" as I did here, the interpreter takes it for the command tag BEGIN bold / underline / strikeout / italic.
 
  • Like
Likes   Reactions: Jufa and jedishrfu
Many thanks!
 
  • Like
Likes   Reactions: jedishrfu
Jufa said:
I am dealing with restricted Boltzmann machines to model distributuins in my final degree project and some question has come to my mind.
A restricted Boltzmann machine with v visible binary neurons and h hidden neurons models a distribution in the following manner:

## f_i= e^{ \sum_k b[k] \sigma^i[k] + \sum_s \log(c[ s ] + e^{\sum_k w[ s ][ k ] b[ k ] })} ##
## Z = \sum_i f_i ##
## p_i = f_i/Z ##
Where b[ k ] and c[ s ] are, respectively, the k-th and the s-th bias of, again respectively, the visible and hidden layer.
w[ s ][k] is the component s, k of the weight matrix of the network.
"i" here refers to a certain binary vector with components ##\sigma^i[k]##.
My question is:
Given a certain restricted Boltzmann machine (i.e. a certain set of biases and weights) that models a certain distribution ##p_i##, is it possible to find another configuration (i.e. a different set of parameters and weights) such that it gives the same distribution?
Thanks in advance.
I’m not very familiar with the topic, but trivially, if you set ##c[ s]=0##, then you get the same ##f_i## by exchanging the ##w[ s][k]## and the ##\sigma^i[k]##. So for instance,
$$f_{i,1}=\exp{\left(\sum_k{b[k]\sigma_1^i[k]}+\sum_s{ \log{e^{\sum_k{w_1[ s][k]b[k]}}}}\right)}$$
$$f_{i,2}=\exp{\left(\sum_k{b[k]\sigma_2^i[k]}+\sum_s{ \log{e^{\sum_k{w_2[ s][k]b[k]}}}}\right)}$$
Then let ##\sigma_1^i[k]=w_2[ s][k]## and ##\sigma_2^i[k]=w_1[ s][k]##.
That gives you the same probability distributions but I don’t know if that’s allowed with restricted Boltzmann machines.
 
Last edited:

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 26 ·
Replies
26
Views
6K
  • · Replies 49 ·
2
Replies
49
Views
12K
  • · Replies 1 ·
Replies
1
Views
14K
  • · Replies 42 ·
2
Replies
42
Views
11K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 175 ·
6
Replies
175
Views
27K
  • · Replies 11 ·
Replies
11
Views
4K