Restricted Boltzmann machine uniqueness

Click For Summary
SUMMARY

This discussion centers on the uniqueness of distributions modeled by restricted Boltzmann machines (RBMs). The mathematical formulation provided includes the equations for the probability distribution, where biases and weights play critical roles. A participant suggests that by setting certain biases to zero, alternative configurations can yield the same distribution, raising questions about the constraints of RBM configurations. The conversation also touches on technical issues related to rendering mathematical expressions in the forum's new software version.

PREREQUISITES
  • Understanding of restricted Boltzmann machines (RBMs)
  • Familiarity with probability distributions and their mathematical representations
  • Knowledge of biases and weights in neural networks
  • Basic proficiency in LaTeX for mathematical notation
NEXT STEPS
  • Research the mathematical properties of restricted Boltzmann machines
  • Explore the implications of bias and weight configurations on RBM outputs
  • Learn about the uniqueness of distributions in generative models
  • Investigate the use of LaTeX in online forums for mathematical expressions
USEFUL FOR

Students and researchers in machine learning, particularly those focusing on generative models and restricted Boltzmann machines, as well as developers working with mathematical rendering in web applications.

Jufa
Messages
101
Reaction score
15
I am dealing with restricted Boltzmann machines to model distributuins in my final degree project and some question has come to my mind.
A restricted Boltzmann machine with v visible binary neurons and h hidden neurons models a distribution in the following manner:

## f_i= e^{ \sum_k b[k] \sigma^i[k] + \sum_s \log(c[ s ] + e^{\sum_k w[ s ][ k ] b[ k ] })} ##
## Z = \sum_i f_i ##
## p_i = f_i/Z ##
Where b[ k ] and c[ s ] are, respectively, the k-th and the s-th bias of, again respectively, the visible and hidden layer.
w[ s ][k] is the component s, k of the weight matrix of the network.
"i" here refers to a certain binary vector with components ##\sigma^i[k]##.
My question is:
Given a certain restricted Boltzmann machine (i.e. a certain set of biases and weights) that models a certain distribution ##p_i##, is it possible to find another configuration (i.e. a different set of parameters and weights) such that it gives the same distribution?
Thanks in advance.
 
Last edited by a moderator:
Physics news on Phys.org
If some moderator can explain why my text is strikethrough I would appreciate it.
 
We are attempting to fix the problem. I got rid of the strike through text but now the latex Mathjax isn’t rendering. We moved to a new version of forum software with a new post editor and perhaps it has injected or requires some special way to enter math expressions.
 
Testing mathjax ##e=mc^2## now And ##2+2=4## in base 10.
 
I think I have reset the corrected original version. Please refresh your browsers.

Hint: Do not use ,,,. If you do not "cheat" as I did here, the interpreter takes it for the command tag BEGIN bold / underline / strikeout / italic.
 
  • Like
Likes   Reactions: Jufa and jedishrfu
Many thanks!
 
  • Like
Likes   Reactions: jedishrfu
Jufa said:
I am dealing with restricted Boltzmann machines to model distributuins in my final degree project and some question has come to my mind.
A restricted Boltzmann machine with v visible binary neurons and h hidden neurons models a distribution in the following manner:

## f_i= e^{ \sum_k b[k] \sigma^i[k] + \sum_s \log(c[ s ] + e^{\sum_k w[ s ][ k ] b[ k ] })} ##
## Z = \sum_i f_i ##
## p_i = f_i/Z ##
Where b[ k ] and c[ s ] are, respectively, the k-th and the s-th bias of, again respectively, the visible and hidden layer.
w[ s ][k] is the component s, k of the weight matrix of the network.
"i" here refers to a certain binary vector with components ##\sigma^i[k]##.
My question is:
Given a certain restricted Boltzmann machine (i.e. a certain set of biases and weights) that models a certain distribution ##p_i##, is it possible to find another configuration (i.e. a different set of parameters and weights) such that it gives the same distribution?
Thanks in advance.
I’m not very familiar with the topic, but trivially, if you set ##c[ s]=0##, then you get the same ##f_i## by exchanging the ##w[ s][k]## and the ##\sigma^i[k]##. So for instance,
$$f_{i,1}=\exp{\left(\sum_k{b[k]\sigma_1^i[k]}+\sum_s{ \log{e^{\sum_k{w_1[ s][k]b[k]}}}}\right)}$$
$$f_{i,2}=\exp{\left(\sum_k{b[k]\sigma_2^i[k]}+\sum_s{ \log{e^{\sum_k{w_2[ s][k]b[k]}}}}\right)}$$
Then let ##\sigma_1^i[k]=w_2[ s][k]## and ##\sigma_2^i[k]=w_1[ s][k]##.
That gives you the same probability distributions but I don’t know if that’s allowed with restricted Boltzmann machines.
 
Last edited:

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 26 ·
Replies
26
Views
5K
  • · Replies 49 ·
2
Replies
49
Views
12K
  • · Replies 1 ·
Replies
1
Views
14K
  • · Replies 42 ·
2
Replies
42
Views
11K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 175 ·
6
Replies
175
Views
27K
  • · Replies 11 ·
Replies
11
Views
4K