Spectral Graph Clustering: where does the 'scoring' function come from

Click For Summary
SUMMARY

The discussion centers on the scoring function in spectral graph clustering, specifically the formula \mathbf{w}_n ^T A \mathbf{w}, where A represents the affinity matrix of an undirected graph. The formula calculates the total weight of connections among nodes within a cluster, emphasizing that larger values indicate stronger clustering. The scaling requirement \mathbf{w}_n ^T \mathbf{w}_n = 1 leads to the derivation of the eigenvalue equation, which is crucial for understanding the clustering process.

PREREQUISITES
  • Understanding of spectral graph theory
  • Familiarity with eigenvalues and eigenvectors
  • Knowledge of affinity matrices in graph theory
  • Basic concepts of clustering algorithms
NEXT STEPS
  • Study the derivation of the eigenvalue equation in spectral clustering
  • Explore the properties of affinity matrices in graph representation
  • Learn about different clustering algorithms and their applications
  • Investigate the role of scaling in clustering functions
USEFUL FOR

Researchers, data scientists, and machine learning practitioners interested in advanced clustering techniques and the mathematical foundations of spectral graph clustering.

Master1022
Messages
590
Reaction score
116
Homework Statement
For an undirected graph with affinity matrix (i.e. weighted adjacency matrix) ##A##, how can we determine a good cluster using eigenvalues and eigenvectors.
Relevant Equations
Eigenvectors and eigenvalues
Hi,

I was reading through some slides about graph clustering. In the slides was a very short discussion about 'eigenvectors and segmentation'. I don't quite understand where one of the formulae comes from.

Context: We have some undirected graph with an affinity matrix (i.e. weighted adjacency matrix) ##A##. The slide poses the question: What is a good cluster?
It then says: "The element associated with the cluster should have large values connecting one another in the affinity matrix." and then quotes the following formula:
\mathbf{w}_n ^T A \mathbf{w}
where it defines ##\mathbf{w}_{n} ^ T ## = association of element ##i## to cluster ##n##.
It then says we can impose a scaling requirement such that ## \mathbf{w}_n ^T \mathbf{w}_n = 1 ##. Putting these two equations together yields the familiar eigenvalue equation.

Question: Where does this first formula come from: ## \mathbf{w}_n ^T A \mathbf{w} ##? As much as I think about it, it doesn't really make much sense to me. I can understand the latter parts of the slide, so if I just move past my misunderstanding, then the rest falls into place. However, I am keen to understand what that 'score' is really calculating.
 
Physics news on Phys.org
You can write that as ##\sum_{I,j} w_i a_{ij} w_j##, which for every choice of vertices ##i## and ##j## adds the products of their weights in the cluster and how strong their edge in the graph is.
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
7K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 0 ·
Replies
0
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K