What is the Mathematics Behind Nearest Neighbour Analysis?

  • Context: Undergrad 
  • Thread starter Thread starter antevante
  • Start date Start date
  • Tags Tags
    Analysis
Click For Summary

Discussion Overview

The discussion centers around the mathematics of Nearest Neighbour Analysis (NNI), particularly its application in fields such as biology and geography to analyze the dispersion of entities like plants or shops. Participants explore the mathematical formulation, implications of the results, and the underlying concepts of clustering.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Andreas inquires about the mathematical workings of Nearest Neighbour Analysis, mentioning its relevance to clustering and dispersion.
  • AI suggests a connection between NNI and clustering, prompting a discussion on measuring distances to nearest neighbours and calculating mean distances.
  • Andreas provides a formula for NNI and expresses confusion about the significance of specific values, particularly why 2.15 is the maximum and why a value of 1 indicates random distribution.
  • AI proposes a reverse engineering approach to understand the implications of the NNI formula, suggesting that lower NNI values indicate higher clustering, but questions the statistical nature of the values 1 and 2.15.
  • Another participant explains that the upper limit of 2.15 arises from hexagonal spacing in a plane, which maximizes distances between neighbours, referencing a specific ecological study for further reading.

Areas of Agreement / Disagreement

Participants express uncertainty regarding the theoretical basis for the values of 1 and 2.15 in the context of NNI. There is no consensus on the interpretation of these values, and multiple viewpoints on their significance are presented.

Contextual Notes

The discussion includes assumptions about the nature of clustering and the statistical limits of NNI values, which remain unresolved. The mathematical derivation of the upper limit is referenced but not fully explored within the thread.

antevante
Messages
8
Reaction score
0
Hi!
Does anyone know how the mathematics behind the Nearest Neighbour Analysis/Index work?
It is used in biology and geography and shows the dispersion of for example plants or shoe-shops.
/Andreas
 
Physics news on Phys.org
i don't know but this sounds vaguely familiar with clustering ...
are u talking abt clustering ??

-- AI
 
Yes it is about clustering...
You have an area, A, in which you have a number of points, n. For every point you measure the distance to its nearest neighbour. Then you calculate the mean nearest neighbour distance, d.
Then you use the formula NNI=2d*square-root(n/A)
Values for NNI close to 0 means clustered distribution, around 1 random distribution and close to the maximum value 2,15 uniform distribution.
I do not understand why 2.15 is the largest value you can get and why a value of 1 indicates a random distribution.
Anyone that knows? I would bwe thakful if you helped me...
/Andreas
 
I have not done much of NNI ...
But as i see ur formula for NNI , i thought of doing a bit of reverse engineering ...

case > clustered points
If we set n and A as constant, then it may be shown that d = radius of a cluster
So it gets pretty intuitive, as to why if NNI -> 0 , would mean high clustering since NNI>0 means radii of cluster is reducing thereby increased clustering ...

Doing a bit more of this,
we may come to a conclusion that
NNI for cluster < NNI for random < NNI for uniform

However the values of 1 and 2.15 don't seem to come up anywhere throughout ...
So i feel they are statistical limits and not theoretical ones ...
I may be wrong , but i just thought if i am wrong , it may generate counter arguments ...

-- AI
 
The upper limit comes from an observation that in the plane hexagonal spacing (each point has six equidistant neighbours) maximizes the distance between neighbours for a given density. You can read about it in this reference:

"Distance to Nearest Neighbour as a Measure of Spatial Relationships in Populations"

Clark and Evans, Ecology, Vol 35. No 4, 1954.

They also refer to earlier work by Hertz in 1909. The appendix gives a derivation of this measure.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
41
Views
9K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
Replies
2
Views
3K