| New Reply |
Need Advice on Spatial Statistics for a Lattice |
Share Thread | Thread Tools |
| Jun6-12, 11:10 PM | #1 |
|
|
Need Advice on Spatial Statistics for a Lattice
My professor gave me data points representing living cells on a rectangular plane and was told to analyze their spatial pattern, i.e., do the date points on the finite plane have a tendency to be clustered, random, or dispersed. I successfully accomplished this for points on a continuous plane using the nearest-neighbor (NN) method but now he wants it done for points on a regular 2D lattice!
I know I cannot use the NN approach as it's not well suited for discrete space. Does some know of an equivalent method to NN but for a discrete lattice? I, do not know spatial statistics all that well and I have been searching books and the web for an appropriate method without success. Either what I read is too technical or I am mistaking what the methods can be used for. Any suggested would help a great deal. Thank you |
| Jun6-12, 11:49 PM | #2 |
|
|
I'm pretty much just guessing here, but do you think a k-means algorithm would work well with a lattice underpinning? It's the most geometrical method I know of, and perhaps that would make it useful.
|
| Jun7-12, 12:33 AM | #3 |
|
|
You should consider methods that use metrics on lattices. Things like the Manhattan metric for one. Consider these metrics in the context of general techniques rather than having to select specific techniques for lattices: you will find that using different metrics may be able to be used with conventional techniques. You could also apply this metric to things like k-means as suggested already in this thread. |
| Jun7-12, 03:44 AM | #4 |
|
|
Need Advice on Spatial Statistics for a LatticeWhat strikes me the most is the fact that your professor asks you to solve something but it seems he gives you no clue to how, is that common in your school/country? In my case the professor would explain a problem, a group of solutions for it, and then we would exercise on that. So your professor gave you the data points and loose you all in the wild to solve the problem? Anyway, it seems to me NN is suited for your problem whether you discretize the space of not, for instance, you haven't said how fine this discretization should be; if you make the lattice fine enough you wouldn't have any difference analyzing that discretization as if it was a continuum. So I am going to guess now; could it be that your professor wants you to discretize the rectangular plane so that you use a Poisson distribution to test the randomness? Let me explain, NN is a better method to test randomness of points in a space than using a Poisson, but since you mention you are working with living cells I can picture the following scenario: There is a biologist using a microscope and looking at living cells distributed in a rectangular area divided into squares. Then the biologist counts the number of living cells per square but he/she has no accurate way to measure the distance between cells.... Now, in this scenario, for obvious reasons, you cannot use a NN method, but you can resort to the fact that the number of cells per square must follow a Poisson distribution if the cells are randomly distributed. Therefore, by using one of the many goodness of fit distribution test for the Poisson (e.g χ2) you can decide how randomly distributed the cells are. I know I am doing a lot of Sherlock guessing work here but, could it be this is what you want/need? |
| Jun7-12, 08:30 AM | #5 |
|
Recognitions:
|
Paradise Lost,
If your professor is a professor of statistics and you have been given a mathematical challenge then you can concentrate on lattices. If your professor is a professor of biology and you have been given a real world problem then you should think about the real world aspects of the data. For example, one reason that a real world problem might produce data on a lattice is that the real world cells are distributed on a continuous plane but when a data reducer measures them, he uses a slide with a grid and assigns the location of each cell to be the coodinates of the grid square that contains most of the cell. Or perhaps the data comes from some automatic image recognition process that implements a similar method. There is also the question of what "randomly distributed" means in a real world problem. For example, suppose cells are grown on a culture dish. It may be that what an experimenter considers randomly distributed growth is more orderly than mathematically random distribution. |
| Jun9-12, 07:25 PM | #6 |
|
|
Thank you, thus far, to everyone responding. I still haven't found an even close analogous process for analyzing data on a discrete lattice.
To clarify, the data are biological cells represented by values on actual lattice points and not within square areas--the data is laterally discrete. Some points do not have cells (so their weight is zero) while others do, or might have multiple copies per grid point (so those lattice points will have more weight). Suppose the lattice has a plane area of 1000X1000. Now I desire to analyze the distribution of the cells and see whether they tend to form 1) clusters, 2) are randomly distributed, or 3) dispersed. Using the standard nearest neighbor (NN) method developed for a continuous plane for this lattice would only work if the lattice spacing was sufficiently small to approximate the continuous case. viraltux, noted an earlier suspicion of mine that the lattice spacing needs to be fine but arbitrarily changing my unit lattice distance presents problems with the NN derivation since the notion of unit distance changed. E.g., too fine of a lattice will make (from the perspective of NN) it look like the whole lattice is clustering. Anyway, the literature even says NN is typically not used for lattices. theorem4.5.9, thanks for the suggestion of the k-means algorithm. I have been trying to understand it and see if it can be used but I am running into the same problem of no clear examples or discussions that I have been having with textbooks on spatial analysis. I just can't cut through the abstractness of their discussions. I will continue reading up on it. chiro, I thought about using a NN-like analysis using a taxicab metric but deriving a complete spatial randomness (CSR) model based on such a metric is proving exceedingly hard to do. I can't find anything in the literature about it. I'm still searching for other methods that use taxicab geometry. The thing I might use, but with only partial success is viraltux's suggestion of analyzing the number of cells per lattice point site, assuming a Poisson distribution and then testing to see if the cells are distributed randomly by taking it to be the null hypothesis and applying χ2 test to see if I can reject it. The one failure of this test is that if I reject the null (random distribution via Poisson process) then I can't really claim the cells are distributed in any other particular way, no? In other words, all I can do is say that the cells are not randomly distributed, but I CAN'T say if they are then clustered or dispersed. Any further insight is welcomed. Thank you... |
| Jun9-12, 08:48 PM | #7 |
|
Recognitions:
|
Or is this not real biological data? - just a conceptual model for biological data? |
| Jun9-12, 10:10 PM | #8 |
|
|
They can be biological cells, christmas trees, or anything else one desires. It's kind of like the game Battleship. Suppose someone took an ariel snapshot of the game and wanted to know if the battleships on the grid are clustered, dispersed or randomly distributed. |
| Jun10-12, 06:59 AM | #9 |
|
|
OK, then our problem is about what kind of non-randomness is present if any, right? Checking for randomness is a tricky thing, strictly speaking, even if the lattice points passes the Poisson test, we could not still guarantee the randomness; we might also want to check for autocorrelations and other non-random behavior, so keep this in mind. Anyway, in this example we only care about disperse vs clustered kind of non-randomness and, luckily, autocorrelations are sensitive to clusters and insensitive to dispersion, so you can use the autocorrelations to decide the kind of non-randomness you are dealing with. Summarizing, a simple approach could be:
|
| Jun10-12, 02:14 PM | #10 |
|
|
E.g., if no cells exist, than at that lattice it's 0; if there is one cell, the value is 1; etc... If so, I think that should solve it. 1) Is this a corrected assumption? 2) Should I use "Moran's I" or "Geary’s C" as the autocorrelation method? Thank you viraltux, you have been a big help. |
| Jun10-12, 02:56 PM | #11 |
|
|
1) Yeah, if no cells then 0, and then the number of cells per grid point; this should work just fine. 2) They both are similar, my advice is that you try them both and choose the one that fits better in your problem, you can actually use them both; there are many different test you can use for testing randomness, it is for you to decide when you have the right collection for your needs though any of these two methods should work just fine. You're welcome
|
| New Reply |
| Thread Tools | |
Similar Threads for: Need Advice on Spatial Statistics for a Lattice
|
||||
| Thread | Forum | Replies | ||
| primitive lattice vectors, reciprocal lattice, wigner seitz cell | Atomic, Solid State, Comp. Physics | 1 | ||
| spatial statistics - Point process on a network of one-dimensional lines | Set Theory, Logic, Probability, Statistics | 7 | ||
| Statistics problem, need advice! | Calculus & Beyond Homework | 0 | ||
| Could you advice a useful internet source in statistics? | Set Theory, Logic, Probability, Statistics | 2 | ||
| One-dimensional monatomic chain; square lattice; hcp close packed 2D lattice | Advanced Physics Homework | 0 | ||