How can I use a statistical approach to match patterns in a 3x3 grid?

Click For Summary
To statistically match patterns in a 3x3 grid, one can use a distance formula to quantify how closely a candidate pattern aligns with a reference pattern. The distance is calculated by flattening the matrix into a one-dimensional array and applying the Euclidean distance formula. A distance of zero indicates an exact match, while nonzero distances allow for comparison, with the closest match being the one with the smallest distance. This approach is particularly useful in applications like optical character recognition, where accurate pattern identification is crucial. Implementing this method can enhance the accuracy of matching sequences based on training data.
squaremeplz
Messages
114
Reaction score
0

Homework Statement



If I have a 3*3 grid, or 3*3 matrix, which records clicked points.
I.e.

Pattern T =

[1,1] = 1
[1,2] = 1
[1,3] = 1
[2,1] = -1
[2,2] = 1
[2,3] = -1
[3,1] = -1
[3,2] = 1
[3,3] = -1

or

1 1 1
-1 1 -1
-1 1 -1

What is the best way to statistically match x = 1 vs x = not 1
 
Last edited by a moderator:
Physics news on Phys.org
squaremeplease said:

Homework Statement



If I have a 3*3 grid, or 3*3 matrix, which records clicked points.
I.e.

Pattern T =

[1,1] = 1
[1,2] = 1
[1,3] = 1
[2,1] = -1
[2,2] = 1
[2,3] = -1
[3,1] = -1
[3,2] = 1
[3,3] = -1

or

1 1 1
-1 1 -1
-1 1 -1

What is the best way to statistically match x = 1 vs x = not 1

What do you mean "statistically match"?
 
Last edited:
I.e.

A new sequence is entered and we wish to identify group membership to T or C based on sample data for T and C. SInce T and C won't necessarily consist of 2 example database( 1 for each) but rather 1000 for T and 1000 C examples, how does one accurately calculate a match?
 
Last edited by a moderator:
I still don't have a clue what you're trying to do.
 
Last edited:
My question is related to optical character recognition. So if I draw a T, how do I match it to T that accurately reflects the entire training data?

I.e. if I have pattern T1, pattern T2, .., are the sequences for n T's.
C1, C2,..,CN is C training points for n C's.

If I feed a single C sequence in now, how do I weigh my decision most accurately?
 
Last edited by a moderator:
OK, now I understand. At the risk of oversimplification, let me define T this way:
Pattern T =

1 1 1
0 1 0
0 1 0

To match this pattern, a candidate pattern A should have a distance of 0 from this pattern, with distance calculated as
\sqrt{(a_0 - t_0)^2 + (a_1 - t_1)^2 + (a_2 - t_2)^2 + ... + (a_8 - t_8)^2}

In this formula I have flattened out your matrix to a one-dimensional array. The ti values are from pattern T, and the ai values are from the candidate pattern A.

If you get a "distance" of 0, the two patterns match exactly. If you have several candidate patterns with nonzero distances, pick the one with the smaller "distance."

That's how I would approach it.
 
Question: A clock's minute hand has length 4 and its hour hand has length 3. What is the distance between the tips at the moment when it is increasing most rapidly?(Putnam Exam Question) Answer: Making assumption that both the hands moves at constant angular velocities, the answer is ## \sqrt{7} .## But don't you think this assumption is somewhat doubtful and wrong?

Similar threads

  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 20 ·
Replies
20
Views
5K
  • · Replies 17 ·
Replies
17
Views
1K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 3 ·
Replies
3
Views
4K
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
8K
Replies
1
Views
2K
Replies
4
Views
2K
Replies
3
Views
4K