Automatic pattern recognition and data modeling

AI Thread Summary
The discussion centers on the potential for computers to recognize patterns in data sets and fit equations to model those patterns. The idea is that such systems could revolutionize fields like stock forecasting, weather prediction, sales analysis, and air traffic control. While current technologies, such as YouTube's content recognition, utilize pattern recognition for specific applications, the conversation highlights the challenge of accurately determining the best model for a given data set without additional context. It is noted that existing mathematical tools, like MATLAB, offer data fitting and smoothing models, but the complexity of data relationships means that a computer cannot definitively identify a linear relationship without further constraints. The discussion also mentions advanced techniques in pattern recognition, including artificial neural networks and Lowess curve fitting, which employ multiple equations for better accuracy across data ranges.
moonman239
Messages
276
Reaction score
0
I can't help but think that some day, someone will figure out a way to get a computer to recognize patterns in a given set of data, and fit an equation, if there is any, to that set.

Such a system could be used in areas like forecasting stocks, weather, and sales, or even in air traffic control, which would be by far the coolest use of such a system.
 
Technology news on Phys.org
youtube can already "recognize" some video or audio patterns to search for "matched 3rd party content" in submitted videos, mostly used to include ad's on videos where some of the profit for the ad goes to the "3rd party content" copyright owner.
 
rcgldr said:
youtube can already "recognize" some video or audio patterns to search for "matched 3rd party content" in submitted videos, mostly used to include ad's on videos where some of the profit for the ad goes to the "3rd party content" copyright owner.

I see. However, I'm talking about having a computer look at a data set such as:

2,4.2,6,8.5

and figure out that the data can be best modeled by a linear equation.
 
moonman239 said:
I see. However, I'm talking about having a computer look at a data set such as: 2,4.2,6,8.5 and figure out that the data can be best modeled by a linear equation.
Doesn't MATLAB and/or some other mathematical tools have a data fitting / smoothing model that at least suggests what type of equation would be best to do this?
 
moonman239 said:
I see. However, I'm talking about having a computer look at a data set such as:

2,4.2,6,8.5

and figure out that the data can be best modeled by a linear equation.

A computer can't do that for precisely the same reason that a human can't: it's impossible to unambiguously identify that as evidence of a linear relationship unless you supply more information to constrain the model selection. In fact, if you try to fit functions to that particular set of data you'll find that a cubic equation is a better fit than a linear equation.

As to your original question, lots has already been done: artificial neural nets, support vector machines, kernel methods more generally, and many others are all in widespread use in pattern recognition.
 
There is Lowess curve fitting that with cross-validation looks at data and does a curve fit by gluing together whatever fits the best range-wise, in other words, it does not use a single equation, but several for a best fit everywhere.
 
Dear Peeps I have posted a few questions about programing on this sectio of the PF forum. I want to ask you veterans how you folks learn program in assembly and about computer architecture for the x86 family. In addition to finish learning C, I am also reading the book From bits to Gates to C and Beyond. In the book, it uses the mini LC3 assembly language. I also have books on assembly programming and computer architecture. The few famous ones i have are Computer Organization and...
I had a Microsoft Technical interview this past Friday, the question I was asked was this : How do you find the middle value for a dataset that is too big to fit in RAM? I was not able to figure this out during the interview, but I have been look in this all weekend and I read something online that said it can be done at O(N) using something called the counting sort histogram algorithm ( I did not learn that in my advanced data structures and algorithms class). I have watched some youtube...
Back
Top