What are the Limitations of Machine Learning in Causal Analysis?

AI Thread Summary
Machine learning excels at making predictions but struggles with causal analysis, which requires human intuition and judgment. For instance, while an ML algorithm may identify a correlation between low socioeconomic status and diabetes, it cannot deduce the underlying causal mechanisms without human insight. The discussion highlights that human analysts can use deductive reasoning to explore potential intermediate variables, a capability that current AI lacks. Although AI may eventually surpass humans in specific tasks, it remains limited in understanding complex causal relationships. Overall, the consensus is that while AI can handle large datasets, it cannot replicate the nuanced reasoning required for thorough causal analysis.
FallenApple
From what I understand, machine learning is incredibly good at making predictions from data in a very automated/algorithmic way.

But any inference that deals with ideas of causality is primarily a subject-matter concern, which relies mostly on judgment calls and intuition.

So basically, a machine learning algorithm would need human level intelligence and intuition to be able to do proper causal analysis?

Here's an example where there might be issues.

Say, an ML algorithm finds that low socioeconomic status is associated with diabetes with a significant p value. We clearly know that diabetes is a biological phenomenon and that any possible (this is a big if) causal connection between a non-biological variable such as low SES and diabetes must logically have intermediate steps between the two variables within the causal chain. It is these unknown intermediate steps that probably should be investigated in follow-up studies. We logically know (or intuit from prior knowledge + domain knowledge) that low SES could lead to higher stress or an unhealthy diet, which are biological. So a significant p value for SES indicates that maybe we should collect data on those missing variables, and then redo the analysis with those in the model.

But there's no way a learning algorithm can make any of those connections because those deductions are mostly intuition and logic, which are not statistical. Not to mention, how would ML look at confounders?
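For concreteness, here is a minimal sketch of the workflow described above (spot an SES association, then collect the suspected intermediate variable and redo the analysis with it in the model). Everything in it is hypothetical: the data is simulated, the effect sizes are made up, and statsmodels is just one convenient fitting library.

```python
# Hypothetical illustration only: simulated data in which SES affects diabetes risk
# solely through a mediator (stress). Variable names and effect sizes are invented.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n = 5000
ses = rng.normal(size=n)                    # socioeconomic status (standardized)
stress = -0.6 * ses + rng.normal(size=n)    # suspected intermediate variable
risk = 0.5 * stress + rng.normal(size=n)    # outcome driven only by the mediator

# Model 1: the original analysis, outcome on SES alone.
m1 = sm.OLS(risk, sm.add_constant(ses)).fit()

# Model 2: after collecting stress data, redo the analysis with the mediator included.
m2 = sm.OLS(risk, sm.add_constant(np.column_stack([ses, stress]))).fit()

print(m1.params, m1.pvalues)   # SES looks "significant" on its own
print(m2.params, m2.pvalues)   # SES coefficient shrinks once stress is in the model
```

The step the code cannot take by itself is the one described above: deciding that stress, rather than any of countless other variables, is worth measuring in the first place.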
 
FallenApple said:
So basically, a machine learning algorithm would need human level intelligence and intuition to be able to do proper causal analysis?

Why do you think human level intelligence and intuition is capable of doing a proper causal analysis?

Human-level intelligence hasn't reached a consensus about the definition of causality yet. If a "proper causal analysis" is a concept known only to a particular person's intuition, then I agree that it takes a human being to know such a thing.
 
Stephen Tashi said:
Why do you think human level intelligence and intuition is capable of doing a proper causal analysis?

Human-level intelligence hasn't reached a consensus about the definition of causality yet. If a "proper causal analysis" is a concept known only to a particular person's intuition, then I agree that it takes a human being to know such a thing.

Humans can't do causal analysis perfectly, that's true. But we do have a better idea of what causality is, even if it's not perfectly defined. Also, humans narrow things down much better through deductive reasoning. In the example I gave, the algorithm wouldn't be able to narrow down what those latent variables are, simply because they might not have been considered in the first place, and hence are not in the data set. A human analyst would think, "Aha! Since SES is associated with diabetes, maybe low SES causes something (e.g. stress) that leads to diabetes, so in hindsight maybe we should collect data on that." So the results lead to new insights and avenues of investigation that were never thought of before. Essentially it takes detective work to do causal inference.

But if there already is data on every possible thing about diabetics (DNA, all biochemicals, etc.), and advanced learning algorithms that stably run models on millions of variables, then it is conceivable that an ML algorithm could get the answer blindly (or at least with subhuman intelligence) in one go without logical deduction. I'm not sure if this is mathematically possible, but if it is, then machines would beat humans at causal analysis.
 
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average. Similarly, even a below average human toddler can learn to speak any language with far less data than is available to computers attempting the same task.
 
Dale said:
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average. Similarly, even a below average human toddler can learn to speak any language with far less data than is available to computers attempting the same task.

True. One of the biggest hindrances to AI is pattern recognition, which machines can do only in very well-controlled settings. The fact that they can't switch tasks well implies that machines' intuition about things is basically nonexistent. However, they are phenomenal at rapid calculation, which means they can do conceptually easy but extensive tasks.
 
I'm not sure whether your question/statement relates to the AI we have now, or to what can eventually be achieved. I agree that what we have now is very limited, but I believe that someone can eventually build AI that will match the best human brains. I suspect that AI will be able to exceed HI, simply because it can already beat us in some tasks, so just add those to the HI skills when it acquires them. (Though that is rather like us using computers, so maybe it still counts as just our equal.)

My reason is simply that I am surrounded by machines doing all the things that AI can't do. For me the main goal of AI is not to replace these HI machines, but to understand how they work.

As you say, the sort of thinking you esteem - intuition, logic(?), judgement, deduction, experience, guesswork, prejudice, (I'm extending your list a bit!) , etc - may be outside the reach of current AI. So how are these machines (the humans) doing it? What is it that they can do, in concrete definable terms, that we haven't yet put into AI? Either we say, that is unknowable and psychologists are wasting their time, or our understanding of psychology will grow and we will incorporate it into AI.

If one believes in some magical ether in the human brain - gods, human spirit, animus, life, ... ? - then obviously only machines endowed with this stuff can do these ill-defined things. Otherwise, what is the reason, other than we don't know what they are, that we can't incorporate these skills into AI machines?

This is a psychological perspective and I think most people in AI are more in the engineering camp. So I expect AI to continue to get better in specialised tasks, using algorithms not particularly related to HI. Progress in HI may (?) usefully help get us over some of the bumps, but will we be that keen on AI systems when they start to display the same faults as HI systems? If driverless cars did get as good as human-driven ones, we'd still accept human error as, well, human, but computer error is another matter. How much better than HI will AI need to become?
 
Merlin3189 said:
As you say, the sort of thinking you esteem - intuition, logic(?), judgement, deduction, experience, guesswork, prejudice, (I'm extending your list a bit!) , etc - may be outside the reach of current AI. So how are these machines (the humans) doing it? What is it that they can do, in concrete definable terms, that we haven't yet put into AI? Either we say, that is unknowable and psychologists are wasting their time, or our understanding of psychology will grow and we will incorporate it into AI.

Whether humans will be able to create these types of thinking will likely depend on the actual complexity of those tasks in comparison to the tasks currently executable by AI. For example, feeling an emotion might seem easier than computing a complicated integral to a human, but it's just the opposite. Computing an integral is just the adding up of many smaller parts, few concepts needed. But an "emotion" or gut-feel intuition could involve much richer and more complex mathematical algorithms, with many interrelated concepts that we have not even thought of yet. It's possible that such ideas are so mathematically complex that even the smartest AI scientist/mathematician would never deduce the patterns, even though the patterns are happening in physical spacetime inside a biological machine. If all this is true, then I don't know if humans will ever figure this out, because the upper limit of human brain capacity is evolutionarily limited by the size of the birth canal and we probably need a mind far greater than Einstein's to really understand consciousness.

For simple repetitive tasks or tasks requiring simple low level concepts, AI will likely surpass humans at all of these, given enough training data.
 
FallenApple said:
It's possible that such ideas are so mathematically complex that even the smartest AI scientist/mathematician would never deduce the patterns,
Yes, that is a worry. It may be like turbulence: we'll get some ideas about it, extract some general principles, but maybe never get on top of the detail.
My own feeling about the brain is that its basic elements are really quite simple, but like the molecules of a fluid, when you get enough of them involved, even simple deterministic properties can lead to fundamentally unpredictable behaviour.
 
Dale said:
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average. Similarly, even a below average human toddler can learn to speak any language with far less data than is available to computers attempting the same task.

I agree that machines have a long way to go before reaching human level performance. But is it true that they have access to the same data as humans? For example, in addition to the 60 hours of experience a teenager needs to learn to drive, that teen already spent 16 years acquiring other sorts of data while growing up. Similarly, the toddler is able to crawl about and interact in the real world, which is a means of data acquisition the computers don't have.
 
  • #10
atyy said:
For example, in addition to the 60 hours of experience a teenager needs to learn to drive, that teen already spent 16 years acquiring other sorts of data while growing up
That is a good point, but I think that shows even more how amazing the human brain is at learning. It can take that general knowledge from walking and running and playing and use it to inform the ability to drive. I don't think data from walking would help a machine learn to drive.
 
  • #11
Dale said:
The machines that are learning the same task have millions of hours and are far from average.
I am probably missing something trivial, but millions of hours means hundreds of years. But we do not have such machines for that long. So how do they learn?
 
  • #12
atyy said:
I agree that machines have a long way to go before reaching human level performance. But is it true that they have access to the same data as humans? For example, in addition to the 60 hours of experience a teenager needs to learn to drive, that teen already spent 16 years acquiring other sorts of data while growing up. Similarly, the toddler is able to crawl about and interact in the real world, which is a means of data acquisition the computers don't have.
So perhaps we need an AI kindergarten, as proposed by my brother:
https://www.linkedin.com/pulse/ai-kindergarten-what-does-take-build-truly-machine-danko-nikolic
 
  • #13
I don't know much about this topic, but this is partly related and also somewhat amusing (this is quite a recent video):



The relevant part starts around 5 minutes or so. Though I think people mostly tend to think of programs versus humans only in the context of strategy games (talking about video games).

For arcade games, for example, there are already easy TAS (tool-assisted) runs for lots of games. But they are hardly any fun to watch at all (except to see the limits) compared to human replays/videos, because the fun part is the experience of hand-eye coordination, visual cognition, mechanical perfection, etc. Judgement is just one part of playing.

Actually something similar will apply to FPS and many other more action related genres.

In strategy games judgement seems to play a bigger part (as compared to other factors) so it is more amusing to see a program playing very well. And also fog of war (in RTS or derivative genres) tends to add a large element of imperfect information (and it is fun to see how a program handles that).

Demystifier said:
I am probably missing something trivial, but millions of hours means hundreds of years. But we do not have such machines for that long. So how do they learn?
I think (just from a layman's perspective) that's probably because, with raw computation power, they can replay the same scenarios over and over in a very short period of time.
 
  • #14
FallenApple said:
From what I understand, machine learning is incredibly good at making predictions from data in a very automated/algorithmic way.
But there's no way a learning algorithm can make any of those connections because those deductions are mostly intuition and logic, which are not statistical. Not to mention, how would ML look at confounders?

I think you're missing the point of how machine learning is being increasingly done today. Many (perhaps most) machine learning tools today are not algorithmic in nature. They use neural networks configured in ways similar to the human brain, and then train these networks with learning sets, just like a human is trained to recognize patterns. Even the (human) designer of the neural network doesn't know how the machine will respond to a given situation. Given this, I don't see why these artificial neural networks cannot match or eventually exceed human capability. Indeed, I think Google's facial recognition software is already exceeding human capability. Granted this is in a controlled environment, but given time and increasing complexity of the networks (and increasing input from the environment), I think you will see these machines able to do anything a human mind can do.
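As a rough illustration of the "configure a network, then train it on a learning set" idea (a toy sketch only, nothing like Google's systems; the architecture, data and hyperparameters here are arbitrary):

```python
# Toy sketch of training a small neural network on a labeled learning set.
# The behaviour comes from training, not from hand-written rules.
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2))                 # 200 two-feature examples
y = (X[:, 0] * X[:, 1] > 0).astype(float)     # label: both features share the same sign

W1, b1 = rng.normal(size=(2, 8)) * 0.5, np.zeros(8)
W2, b2 = rng.normal(size=(8, 1)) * 0.5, np.zeros(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for _ in range(2000):                         # plain batch gradient descent
    h = np.tanh(X @ W1 + b1)                  # hidden layer
    p = sigmoid(h @ W2 + b2).ravel()          # predicted probability
    grad_out = (p - y)[:, None] / len(X)      # cross-entropy gradient w.r.t. output logit
    W2 -= lr * h.T @ grad_out
    b2 -= lr * grad_out.sum(axis=0)
    grad_h = (grad_out @ W2.T) * (1 - h**2)   # backprop through tanh
    W1 -= lr * X.T @ grad_h
    b1 -= lr * grad_h.sum(axis=0)

print("training accuracy:", ((p > 0.5) == y).mean())  # well above chance after training
```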
 
  • #15
  • #16
Hi there,
Here is Danko. Hello everyone. Writing an article is maybe a bit too much for me right now. But I would be glad to answer questions. Here are two comments on what has been said before:
- Does AI have access to the same data as humans? In my opinion, at one important level the answer is NO. This is the knowledge we have stored in our genes. We should think about genes as a small but very extensively trained (millions if not billions of years) machine learning component that assists every toddler's learning. Without the knowledge in its genes about what to learn and how to learn (we usually refer to those as instincts), a toddler could not do any of its intelligence magic. And this is the key problem: how to provide an AI with this millions-of-years-of-experience wisdom that we are born with? How do we provide an AI with the data that our ancestors used throughout evolution to get us the genes that we have? (Not to mention the computational power needed to work an AI's way through these data.)

- Can today's artificial neural networks eventually match or exceed human capabilities? I have written an article explicitly dealing with that problem: according to my calculation, the answer is NO. The good news is that I also propose an organisation of AI that possibly could do that. One can download the paper here:

http://www.ijac.net/EN/article/downloadArticleFile.do?attachType=PDF&id=1958

I know that the paper is technical and scientific and that people would prefer a digest. Maybe you can just take a look at the abstract.

I hope that this is useful.

Danko
 
  • #17
Auto-Didact said:
Can you perhaps get him to come here and write an Insight article or something? :)
As you can see above, I just did it. :smile:
 
  • #18
Demystifier said:
As you can see above, I just did it. :smile:

Today is like Christmas! This is almost as exciting as meeting Roger Penrose in person last year :D

Danko Nikolic said:
Hi there,
Here is Danko. Hello everyone. Writing an article is maybe a bit too much for me right now. But I would be glad to answer questions. Here are two comments on what has been said before:
- Does AI have access to the same data as humans? In my opinion, at one important level the answer is NO. This is the knowledge we have stored in our genes. We should think about genes as a small but very extensively trained (millions if not billions of years) machine learning component that assists every toddler's learning. Without the knowledge in its genes about what to learn and how to learn (we usually refer to those as instincts), a toddler could not do any of its intelligence magic. And this is the key problem: how to provide an AI with this millions-of-years-of-experience wisdom that we are born with? How do we provide an AI with the data that our ancestors used throughout evolution to get us the genes that we have? (Not to mention the computational power needed to work an AI's way through these data.)

- Can today's artificial neural networks eventually match or exceed human capabilities? I have written an article explicitly dealing with that problem: according to my calculation, the answer is NO. The good news is that I also propose an organisation of AI that possibly could do that. One can download the paper here:

http://www.ijac.net/EN/article/downloadArticleFile.do?attachType=PDF&id=1958

I know that the paper is technical and scientific and that people would prefer a digest. Maybe you can just take a look at the abstract.

I hope that this is useful.

Danko

Honoured to make your acquaintance. I'm at work currently so I cannot spend too much time reading the paper, but I will do so asap.

In the meantime I was hoping that you could elaborate on the dynamical systems description of practopoiesis, specifically that thinking was akin to changing the parameters of this system, and how new thoughts occur during phase transitions i.e. during bifurcation of this system.

Did you happen to have some specific equations and parameters in mind, and how would these be changed physically? And for humans/animals, should we be thinking about these as simple attractors detectable through analysis, or more like some high-dimensional attractors, perhaps akin to some Kuramoto-type network model?
 
  • #19
In the meantime I was hoping that you could elaborate on the dynamical systems description of practopoiesis, specifically that thinking was akin to changing the parameters of this system, and how new thoughts occur during phase transitions i.e. during bifurcation of this system.

Did you happen to have some specific equations and parameters in mind, and how would these be changed physically? And for humans/animals, should we be thinking about these as simple attractors detectable through analysis, or more like some high-dimensional attractors, perhaps akin to some Kuramoto-type network model?

I haven't made an interpretation based on dynamical systems. One could but I was never sure that this would be particularly insightful. Maybe it would, but one would have to try first. Instead, I focused on cybernetic/control-theory interpretation.

A dynamical system would need to be described by stochastic differential equations.

Still, intuitively, an interpretation of the practopoietic hierarchy (traverses) based on dynamical systems would be quite simple to understand, I think. There is nothing especially complicated about it -- at least not in principle. You just need to imagine two dynamical systems, one that operates fast (F, say updated every second) and one that operates slowly (S, say updated every five hours). Now we need the following conditions:
- the value of at least one parameter of F is being decided/adjusted by S. But S cannot affect the dynamics of F in any other way.
- In contrast, F cannot affect the parameters of S, but the accumulated results of the dynamics of F become a part of the dynamics of S. Thus, the dynamics of F affect the dynamics of S.

This is all.

The two are sort of asymmetrically coupled: in one direction they interact through the parameters of a dynamical system, S -> F; in the other direction they interact through dynamics: F -> S. This results in a practopoietic loop of causation: http://www.danko-nikolic.com/practopoietic-cycle-loop-of-causation/

It may be difficult to make a mental click to understand what I am talking about. But once the click occurs, it is very easy to think about these systems. There is no immense complexity that often occurs with dynamical systems. This is precisely because the two operate at different speeds. So, whenever you think about the fast one, you can neglect the operations of the slow one, and when you think about the slow one, you can approximate the operations of the fast one with some simple function (mean + noise).

Kuramoto types of networks are in my understanding not particularly relevant here.

Their interaction is such that S induces bifurcations of F, but not vice versa. In contrast, the dynamics of F, accumulated over time, make up a part of the dynamics of S.

What is critical is that S has knowledge of when and in which direction to change the parameters of F. And to discuss that further, we have to define certain "goals" or "target values" that S and F are trying to achieve. And this leads us to attractors. We can say that S has an attractor state, much like any regulator.

As to particular equations, you can use any equations you want. This is completely unrestricted, as long as they satisfy the conditions mentioned above.

I hope this is understandable.
Danko
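For readers who want to see the coupling structure above spelled out, here is one arbitrary instantiation in code. The particular update rules are invented; only the two conditions from the post are respected (S touches exactly one parameter of F, and F reaches S only through its accumulated dynamics).

```python
# Sketch (invented update rules) of two asymmetrically coupled systems, as described above.
# F is fast (updated every step, "every second"); S is slow (updated every 18000 steps, "five hours").
import numpy as np

rng = np.random.default_rng(2)

x = 0.0            # state of the fast system F
s = 1.0            # state of the slow system S
a = 0.5            # the single parameter of F that S is allowed to adjust
accumulated = 0.0  # running record of F's dynamics, the only channel from F to S

SLOW_PERIOD = 18_000  # fast steps per slow update (five hours at one step per second)

for t in range(10 * SLOW_PERIOD):
    # Fast dynamics: depend only on F's own state, its parameter a, and noise.
    x = a * x + rng.normal(scale=0.1)
    accumulated += x

    if (t + 1) % SLOW_PERIOD == 0:
        # Slow dynamics: S sees only the accumulated results of F's dynamics ...
        mean_x = accumulated / SLOW_PERIOD
        s = 0.9 * s - 0.1 * mean_x
        accumulated = 0.0
        # ... and, in return, adjusts exactly one parameter of F.
        a = np.tanh(s)
```

Because of the separation of timescales, each system can be analysed while treating the other as either frozen (S, from F's point of view) or as a simple summary (F, from S's point of view), which is the simplification noted in the post.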
 
  • #20
Is the distinction between algorithm and function of "some relevance" to this topic (which I don't know anything about) in general?

A quite simple way to describe it: suppose we have a function that takes an array as input and outputs a sorted array. The function is unique, but we distinguish between various "algorithms"/"methods" for achieving it (a toy sketch of this is below).

I am not sure that this distinction can be made fully mathematically rigorous (possibly in the same way that "efficient computation" or "natural examples" etc. "surely" can't be made mathematically rigorous but perhaps could be defined in practically useful ways).

Speaking quite generically, I am thinking along the lines that while stimulus and response are important parts of interaction with the environment, possibly the internal representation of information is also of some importance (and this seems to be related to the function/algorithm distinction).
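A toy sketch of the sorting example: one function, two algorithms that realize it.

```python
# One function (sorting), two different algorithms realizing it.
def sort_builtin(xs):
    # Timsort, via Python's built-in sorted()
    return sorted(xs)

def sort_insertion(xs):
    # Insertion sort: a different algorithm computing the same function
    out = []
    for x in xs:
        i = 0
        while i < len(out) and out[i] <= x:
            i += 1
        out.insert(i, x)
    return out

data = [5, 2, 9, 1, 2]
assert sort_builtin(data) == sort_insertion(data) == [1, 2, 2, 5, 9]
```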
 
  • #21
Danko Nikolic said:
A dynamical system would need to be described by stochastic differential equations.
Why stochastic? Why not deterministic?
 
  • #22
Demystifier said:
Why stochastic? Why not deterministic?

Because the interaction is between an organism and its environment. A real environment is unpredictable; you never get into an identical situation twice, and the environment never responds twice in the same way to your actions. Therefore, from the perspective of differential equations the interaction has a considerable stochastic component.
 
  • #23
Danko Nikolic said:
Because the interaction is between an organism and its environment. A real environment is unpredictable; you never get into an identical situation twice, and the environment never responds twice in the same way to your actions. Therefore, from the perspective of differential equations the interaction has a considerable stochastic component.
OK, but from a dynamical-systems perspective, unpredictable behavior of the environment can be a result of deterministic chaos. In the end, there may not be much apparent difference between stochastic and chaotic modeling of the environment, and the former may be simpler to implement in a computer simulation, but the latter seems more realistic from the fundamental physical point of view.

Anyway, this all looks like a red herring, as I agree with you that the dynamical-systems perspective is not very useful here.
 
  • #24
I just read the paper you linked earlier. The part about how slowly learned genetic policies enable networks themselves to gain knowledge about fast adaptation policies, so that the operation of the adaptation policies directly provides stimuli with their best interpretation (in other words, an actual explanation of what 'understanding' may entail), simply blew me away.

This actually answers many long-standing philosophical questions in the philosophy of mind, including what qualia may be.
Danko Nikolic said:
I haven't made an interpretation based on dynamical systems. One could but I was never sure that this would be particularly insightful. Maybe it would, but one would have to try first. Instead, I focused on cybernetic/control-theory interpretation.

I am sure that I read somewhere a dynamical-systems description/metaphor of practopoiesis; that was in fact what caused the click with my own (nowhere near as well developed) ideas about cognitive states being represented as points in some state space.
Danko Nikolic said:
Their interaction is such that S induces bifurcations of F, but not vice versa. In contrast, the dynamics of F, accumulated over time, make up a part of the dynamics of S.

What is critical is that S has knowledge of when and in which direction to change the parameters of F. And to discuss that further, we have to define certain "goals" or "target values" that S and F are trying to achieve. And this leads us to attractors. We can say that S has an attractor state, much like any regulator.

The existence of such attractors is precisely why I opt for a dynamical systems description. If, as in a regular cognitive setting, many different aspects of some perceived phenomenon are to be evaluated on the same time scales, i.e. different network policies are executed in parallel, this implies that these multiple outputs together form some attractor, and that similar behavior may be evoked by activating all or many of these network policies as if just a few or even one of them had been activated.
Demystifier said:
OK, but from a dynamical-systems perspective, unpredictable behavior of the environment can be a result of deterministic chaos. In the end, there may not be much apparent difference between stochastic and chaotic modeling of the environment, and the former may be simpler to implement in a computer simulation, but the latter seems more realistic from the fundamental physical point of view.

Anyway, this all looks like a red herring, as I agree with you that the dynamical-systems perspective is not very useful here.

I disagree; the usefulness of the dynamical systems perspective all depends on what a theory is aiming to explain and at what level. The perspective enables the rapid creation of experimentally checkable hypotheses which may otherwise not be apparent at all to those thinking directly about some naturally occurring system or using statistics to do their hypothesis testing for them. This can happen completely outside of the context of the original theory, in this case AI.

Here are some examples: 'the dynamics of rewiring networks into forms most conducive to abductive reasoning either will or will not exhibit small-world characteristics'. Or, 'the equi-level synchronised activation of different network policies implies that synchronised chaos may exist across many cognitive states'. Or even, 'a sudden discontinuous increase in cognitive capacities is to be expected when comparing species which have evolved genetic policies capable of creating small-world neural networks with those without such policies'. Such 'insights' are far more easily generated than if one were to rely on logical deduction alone, and once envisioned they naturally raise tonnes more questions, all of which definitely seem checkable in some way.

Moreover, evidence can be, and often already has been, gained from other researchers, both top-down and bottom-up, who were not looking for such patterns, meaning we can rapidly falsify models in this way. We can even use the perspective to tie together many different sciences in novel ways, e.g. https://crl.ucsd.edu/~elman/Papers/dynamics/dynamics.html, directly also leading to new results in completely orthogonal directions, such as towards the subject we are actually discussing here.

As you yourself say, many spontaneous behaviors in the environment and induced by the environment on some systems need not be strictly stochastic given deterministic chaos. The nice thing is that extremely complicated but typical behavior will tend to fall on an attractor. If the goal is identifying and characterizing such possibly immensely complicated attractors, I don't see how one would do that without phase space reconstruction and/or other tools inherent to a dynamical systems perspective.

Lastly, on a more abstract level, it seems all complexity science subjects, such as cybernetics, chaos, (nonlinear) dynamical systems, network theory and so on, share an underlying mathematical backbone which is currently, as a field of mathematics, still a work in progress, perhaps one extremely relevant to physics. Many great mathematicians and physicists, both historical and contemporary (e.g. Benoit Mandelbrot, Floris Takens, John Baez, Stephen Strogatz) have made this point and I tend to agree with them on it.
 
  • #25
 
  • #26
@Danko Nikolic:
I just reread your 2015 practopoiesis paper:
Regarding prediction #3, did you ever find a physiological mechanism underlying ideatheca? If not, Craddock et al. 2012 gives a specific physiological mechanism in the form of LTP-activated enzymes encoding information directly onto the neuronal cytoskeleton, i.e. CaMKII encoding information on microtubules (MTs).

Seeing that, after formation, neuronal MTs remain stable, i.e. don't depolymerize like non-neuronal MTs, information encoded on them would remain stable throughout adulthood, providing a means of stable long-term memory formation which can last years or even a lifetime. Moreover, in the last few years it has become known that loss of neuronal cytoskeletal structure is associated with memory loss in Alzheimer's disease, leading even to experiments being carried out with MT-stabilizing agents (taxanes, originally chemotherapeutic agents) in both Alzheimer mouse models and patients. For more information, see this recent review on the subject.
 
  • #27
Dale said:
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average.

The teen with 60 hours of experience is also far from average.
 
  • #28
The limit of machine learning is that it is still too restricted to certain kinds of problems. For example, we know how to solve optimization problems and we know how to solve classification and clustering problems. But humans classify, cluster, optimize and utilize far more advanced tricks than any algorithm is capable of performing, and they do it all day every day over decades.

In terms of machine learning, the brain is analogous to a complex system of deep spiking neural networks that possesses recurrences and convolutions. These networks form functional modules but also communicate with other modules, a phenomenon that probably gives rise to the flexibility of our cognition and lets us "think outside the box", playing around with symbols and ideas in ways that would not otherwise be possible.

It is the ultimate goal of the machine learning program to develop such a flexible algorithm for learning, but I doubt it can ever be done without a complex system approach. Marvin Minsky warned against the deceptive idea of peeking inside the brain and trying to find a "mind" responsible for intelligence, when every component of the brain is itself unintelligent and the mind is just a holistic property of the system.
 
  • #29
Krunchyman said:
It is the ultimate goal of the machine learning program to develop such a flexible algorithm for learning, but I doubt it can ever be done without a complex system approach
Krunchyman said:
But humans classify, cluster, optimize and utilize far more advanced tricks than any algorithm is capable of performing, and they do it all day every day over decades.
And with a minimal portion of the genetic code, which does far more than just "execute commands": it also has a portion which encodes the machinery to construct and initiate it... I wonder what that adds up to in bits of DNA compared to source code and data, not that it is comparing apples to apples.
 
  • #30
A couple of computer technologists that I know are skeptical about AI largely because machine learning often requires huge amounts of data. That said, they do believe that most tasks currently done by humans will someday be done by computerized machines. This will lead to a crisis of employment when human labor becomes obsolete. This does not mean that machine thinking will be like human thinking. But it does mean that individual tasks will be mechanized.

More broadly, one might ask what sort of machine the human brain is. And even broader than that, what sorts of models of thinking are there? A nerve cell is just an on/off switch with a threshold trigger and is easily modeled on a computer. Also, simple nervous systems - e.g. the nervous systems of some species of clams - have been completely modeled by finite state machines. This sort of consideration would suggest that the human mind is an extremely complex finite state machine. Some have suggested that the brain may also use quantum computing. Whether or not this is true, quantum computing seems to be another possible model.
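A minimal sketch of the "nerve cell as an on/off switch with a threshold trigger" picture, in the style of a McCulloch-Pitts unit; the weights and threshold below are arbitrary illustrative values, not a model of any real neuron.

```python
# A nerve cell caricatured as an on/off switch with a threshold trigger.
# Weights and threshold are arbitrary illustrative values.
def threshold_neuron(inputs, weights, threshold):
    """Fires (returns 1) iff the weighted sum of inputs reaches the threshold."""
    total = sum(i * w for i, w in zip(inputs, weights))
    return 1 if total >= threshold else 0

# Two excitatory inputs and one inhibitory input:
print(threshold_neuron([1, 1, 0], weights=[0.6, 0.6, -1.0], threshold=1.0))  # 1 (fires)
print(threshold_neuron([1, 1, 1], weights=[0.6, 0.6, -1.0], threshold=1.0))  # 0 (inhibited)
```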
 
  • #31
FallenApple said:
But there's no way a learning algorithm can make any of those connections because those deductions are mostly intuition and logic, which are not statistical. Not to mention, how would ML look at confounders?
I think that you are seriously underestimating the variety and seriousness of the research being done. There are already symbolic logic manipulators and theorem provers in practical applications and in general use. There are other research efforts that manipulate relationships, looking for fundamental theorems. It is a misconception to think that the state of the art of machine learning is limited to data analysis.
 
  • #32
FallenApple said:
From what I understand, machine learning is incredibly good at making predictions from data in a very automated/algorithmic way.
This is a meaningless generalization. If the data is undersampled, inaccurately labeled (which is most of the time), or complex (e.g. one sample is hundreds of gigabytes in size), or requires high accuracy, machine learning is an atrocious approach. The majority of problems have these downsides.

Also, the effectiveness depends not only on these generalizations, but also on the method and specific problem. Clustering is hugely inaccurate and heavily dependent upon human intervention ("What is a cluster? What is similarity?"), making it very vulnerable to the "high accuracy" weakness, since constructing the similarity model either requires a vast amount of data you don't have or very good human intuition. Classification can be much easier since it is not usually constrained in the same way.

Finally, machine learning does not make predictions from data in an automated/algorithmic way; it makes models, which require some form of assumptions, in an algorithmic/automated way, and these models make the predictions. This is more than a trite observation. Consider clustering. For typical methods (e.g. k-means), you are deciding what function determines similarity (in this case, d-dimensional Euclidean distance). The assumption that d-dimensional Euclidean distance captures similarity is usually nonsense, and is usually not checked in any meaningful way, in my experience.
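To make the buried similarity assumption visible, here is a bare-bones k-means sketch on made-up data; the Euclidean-distance choice sits on a single, swappable line that the algorithm itself never questions.

```python
# Bare-bones k-means; the choice of d-dimensional Euclidean distance as the
# similarity model is the single commented line below -- an assumption that the
# algorithm never checks. Data and k are made up for illustration.
import numpy as np

rng = np.random.default_rng(3)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(5, 1, (50, 2))])
k = 2
centers = X[rng.choice(len(X), k, replace=False)]

for _ in range(20):
    # THE modeling assumption: similarity == Euclidean distance to a center.
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    labels = d.argmin(axis=1)
    centers = np.array([X[labels == j].mean(axis=0) for j in range(k)])

print(centers)  # roughly [0, 0] and [5, 5] for this toy data
```

Replacing that one commented line with a learned similarity function is exactly the step that, as noted below, you rarely have the data or compute to carry out.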

The real advantage of ML is not prediction accuracy; it is automation. Thus, if I run a company or an experiment that generates large amounts of data, ML is a useful way to write programs to build models that use this data, sometimes updating automatically. However, you still have to figure out how to model the data. In principle you can build every part of the model directly from data; for instance, a neural network can be fitted to compute similarity instead of a Euclidean norm, and a different clustering algorithm can be used. The difficulty is that you will essentially never have the data or computational resources to do this; it's like trying to simulate an integrated circuit using density functional theory to model all of the electronics from atoms up (i.e. stupid). You have to truncate and make modeling assumptions somewhere; they even appear in how you label the data and train the NN.
 
  • #33
FallenApple said:
So basically, a machine learning algorithm would need human level intelligence and intuition to be able to do proper causal analysis?
Machine learning cannot "pull a rabbit out of a hat". There is no magic. A neural net has to be fine-tuned to be able to make any reasonable associations from large training data sets. I don't know for sure, but I don't think there are any large data sets of causal analysis for it to "learn" from, and even if there are, machines aren't like humans, who can take a data set and expand upon it to make sense of unfamiliar connections.
 
  • #34
This thread is diverging from AI and going into too much personal opinion. I am moving it to General Discussion. Why? Because there are some good posts here mixed with less useful opinion. We do not need to throttle people for lack of scientific poise if the thread lives in GD.

Thread moved.
 
  • #35
lavinia said:
A couple of computer technologists that I know are skeptical about AI largely because machine learning often requires huge amounts of data.
There's a new version of AlphaGo which seems to use minimal input data. To quote from https://deepmind.com/blog/alphago-zero-learning-scratch/:
Deepmind said:
Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting from completely random play.
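Nothing remotely like AlphaGo Zero itself, but as a toy of the "learn purely by playing against yourself, starting from completely random play" idea, here is tabular self-play Q-learning for the take-1-to-3 Nim game; the game, parameters and table are all chosen just for illustration.

```python
# Toy self-play learner for Nim (pile of stones, take 1-3 per turn, taking the last stone wins).
# Both "players" share one table: Q[s][a] estimates the win chance for the player to move
# in state s after taking a stones. Play starts out completely random and improves only
# through self-play.
import random

N = 21                       # starting pile size
Q = {s: {a: 0.5 for a in (1, 2, 3) if a <= s} for s in range(1, N + 1)}
alpha, eps = 0.1, 0.2        # learning rate and exploration rate

def pick(s):
    # epsilon-greedy move for the player to move in state s
    if random.random() < eps:
        return random.choice(list(Q[s]))
    return max(Q[s], key=Q[s].get)

for _ in range(100_000):                 # self-play episodes
    s = N
    while s > 0:
        a = pick(s)
        if a == s:
            target = 1.0                               # taking the last stone wins
        else:
            target = 1.0 - max(Q[s - a].values())      # opponent moves next from s - a
        Q[s][a] += alpha * (target - Q[s][a])
        s -= a

# In take-1-to-3 Nim, multiples of 4 are losing for the player to move.
print(sorted(s for s in range(1, N + 1) if max(Q[s].values()) < 0.5))  # expect [4, 8, 12, 16, 20]
```

The table starts out flat (random play) and, purely from self-play, ends up marking the known losing positions, which is the same shape of idea as in the quote above, minus the neural network and tree search.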
 
  • #36
Dale said:
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average. Similarly, even a below average human toddler can learn to speak any language with far less data than is available to computers attempting the same task.
To be honest, I think there is some false equivalence here. Not that I can necessarily produce a more equitable comparison myself. The very nature of "learning" is, in my opinion, extremely dependent not only on the capability and "suitability" of the pupil (a person without legs would struggle to learn to "walk", to take an extreme example), but also on the particulars of the subject and the manner of tuition.

A human teenager has an immense advantage because they already know that a cyclist up ahead is a cyclist and might move onto the road, as opposed to a lamppost, which is less likely to do so; they have instinctive emotional reactions that can prompt them to slam on the brakes. Of course, autonomous vehicles have other advantages that humans do not have, but again, there is not a direct equivalence here, and I don't believe that such comparisons lead to a 'fair' appraisal of human and machine learning.

By that kind of logic, one might argue that deer are much faster learners than humans just because a deer can walk minutes after birth.

There is a wealth more context, available information and 'technique' that humans learn and can apply while learning to drive, which isn't necessarily true of machine AI autonomous driving tech.
By the time a teenager starts learning to drive, they have already seen and understood traffic lights, they know not to stop in the middle of the motorway, they know that there are some roads they might not be able to drive down, and they know the implications of mechanics, such that driving faster means less control, more risk and longer stopping distances, etc. All these factors, intuitions and preconceptions are built up over the course of the teenager's life, and are therefore already present before the 60-hour learning begins. The laws that make up highway codes have been developed over decades, and other knowledge about the world has been passed down through generations from many varied sources.

Before a machine can learn to drive a car, it needs to learn the pattern recognition, and the data needs to be amassed, collated and prepared in a format that can be utilised effectively. This also requires that the data retrieval aspects of the algorithms are developed AND EFFICIENT ENOUGH to work in the real-time scenario of driving a car.
Yes, computers process gajillions of teraflops a second, but the number of data points (in visual processing alone) to be processed is also incredibly high.

Humans have had BILLIONS of years of evolution to refine their image processing. I've not used the best examples, and I appreciate that there are some elements (such as "it doesn't add much to a 60-hour learning time to learn that red = stop"), but I hope the point is clear at least.
 
  • #37
_PJ_ said:
Humans have had BILLIONS of years of evolution to refine their image processing.
I guess if you assume bacteria had image processing abilities then just maybe those genes found their way into our DNA but that's a stretch...
_PJ_ said:
but I hope the point is clear at least.
If there were sub-routines specifically devoted to threat detection and they were programmed effectively, that would make the rest of the system straightforward, at least.
 
  • #38
Dale said:
So far, I am not convinced that machines are particularly good at learning. For example, an average human teenager can learn to drive a car with about 60 hours of experience. The machines that are learning the same task have millions of hours and are far from average. Similarly, even a below average human toddler can learn to speak any language with far less data than is available to computers attempting the same task.
The human can transfer other knowledge accumulated during their lifetime to the task of driving. For example, what a pedestrian looks like, what color the sky is, how to walk. Now try to teach a newborn to drive in 60 hours.
 
  • #39
Khashishi said:
Now try to teach a newborn to drive in 60 hours.
All I can think of is The Simpsons intro where Marge is driving and Maggie is in her car seat mimicking her every move. :-p
 
  • #40
Khashishi said:
The human can transfer other knowledge accumulated during their lifetime to the task of driving.
That is a big part of what makes humans so good at learning, and computers not.

_PJ_ said:
By that kind of logic, one might argue that deer are much faster learners than humans just because a deer can walk minutes after birth.
Interesting point! I wonder if deer actually learn to walk or if it is already hardwired in? I don’t know the answer to that, but in general it seems to me that humans do a lot more learning than other species.
 
  • #41
True. Less reliance on instincts is probably why humans are so good at learning new things. We have some remarkable neuroplasticity. If we hook up robot appendages to a baby, with proper connections, it could probably learn to walk on robot legs.
 
  • #42


The title is a bit clickbaity, but the research mentioned seems truly remarkable, or interesting to say the least; in particular, the possibility that automatic ML could democratize ML from researchers to basically anyone for any task seems like a major revolution still waiting to happen.

@Danko Nikolic, might this be a precursor to your AI kindergarten? I did not read their paper, so I'm not sure if they refer to your papers, but their automatic ML reinforcement learning reminds me an awful lot of your practopoietic learning-to-learn explanation. If they haven't referenced you, they should, or maybe you should give a Google Tech Talk.
 
  • #43
Here we see the current limits on machine learning in action: Sofia vs Penrose!
 
  • #44
phyzguy said:
I think you're missing the point of how machine learning is being increasingly done today. Many (perhaps most) machine learning tools today are not algorithmic in nature. They use neural networks configured in ways similar to the human brain, and then train these networks with learning sets, just like a human is trained to recognize patterns. Even the (human) designer of the neural network doesn't know how the machine will respond to a given situation. Given this, I don't see why these artificial neural networks cannot match or eventually exceed human capability. Indeed, I think Google's facial recognition software is already exceeding human capability. Granted this is in a controlled environment, but given time and increasing complexity of the networks (and increasing input from the environment), I think you will see these machines able to do anything a human mind can do.

Yes, but these techniques need a clear set of rules and a definition of 'winning' to optimize against - as in Go, Chess or Poker (which was not a Deep Learning algorithm, BTW). The 'rules' and definition of 'winning' for tasks like driving are much more nebulous and complex, let alone the optimization function for a general AI.
 
  • #45
FallenApple said:
From what I understand, machine learning is incredibly good at making predictions from data in a very automated/algorithmic way.

But any inference that deals with ideas of causality is primarily a subject-matter concern, which relies mostly on judgment calls and intuition.

So basically, a machine learning algorithm would need human level intelligence and intuition to be able to do proper causal analysis?

Here's an example where there might be issues.

Say, an ML algorithm finds that low socioeconomic status is associated with diabetes with a significant p value. We clearly know that diabetes is a biological phenomenon and that any possible (this is a big if) causal connection between a non-biological variable such as low SES and diabetes must logically have intermediate steps between the two variables within the causal chain. It is these unknown intermediate steps that probably should be investigated in follow-up studies. We logically know (or intuit from prior knowledge + domain knowledge) that low SES could lead to higher stress or an unhealthy diet, which are biological. So a significant p value for SES indicates that maybe we should collect data on those missing variables, and then redo the analysis with those in the model.

But there's no way a learning algorithm can make any of those connections because those deductions are mostly intuition and logic, which are not statistical. Not to mention, how would ML look at confounders?
Let me offer some opinions as someone who used AI early in the game. The human brain is perhaps the closest thing to infinite we've ever found. AI runs on a computer which isn't even as brilliant as a lizard. There are arguments among authorities about whether a brain works using binary logic or analog logic, with me leaning towards analog. We can speak of quantum computers, but these are not really analog - they consist of logic of 0, maybe, and 1. AI is really a method of bypassing a lot of hardware by using training algorithms. But this has the same limitations as a lot of hardware - it can only do what you train it to do. It can only use methodology you show it. It cannot actually invent anything of its own unless you train it in the method to do so. What we're speaking of here is teaching a computer how to be original. Since we cannot even tell ourselves how to be original beyond stupid things like "think outside the box", you're not going to achieve that.

Self-driving cars will work fine if all of the human drivers around them obey the rules and the road designers do as well. There is NO CHANCE of that. I ride a bicycle for sport, and believe me, I cannot ride one mile without observing people breaking driving laws in such a dangerous manner that it surprises me that we don't have 100 times the accident rate that we do. The fact that we don't is because other drivers are ALWAYS expecting poor driving, and they are thinking further ahead than a self-driving car could.

While I was waiting at a stop light, it changed to green and NO ONE MOVED. I wondered why, and a car exiting the bridge came through the red light 5 full seconds after the light had changed, at 60+ mph and accelerating. Because of the speed and the amount of time after the light changed, there isn't an automated driving system in the world that could have predicted that. The speed of that pickup put it 150 yards out and out of practical range of the detectors on a self-driving car. Put in better hardware? There are price limitations, after all.

But there is even more - you are insured. If you get into an accident, your insurance pays the bills if required. But with self-driving cars, the system manufacturer is liable. No insurance company in their right mind would insure such a company. This is like hanging out a sign saying "Millions of dollars free for the taking".

So the single largest source of R&D money for AI research is essentially removed from the market, and each company must assume the liabilities via their shareholders. I'll leave you to decide how successful that is going to be. Tesla has already ceased to call it an "autopilot" and now calls it a navigation device, and you are required to have two hands on the wheel during the entire time that it is engaged. And I'll bet that there are now detectors built in to remove all liability from Tesla if you do take a hand off before a wreck.

Having a smartphone do simple tasks under AI might give those of the Star Wars generation visions of C-3PO, but it isn't that, and it won't improve much.

Google's use of AI is little more than delivering news to you that they think you want - anti-Trump stuff only, or pro-Trump stuff for the other side. And Google guards this data jealously, because if it becomes too widely understood that users are essentially being manipulated, there is no telling the price they will pay for that. Go to YouTube and play a Glenn Miller piece, and almost all of the remaining recommendations are from the '30s to '50s. This isn't smart. This is in fact rather stupid. Playing a Huey Lewis and the News piece doesn't mean that you want to listen to everything else from the '70s.

Can AI and Deep Learning be improved? What couldn't be? But will that improvement evade the real limitations of machine intelligence? That is very highly doubtful.
 
  • #46
I'm fascinated with the thought of analog computing returning. I have no business even entering this conversation with my level of knowledge; that said, it seems the potential for analog computing opens up avenues that are far more complex than straight binary. Arguments against analog that I've heard are that at the very base of physics lies the state of is, or is not. Binary. Yet it seems to me that synthesis and learning are more of an analog system. "Probably" is best described as a range or spectrum - analog. "Definitely" is singular and linear - binary.
I wonder if machine learning and AI will advance through a further investigation of analog computing advances, possibly crossed with binary. Much as the realization that nonlinear equations weren't junk, but were much better described as three-dimensional entities. Is it possible that simultaneous data running on multiple channels in an analog computer might experience resonance that could be a source of insight? Clock speeds and amplitudes could result in intersections of data that might prove profound. How one would configure a system like this is beyond me, but I have to wonder if the answer to ultimate machine learning can be found solely in a binary state. I don't believe there is a hard limit to machine learning. Humans have barely scratched the surface of the AI field, yet from my age perspective the advances are staggering. So. Carry on. All is as it should be.
 
  • #47
Trawlerman said:
I'm fascinated with the thought of analog computing returning. I have no business even entering this conversation with my level of knowledge; that said, it seems the potential for analog computing opens up avenues that are far more complex than straight binary. Arguments against analog that I've heard are that at the very base of physics lies the state of is, or is not. Binary. Yet it seems to me that synthesis and learning are more of an analog system. "Probably" is best described as a range or spectrum - analog. "Definitely" is singular and linear - binary.
I wonder if machine learning and AI will advance through a further investigation of analog computing advances, possibly crossed with binary. Much as the realization that nonlinear equations weren't junk, but were much better described as three-dimensional entities. Is it possible that simultaneous data running on multiple channels in an analog computer might experience resonance that could be a source of insight? Clock speeds and amplitudes could result in intersections of data that might prove profound. How one would configure a system like this is beyond me, but I have to wonder if the answer to ultimate machine learning can be found solely in a binary state. I don't believe there is a hard limit to machine learning. Humans have barely scratched the surface of the AI field, yet from my age perspective the advances are staggering. So. Carry on. All is as it should be.
Digital logic is so fast because it only has two states. Analog has a theoretically infinite number of states, though due to actual accuracy it has more to do with settling time. And this is a great deal slower. Whether the increased accuracy from a smaller number of "bits" is worth it - it hasn't been worth it up to this point, but I will have to think about that. I know a couple of REAL analog engineers that are more than worth their salt. National awards and all.
 
  • #48
Tom Kunich said:
Digital logic is so fast because it only has two states. Analog has a theoretically infinite number of states, though due to actual accuracy it has more to do with settling time. And this is a great deal slower. Whether the increased accuracy from a smaller number of "bits" is worth it - it hasn't been worth it up to this point, but I will have to think about that. I know a couple of REAL analog engineers that are more than worth their salt. National awards and all.
OK, I went to the people who know the real stuff, and it seems like the settling time to get an accuracy of one part in one thousand would be 100 times the maximum speed of the circuits. This sort of kills that idea, since it would take about a millisecond to get that accuracy in a common op-amp. Now you can push the envelope with high-speed op-amp designs, etc. But digital circuitry could do this in a hundredth of that time.

Shrinking the circuitry could speed it up or slow it down depending on many things - the less power, the larger the effects of stray capacitance and conductor resistance, etc.

I'm far more of a digital designer and after thinking about quantum computing I have to learn more about it. But I think that I have an idea for making a simple cell.
 
  • #49
FallenApple said:
Say, an ML algorithm finds that low socioeconomic status is associated with diabetes with a significant p value. We clearly know that diabetes is a biological phenomenon and that any possible (this is a big if) causal connection between a non-biological variable such as low SES and diabetes must logically have intermediate steps between the two variables within the causal chain. It is these unknown intermediate steps that probably should be investigated in follow-up studies. We logically know (or intuit from prior knowledge + domain knowledge) that low SES could lead to higher stress or an unhealthy diet, which are biological. So a significant p value for SES indicates that maybe we should collect data on those missing variables, and then redo the analysis with those in the model.

But there's no way a learning algorithm can make any of those connections because those deductions are mostly intuition and logic, which are not statistical. Not to mention, how would ML look at confounders?

An ML algorithm with advanced language-processing capabilities could learn about diabetes from the existing literature and identify the confounder variables that would need to be accounted for. Artificial intelligence can theoretically be superior to the human brain in everything, and I'm pretty sure that it's just a matter of time until that is achieved.
 
  • #50
ZeGato said:
Artificial intelligence can theoretically be superior to the human brain in everything,
Please cite the professional scientific reference that supports this claim. It seems improbable to me.
 