One of the biggest misconceptions around is the idea that Deep Learning (DL) or Artificial Neural Networks (ANN) mimic biological neurons. At best, ANN mimic a cartoonish version of a 1957 model of a neuron. Anyone claiming Deep Learning is biologically inspired is in doing so for marketing purposes or has never bother to read the biological literature. Neurons in Deep Learning are essentially mathematical functions that perform a similarity function of its inputs against internal weights. The closer a match is made, the more likely an action is performed (i.e. not sending a signal to zero). There are exceptions to this model (see: Autoregressive networks) however it is general enough to include the perceptron, convolution networks and RNNs.

Neurons are very different from DL constructs. The don't maintain continuous signals but rather exhibit spiking (or event driven) behavior. So, when you hear about "neuromorphic" hardware, then these are inspired on "integrate and spike" neurons. These kinds of system at best get a lot of press (see: IBM TrueNorth), but have never been shown to be effective. There has been some research work however that has shown some progress. If you ask me, if you truly want to build biologically inspired cognition, then you should at the very least explore systems that are not continuous like DL. Biological systems by their very nature will use the least amount of energy to survive. DL systems in stark contrast are power hungry. That's because DL is a brute-force method to achieve cognition. We know it works, we just don't know how to scale it down.

Jeff Hawkins of Numenta has always lamented that a more biologically-inspired approach is needed. So, in his research in building cognitive machinery, he has architected system that try to more closely mirror the structure of the neo-cortex. Numenta's model of a neuron is considerably more elaborate than the Deep Learning model of a neuron as you can see in this graphic:

The team at Numenta is betting on this approach in the hopes of creating something that is more capable than Deep Learning. It hasn't been proved to be anywhere near successful. They've been doing at this long enough that the odd of them succeeding are diminishing overtime. By contrast, Deep Learning (despite its model of a cartoon neuron) has been shown to be unexpectedly effective in performing all kinds of mind-boggling feats of cognition. Deep Learning is doing something that is extraordinarily correct, we just don't know exactly what that is!

Unfortunately, we have to throw in a new monkey wrench on all this research. New experiments on the nature of neurons have revealed that biological neurons are even more complex than we have imagined them to be:

New Types of Experiments Reveal that a Neuron Functions as Multiple Independent Threshold Units

A single neuron's spike waveform typically varies as a function of the stimulation location.

Spatial summation is absent for extracellular stimulations from different directions.

Spatial summation and subtraction are not achieved when combining intra- and extra- cellular stimulations, as well as for nonlocal time interference, where the precise timings of the stimulations are irrelevant.

In short, there is a lot more going on inside a single neuron than the simple idea of integrate and spike. Neurons may not be pure functions dependent of a single parameter (i.e weight) but rather they are stateful machines. Alternatively, perhaps the weight may not be singled value but require a complex value or maybe higher dimensions. This is all behavior that research has yet to explore and thus we have little understanding to date.

If you think this throws a monkey wrench on our understanding, there's an even newer discovery that reveals even greater complexity:

Many of the extracellular vesicles released by neurons contain a gene called Arc, which helps neurons to build connections with one another. Mice engineered to lack Arc have problems forming long-term memories, and several human neurological disorders are linked to this gene.

What this research reveals is that there is a mechanism for neurons to communicate with each other by sending packages of RNA code. These are packages of instructions and not packages of data. There is a profound difference between sending codes and sending data. This implies that behavior from one neuron can change the behavior of another neuron; not through observation, but rather through injection of behavior.

Experimental evidence reveals a new reality, even at the smallest unit of our cognition, there is a kind of conversational cognition that is going on between individual neurons that modifies each other's behavior. Thus, not only are neurons machines with state, but neurons are also machines with an instruction set and a way to send code to each other. I'm sorry, but this is just another level of complexity.

There are two obvious ramification of these experimental discoveries. The first is that our estimates of the computational capabilities of the human brain is likely to be at least an order of magnitude off. The second is that research will begin in earnest to explore DL architectures with more complex internal node (or neuron) structures.

If we were to make the rough argument that a single neuron performs a single operation, the the total capacity of the human brain is measured at 38 peta operations per second. If we're then to assume a DL model of operations being equal to floating point operations then a 38 petaflops system would be equivalent in capability. The top ranked supercomputer, Sunway Taihulight from China is estimated at 125 petaflops. However, let's say the new results reveal 10x more computation, then the number should be 380 petaflops and we perhaps have breathing room till 2019. What is obvious however is that biological brains actually perform much more cognition with less computation.

The second consequences it that it's now time to get back to the drawing board and begin to explore more complex kinds of neurons. The more complex kinds we've seen to date are the ones derived from LSTM. Here is the result of a brute force architectural search for LSTM-like neurons:

In summary, a research plan that explores more complex kinds of neurons may bear promising fruit. This is not unlike research that explores the use of complex values in neural networks. In these complex valued networks, performance improvements are noticed only on RNN networks. This should indicate that these internal neuron complexities may be necessary for capabilities beyond simple perception. I suspect that these complexities are necessary for advanced cognition that seem to evade current Deep Learning systems. These include robustness to adversarial features, learning to forget, learning what to ignore, learning abstraction and recognizing contextual switching.

I predict in the near future that we shall see more aggressive research in this area. After all, nature is already unequivocally telling us that neurons are individually more complex and therefore our own neuron models may also need to be more complex. Perhaps we need something as complicated as a Grassmann Algebra to make progress. ;-)

The most successful tyranny is not the one that uses force to assure uniformity but the one that removes the awareness of other possibilities, that makes it seem inconceivable that other ways are viable, that removes the sense that there is an outside.