10

If neurons and synapses can be implemented using transistors, what prevents us from creating arbitrarily large neural networks using the same methods with which GPUs are made?

In essence, we have seen how extraordinarily well virtual neural networks implemented on sequential processors work (even GPUs are sequential machines, but with huge amounts of cores).

One can imagine that using GPU design principles - which is basically to have thousands of programmable processing units that work in parallel - we could make much simpler "neuron processing units" and put millions or billions of those NPUs in a single big chip. They would have their own memory (for storing weights) and be connected to a few hundred other neurons by sharing a bus. They could have a frequency of for example 20 Hz, which would allow them to share a data bus with many other neurons.

Obviously, there are some electrical engineering challenges here, but it seems to me that all big tech companies should be exploring this route by now.

Many AI researchers say that superintelligence is coming around the year 2045. I believe that their reasoning is based on Moore's law and the number of neurons we are able to implement in software running on the fastest computers we have.

But the fact is, we today are making silicon chips with billions of transistors on them. SPARK M7 has 10 billion transistors.

If implementing a (non-programmable) neuron and a few hundred synapses for it requires for example 100 000 transistors, then we can make a neural network in hardware that emulates 100 000 neurons.

If we design such a chip so that we can simply make it physically bigger if we want more neurons, then it seems to me that arbitrarily large neural networks are simply a budget question.

Are we technically able to make, in hardware, arbitrarily large neural networks with current technology?

Remember: I am NOT asking if such a network will in fact be very intelligent. I am merely asking if we can factually make arbitrarily large, highly interconnected neural networks, if we decide to pay Intel to do this?

The implication is that on the day some scientist is able to create general intelligence in software, we can use our hardware capabilities to grow this general intelligence to human levels and beyond.

nbro
  • 42,615
  • 12
  • 119
  • 217
frodeborli
  • 203
  • 1
  • 8

4 Answers4

7

The approach you describe is called neuromorphic computing and it's quite a busy field.

IBM's TrueNorth even has spiking neurons.

The main problem with these projects is that nobody quite knows what to do with them yet.

These projects don't try to create chips that are optimised to run a neural network. That would certainly be possible, but the expensive part is the training not the running of neural networks. And for the training you need huge matrix multiplications, something GPUs are very good at already. (Google's TPU would be a chip optimised to run NNs.)

To do research on algorithms that might be implemented in the brain (we hardly know anything about that) you need flexibility, something these chips don't have. Also, the engineering challenge likely lies in providing a lot of synapses, just compare the average number of synapses per neuron of TrueNorth, 256, and the brain, 10,000.

So, you could create a chip designed after some neural architecture and it would be faster, more efficient, etc …, but to do that you'll need to know which architecture works first. We know that deep learning works, so google uses custom made hardware to run their applications and I could certainly imagine custom made deep learning hardware coming to a smartphone near you in the future. To create a neuromorphic chip for strong AI you'd need to develop strong AI first.

BlindKungFuMaster
  • 4,265
  • 13
  • 23
4

If neurons and synapses can be implemented using transistors,

I hope you are not talking about the neural networks which are currently winning all competitions in machine learning (MLPs, CNNs, RNNs, Deep Residual Networks, ...). Those were once used as a model for neurons, but they are only very loosely related to what happens in real brain cells.

Spiking networks should be much closer to real neurons. I've heard that the Hodgkin-Huxley model is quite realistic. However - in contrast to the models I named above - there seems not to be an effective training algorithm for spiking networks.

what prevents us from creating arbitrarily large neural networks

  • Computational resources: Training neural networks takes a lot of time. We are talking about ~12 days with a GPU cluster for some CNN models in computer vision.
  • Training data: The more variables you add to the model, the more data you need to estimate those. Neural networks are not magic. They need something they can work with.

But the fact is, we today are making silicon chips with billions of transistors on them. SPARK M7 has 10 billion transistors.

If implementing a (non-programmable) neuron and a few hundred synapses for it requires for example 100 000 transistors, then we can make a neural network in hardware that emulates 100 000 neurons.

It's not that simple:

  • Asynchonosity: Biological neural networks work asynchronously. This means one neuron might be active while all others are not active.
  • Emulation: You assume it would only need one cycle to simulate a biological neuron. However, it needs many thousand cycles. You can't simply use more computational units, because some things are not parallelizable. For example, think of the function f(x) = sin(x*x + 1). For a human, there are basically three computations: r1 = x*x, r2 = r1 + 1, r3 = sin(r2). Even if you have 3 people working on calculating the result, you will not be faster than the single fastest person in this group is. Why? Because you need the results of the last computation.
Martin Thoma
  • 1,055
  • 7
  • 17
2

While a single transistor could approximate the basic function of a single neuron, I cannot agree that any electronic element could simulate the synapses/axons. Transistors are etched on a flat surface, and could be interconnected only to adjacent or close by transistors. Axons in the brain span huge distances (compared to the size of the neuron itself), and are not restricted to a two dimensional surface. Even if we were able to approach the number of transistors on a processor to the number of neurons in a brain, we are no where near as number of connections. It could also be argued that the analogue signals in the brain carry more information per unit of time, compared to the binary impulses on a chip. Furthermore, the brain actually have plasticity i.e. connections between neurons can be weakened/discarded or straightened/created, while a CPU cannot do that.

Iliyan Bobev
  • 276
  • 2
  • 8
0

You may want to consider this list:

10 important differences between brains and computers:

  1. Brains are analog , computers are digital
  2. The brain uses content-addressable memory
  3. The brain is a massively parallel machine; computers are modular and serial
  4. Processing speed is not fixed in the brain; there is no system clock
  5. Short-term memory is not like RAM
  6. No hardware/software distinction can be made with respect to the brain or mind
  7. Synapses are far more complex than electrical logic gates
  8. Unlike computers, processing and memory are performed by the same components in the brain
  9. The brain is a self-organizing system
  10. Brains have bodies
Franck Dernoncourt
  • 3,473
  • 2
  • 21
  • 39