Most Popular

1500 questions
7
votes
1 answer

Who manufactures Google's Tensor Processing Units?

Does google manufacture TPUs? I know that google engineers are the ones responsible for the design, and that google is the one using them, but which company is responsible for the actual manufacturing of the chip?
Alecto
  • 609
  • 1
  • 7
  • 10
7
votes
1 answer

Are more than 8 high performance Nvidia GPUs practical for deep learning applications?

I was prompted towards this question while trying to find server racks and motherboards which are specialized towards artificial intelligence. Naturally I went to the SuperMicro website. There the chassis+motherboard which supported the maximum GPUs…
Rushat
  • 139
  • 8
7
votes
2 answers

How do we choose the kernel size depending on the problem?

Obviously, finding suitable hyper-parameters for a neural network is a complex task and problem or domain-specific. However, there should be at least some "rules" that hold most times for the size of the filter (or kernel)! In most cases, intuition…
7
votes
3 answers

How can I start learning mathematics for machine learning?

I am an Android programmer. Now, I would like to learn machine learning. I know it requires a mathematical background, like statistics, probability, calculus and linear algebra. However, I am a bit lost. Where should I start from? Can someone…
Anko6
  • 83
  • 5
7
votes
2 answers

How can a neural network distinguish a rotated 6 and 9 digits?

Rotated MNIST is a popular dataset for benchmarking models equivariant to rotations on $\mathbb{R}^2$, described by $SO(2)$ group or its discrete subgroups like $\mathbb{Z}^{n}$: Group equivariant convolutional networks Harmonic networks It…
7
votes
1 answer

Is LSTM a subcategory of RNN?

Is the LSTM-Architecture a subcategory of RNNs? Or are they totally different? Literature doesn't seem to be unitary on this. This figure appears to explain the models to be alternatives, but I thought of them otherwise (LSTM to be a subcategory of…
7
votes
1 answer

What are the most recent and influential breakthroughs in NLP?

I'm looking at the history of NLP, starting in the 1950s, with the Georgetown–IBM experiment. What are examples of the most recent (e.g. in the last 5-10 years) and influential breakthroughs in natural language processing?
7
votes
1 answer

How could AI solve planet's major problems?

I had been reading that AI could solve planet's major problems. How could it be done? For example, how exactly could AI be applied to address climate change? What are examples of applications of AI to solve these problems?
Shashank
  • 73
  • 3
7
votes
3 answers

What are some information processing models besides MLPs?

Feedforward or multilayered neural networks, like the one in the image above, are usually characterized by the fact that all weighted connections can be represented as a continuous real number. Furthermore, each node in a layer is connected to…
7
votes
1 answer

What exactly is an XPU?

I know about CPU, GPU and TPU. But, it is the first time for me to read about XPU from PyTorch documentation about MODULE. xpu(device=None) Moves all model parameters and buffers to the XPU. This also makes associated parameters and buffers…
hanugm
  • 4,102
  • 3
  • 29
  • 63
7
votes
3 answers

Is there a central focus on the communication methods between AI and humans?

AI is developing at a rapid pace and is becoming very sophisticated. One aspect will include the methods of interaction between AI and humans. Currently the interaction is an elementary interaction of voice and visual text or images. Is there…
7
votes
2 answers

What are the best hyper-parameters to tune in reinforcement learning?

Obviously, this is somewhat subjective, but what hyper-parameters typically have the most significant impact on an RL agent's ability to learn? For example, the replay buffer size, learning rate, entropy coefficient, etc. For example, in "normal"…
7
votes
2 answers

What makes a transformer a transformer?

Transformers are modified heavily in recent research. But what exactly makes a transformer a transformer? What is the core part of a transformer? Is it the self-attention, the parallelism, or something else?
7
votes
1 answer

Which parsing algorithm can I use for NLP question answering system?

I am currently working on my last project before graduating. For this project, I have to develop a Natural Language Question Answering System. Now, I have read quite some research papers regarding this topic and have figured out everything except…
7
votes
2 answers

Are policy gradient methods good for large discrete action spaces?

I have seen this question asked primarily in the context of continuous action spaces. I have a large action space (~2-4k discrete actions) for my custom environment that I cannot reduce down further: I am currently trying DQN approaches but was…
user9317212
  • 181
  • 2
  • 13