Most Popular

1500 questions
10 votes, 2 answers

Reinforcement Learning with asynchronous feedback

I want suggestions for literature on Reinforcement Learning algorithms that perform well with asynchronous feedback from the environment. What I mean by asynchronous feedback is that when an agent performs an action, it gets feedback (reward or regret)…
papabiceps
10 votes, 4 answers

What are the models that have the potential to replace neural networks in the near future?

Are there possible models that have the potential to replace neural networks in the near future? And do we even need that? What is the worst thing about using neural networks in terms of efficiency?
10 votes, 6 answers

Why does Stephen Hawking say "Artificial Intelligence will kill us all"?

This quote by Stephen Hawking has been in headlines for quite some time: Artificial Intelligence could wipe out humanity when it gets too clever as humans will be like ants. Why does he say this? To put it simply: what are the possible threats…
Soham
10 votes, 3 answers

Why does Monte Carlo work when a real opponent's behavior may not be random?

I am learning about Monte Carlo algorithms and struggling to understand the following: If simulations are based on random moves, how can the modeling of the opponent's behavior work well? For example, if I have a node with 100 children, 99 of…
kgautron
10 votes, 4 answers

Can an AI be trained to generate the outline of a story?

I know that one of the recent fads is to train a neural network to generate screenplays and new episodes of, e.g., Friends or The Simpsons, and that's fine: it's interesting and might be the necessary first steps toward making programs…
bsideswiped
10 votes, 3 answers

Back-of-the-envelope machine learning (specifically neural networks) calculations

There is a popular story regarding the back-of-the-envelope calculation performed by a British physicist named G. I. Taylor. He used dimensional analysis to estimate the power released by the explosion of a nuclear bomb, simply by analyzing a…
Charles
10 votes, 1 answer

What is the difference between expected return and value function?

I've seen numerous mathematical explanations of reward, value functions $V(s)$, and return functions. The reward provides an immediate return for being in a specific state. The better the reward, the better the state. As I understand it, it can be…
10 votes, 2 answers

Why should the number of neurons in a hidden layer be a power of 2?

I have read somewhere on the web (I lost the reference) that the number of units (or neurons) in a hidden layer should be a power of 2 because it helps the learning algorithm to converge faster. Is this a fact? If it is, why is this true? Does it…
10 votes, 2 answers

Does placing tires on aircraft help prevent AI-assisted attacks?

I see in the screenshots below from some footage of Operation Spider's Web (Ukrainians blowing up Russian airplanes with drones on 2025-06-01) that tires were placed on planes. Some people say that this is to confuse AI-assisted attacks.
Franck Dernoncourt
10 votes, 4 answers

Relevance of genetic algorithms in modern research

I am considering pursuing a career in AI (I currently have an undergraduate background in Philosophy/Computer Science) and have been taking some time to research particular topics. One class of methods that piqued my interest was the genetic algorithm.…
10 votes, 3 answers

How should I represent the input to a neural network for the games of tic-tac-toe, checkers or chess?

I've been reading a lot about TD-Gammon recently as I'm exploring options for AI in a video game I'm making. The video game is a turn-based positional sort of game, i.e. a unit's (or game piece's) position will greatly impact its usefulness in that…
10 votes, 2 answers

How to design a neural network when the number of inputs is variable?

I'm looking to design a neural network that can predict which runner wins in a sports game, where the number of runners varies between 2 and 10. In each case, specific data about the individual runners (for example, the weight, height, average speed in…
10 votes, 3 answers

What is the difference between encoders and auto-encoders?

How are the layers in an encoder connected across the network for normal encoders and auto-encoders? In general, what is the difference between encoders and auto-encoders?
m2rik
10 votes, 1 answer

Why are biases (typically) not used in the attention mechanism?

I was watching this video on implementing attention in a transformer. The presenter set the query, key, and value biases to False and said "Typically, people don't use biases for these". Even in the official PyTorch code the default bias is False: add_bias_kv: If specified,…
10 votes, 4 answers

How do I select the relevant features of the data?

Recently I was working on a problem involving cost analysis of my expenditure on a particular resource. I usually make some manual decisions from the analysis and plan accordingly. I have a big data set in Excel format and with hundreds of…