Most Popular
1500 questions
5
votes
1 answer
What kind of algorithm is used by StackGAN to generate realistic images from text?
What kind of algorithm is used by StackGAN to generate realistic images from text? How does StackGAN work?
Aneesh bhat
- 53
- 4
5
votes
2 answers
Feasibility of generating large images with a convnet
I've spent the past couple of months learning about neural networks, and am thinking of projects that would be fun to work on to cement my understanding of this tech.
One thing that came to mind last night is a system that takes an image of a movie…
JR Heard
- 98
- 5
5
votes
1 answer
What is the difference between out of distribution detection and anomaly detection?
I'm currently reading the paper Likelihood Ratios for Out-of-Distribution Detection, and it seems that their problem is very similar to the problem of anomaly detection. More precisely, given a neural network trained on a dataset consisting of…
Mahmoud
- 243
- 2
- 7
5
votes
1 answer
Do AlphaZero/MuZero learn faster in terms of number of games played than humans?
I don't know much about AI and am just curious.
From what I read, AlphaZero/MuZero outperform any human chess player after a few hours of training. I have no idea how many chess games a very talented human chess player on average has played before…
220284
- 153
- 4
5
votes
1 answer
What should the initial UCT value be with MCTS, when leaf's simulation count is zero? Infinity?
I am implementing a Monte Carlo Tree Search algorithm, where selection is done using the Upper Confidence Bound formula:
def uct(state):
    log_n = math.log(state.parent.sim_count)
    explore_term = self.exploration_weight *…
semyd
- 153
- 1
- 6
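For the MCTS question above, a minimal sketch of one common convention: give a child with zero simulations a score of infinity, so every child is tried once before the formula is applied. The function name `uct_score` and its parameters are assumptions for illustration, not the asker's code; the default exploration weight of sqrt(2) is likewise just a conventional choice.

```python
import math

def uct_score(child_visits, child_total_reward, parent_visits,
              exploration_weight=math.sqrt(2)):
    """UCB1-style score for one child node (hypothetical helper)."""
    # An unvisited child would cause a division by zero below, so it is
    # given +infinity, which forces it to be selected before visited siblings.
    if child_visits == 0:
        return math.inf
    exploit_term = child_total_reward / child_visits
    explore_term = exploration_weight * math.sqrt(
        math.log(parent_visits) / child_visits
    )
    return exploit_term + explore_term
```

With this convention, selection reduces to taking the child with the highest score, and ties between several unvisited children can be broken arbitrarily.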
5
votes
1 answer
How do I design a neural network that breaks a 5-letter word into its corresponding syllables?
I am going to design a neural network which will be able to break a 5-letter word into its corresponding syllables (hybrid syllables, I mean it will not strictly adhere to grammatical syllable rules but will be based on some training sets I…
Programmer
- 174
- 6
5
votes
2 answers
Is it really possible to create the "Perfect Cylinder" used in the Universal Approximation Theorem for a 1-hidden-layer Neural Network?
There are proofs for the universal approximation theorem with just 1 hidden layer.
The proof goes like this:
Create a "bump" function using 2 neurons.
Create (infinitely) many of these step functions with different angles in order to create a…
KoKlA
- 133
- 6
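The "bump" step in the question above can be sketched in one dimension as the difference of two steep sigmoid neurons. This is only an illustration under assumed names and values (`bump`, `left`, `right`, `steepness`), not the construction from any particular proof.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bump(x, left=0.0, right=1.0, steepness=50.0):
    """Approximate indicator of [left, right] built from two sigmoid neurons."""
    # The first neuron switches on near `left`, the second near `right`;
    # subtracting them leaves a plateau close to 1 between the two points.
    return sigmoid(steepness * (x - left)) - sigmoid(steepness * (x - right))
```

The edges only become perfectly sharp in the limit of infinite steepness, which is the crux of the question about whether a "perfect" shape is achievable.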
5
votes
1 answer
Why does regular Q-learning (and DQN) overestimate the Q values?
The motivation for the introduction of double DQN (and double Q-learning) is that the regular Q-learning (or DQN) can overestimate the Q value, but is there a brief explanation as to why it is overestimated?
ground clown
- 111
- 3
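One way to see the overestimation asked about above is that the max over noisy estimates is biased upward even when each estimate is unbiased. The sketch below assumes all true action values are 0 and adds zero-mean Gaussian noise; the function name and defaults are hypothetical.

```python
import random

def average_max_bias(num_actions=5, noise_std=1.0, trials=10_000, seed=0):
    """Average of max_a Q_hat(a) when every true Q(a) is 0."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        # Each estimate is the true value (0) plus zero-mean noise.
        estimates = [rng.gauss(0.0, noise_std) for _ in range(num_actions)]
        total += max(estimates)
    return total / trials
```

With a single action the average stays near 0, while with several actions it is clearly positive, even though the true maximum is 0 in both cases.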
5
votes
1 answer
Can games be solved without an evaluation function?
Fundamentally, a game-playing AI must solve the problem of choosing the best action from a set of possible actions.
Most existing game AIs, such as AlphaGo, do this by using an evaluation function, which maps game states to real numbers. The real…
dshin
- 161
- 5
5
votes
1 answer
In OCR, how should I deal with the warped text on the sides of oval objects?
Consider an image that contains one can (or bottle, or any similar oval object) with text all over it. In the image below, I have many bottles, but you can assume that each image only contains one such object.
As we can see, in each can, the…
Red
- 175
- 6
5
votes
1 answer
Research into social behavior in Prisoner's Dilemma
I've been working on research into reproducing social behavior using multi-agent reinforcement learning. My focus has been on a GridWorld-style game, but I was thinking that maybe a simpler Prisoner's Dilemma game could be a better approach. I tried…
Ram Rachum
- 260
- 1
- 11
5
votes
1 answer
Is there anything novel about Zuckerberg's Jarvis?
Recently Mark got some attention from the media by stating that he had created Jarvis. Not that I'm against him or anything, but this Jarvis seems to have been done a hundred times before. He's done something which most developers would classify as…
Dylan Dsouza
- 51
- 1
5
votes
4 answers
What is the fundamental difference between an ML model and a function?
A model can be roughly defined as any design that is able to solve an ML task. Examples of models are the neural network, decision tree, Markov network, etc.
A function can be defined as a set of ordered pairs with one-to-many mapping from a domain…
hanugm
- 4,102
- 3
- 29
- 63
5
votes
1 answer
Why are multiplayer, imperfect-information, trick-taking card games hard for AI?
AI has reached a super-human level in many complex games such as Chess, Go, Texas hold'em Poker, Dota2 and StarCraft2. However, it has not yet reached this level in trick-taking card games.
Why is there no super-human AI playing imperfect-information,…
Cohensius
- 423
- 3
- 15
5
votes
1 answer
Wasserstein GAN: Implementation of Critic Loss Correct?
The WGAN paper concretely proposes Algorithm 1 (cf. page 8). Now, they also state what their loss for the critic and the generator is.
When implementing the critic loss (so lines 5 and 6 of Algorithm 1), they maximize with respect to the parameters $w$ (instead of…
Anonymous5638
- 171
- 6
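For the WGAN question above, a minimal sketch of the usual sign convention: the critic objective (mean score on real samples minus mean score on generated samples) is maximized over $w$, so implementations that use a minimizing optimizer typically minimize its negation. The function below works on plain lists of critic outputs; its name and arguments are assumptions for illustration.

```python
def critic_loss(real_scores, fake_scores):
    """Negated critic objective, suitable for a minimizing optimizer."""
    mean_real = sum(real_scores) / len(real_scores)
    mean_fake = sum(fake_scores) / len(fake_scores)
    # Maximizing (mean_real - mean_fake) equals minimizing (mean_fake - mean_real).
    return mean_fake - mean_real
```

A critic that scores real samples higher than generated ones therefore yields a negative loss under this convention.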