Highest Voted Questions - Artificial Intelligence Stack Exchange

10

votes

1 answer

A deep learning algorithm to optimize the outcome

I'm am quite new to deep learning but I think I found just the right real-world situation to start using it. The problem is that I have only used such algorithms to predict outcomes. For my new project, I need information to feed a machine with to…

deep-learning

asked Aug 31 '17 at 17:16

Frank Malenfant

131
6

10

votes

3 answers

What do you call a machine learning system that keeps on learning?

As I understand it from this video lecture, there are three types of deep learning: Supervised Unsupervised Reinforcement All these can serve to train a neural network either only prior to its deployment or during its operating. For the latter…

neural-networks deep-learning terminology incremental-learning

asked Aug 29 '17 at 08:57

ZakC

347
2
7

10

votes

2 answers

"Attention is all you need" paper : How are the Q, K, V values calculated?

The seminal Attention is all you need paper (Google Brain team, 2017) introduces Transformers and implements the attention mecanism with "queries, keys, values", in an analogy to a retrieval system. I understand the whole process of multi-head…

transformer attention word-embedding

asked Feb 14 '23 at 10:37

Soltius

311
1
2
10

10

votes

2 answers

Are GPT-3.5 series models based on GPT-3?

In the official blog post about ChatGPT from OpenAI, there is this paragraph explaining how ChatGPT model was trained: We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with…

open-ai fine-tuning chatgpt gpt-3

asked Feb 02 '23 at 16:40

iMad

203
2
5

10

votes

1 answer

What causes ChatGPT to generate responses that refer to itself as a bot or LM?

ChatGPT occasionally generates responses to prompts that refer to itself as a "bot" or "language model." For instance, when given a certain input (the first paragraph of this question) ChatGPT produces (in part) the output: It is not appropriate…

chat-bots training-datasets language-model gpt-3 chatgpt

asked Dec 16 '22 at 08:58

Obie 2.0

202
1
8

10

votes

1 answer

How can an ensemble be more accurate than the best base classifier in that ensemble?

BACKGROUND: Ensemble classifiers are said to reduce bias by taking an "average" of predictions of several base classifiers that comprise the ensemble. However, I am uncertain if this necessarily means that they can increase accuracy. My intuition…

ensemble-learning

asked Nov 27 '22 at 14:21

Snehal Patel

1,037
1
4
27

10

votes

2 answers

How can I encode angle data to train neural networks?

I am training a neural network where the target data is a vector of angles in radians (between $0$ and $2\pi$). I am looking for study material on how to encode this data. Can you supply me with a book or research paper that covers this topic…

neural-networks reference-request training datasets data-preprocessing

asked Nov 27 '22 at 03:47

user366312

341
1
13

10

votes

1 answer

Are biological neurons organized in consecutive layers as well?

I'm now reading a book titled Hands-On Machine Learning with Scikit-Learn and TensorFlow and in the Chapter 10 of the book, the author writes the following: The architecture of biological neural networks (BNN)4 is still the subject of active…

reference-request neurons neuroscience

asked Aug 01 '17 at 17:16

Blaszard

1,097
4
11
25

10

votes

2 answers

Using AI to extend an imagine pattern

I have created some nice patterns using the MidJourney tool. I'd like to find a way to extend these patterns, and I was thinking about an AI tool that takes one of these patterns and extends it in all directions surrounding the original…

ai-design pattern-recognition image-generation diffusion-models

asked Sep 28 '22 at 08:10

Nicola Lepetit

203
2
8

10

votes

1 answer

Is analogy necessary to artificial general intelligence?

Analogies are quite powerful in communication. They allow explaining complex concepts to people with no domain knowledge, just by mapping to a known domain. Hofstadter says they matter, whereas Dijkstra says they are dangerous. Anyway, analogies can…

ai-design philosophy agi

asked Jul 18 '17 at 02:48

Eric Platon

1,510
10
22

10

votes

1 answer

What is the difference between the triplet loss and the contrastive loss?

What is the difference between the triplet loss and the contrastive loss? They look same to me. I don't understand the nuances between the two. I have the following queries: When to use what? What are the use cases and advantages or disadvantages…

comparison objective-functions siamese-neural-network contrastive-learning triplet-loss-function

asked Jun 18 '22 at 19:00

Exploring

371
7
18

10

votes

2 answers

Is Lisp still worth learning today in the particular context of Machine learning?

Lisp was originally created as a practical mathematical notation for computer programs, influenced by the notation of Alonzo Church's lambda calculus. It quickly became the favored programming language for artificial intelligence (AI) research,…

machine-learning deep-learning history programming-languages lisp

asked Jun 23 '17 at 13:06

Maheshwar Ligade

826
11
21

10

votes

2 answers

Is there a venue to publish negative results in AI/ML domain?

Negative results occur frequently in AI/ML research (and perhaps in other domains too). Most of the time, these results are not published. This is mostly because your typical AI/ML conference doesn't accept such papers. However, are there any venues…

research academia

asked Sep 07 '21 at 20:29

SpiderRico

1,040
10
18

10

votes

1 answer

Has anyone attempted to train an AI to learn all languages?

It seems that most projects attempt to teach the AI to learn individual, specific languages. It occurs to me that there are relations in written and spoken words and phrases across languages - most of use have a much easier time learning more…

natural-language-processing

asked Apr 06 '17 at 06:52

mindplay.dk

209
1
4

10

votes

3 answers

Do I need classification or regression to predict the availability of a user given some features?

While studying data mining methods I have come to understand that there are two main categories: Predictive methods: Classification Regression Descriptive methods: Clustering Association rules Since I want to predict the user availability…

machine-learning ai-design classification prediction regression

asked Apr 01 '17 at 14:19

Guest2000

305
1
4

Most Popular