Most Popular

1500 questions
10
votes
1 answer

A deep learning algorithm to optimize the outcome

I'm am quite new to deep learning but I think I found just the right real-world situation to start using it. The problem is that I have only used such algorithms to predict outcomes. For my new project, I need information to feed a machine with to…
10
votes
3 answers

What do you call a machine learning system that keeps on learning?

As I understand it from this video lecture, there are three types of deep learning: Supervised Unsupervised Reinforcement All these can serve to train a neural network either only prior to its deployment or during its operating. For the latter…
10
votes
2 answers

"Attention is all you need" paper : How are the Q, K, V values calculated?

The seminal Attention is all you need paper (Google Brain team, 2017) introduces Transformers and implements the attention mecanism with "queries, keys, values", in an analogy to a retrieval system. I understand the whole process of multi-head…
Soltius
  • 311
  • 1
  • 2
  • 10
10
votes
2 answers

Are GPT-3.5 series models based on GPT-3?

In the official blog post about ChatGPT from OpenAI, there is this paragraph explaining how ChatGPT model was trained: We trained this model using Reinforcement Learning from Human Feedback (RLHF), using the same methods as InstructGPT, but with…
iMad
  • 203
  • 2
  • 5
10
votes
1 answer

What causes ChatGPT to generate responses that refer to itself as a bot or LM?

ChatGPT occasionally generates responses to prompts that refer to itself as a "bot" or "language model." For instance, when given a certain input (the first paragraph of this question) ChatGPT produces (in part) the output: It is not appropriate…
10
votes
1 answer

How can an ensemble be more accurate than the best base classifier in that ensemble?

BACKGROUND: Ensemble classifiers are said to reduce bias by taking an "average" of predictions of several base classifiers that comprise the ensemble. However, I am uncertain if this necessarily means that they can increase accuracy. My intuition…
Snehal Patel
  • 1,037
  • 1
  • 4
  • 27
10
votes
2 answers

How can I encode angle data to train neural networks?

I am training a neural network where the target data is a vector of angles in radians (between $0$ and $2\pi$). I am looking for study material on how to encode this data. Can you supply me with a book or research paper that covers this topic…
10
votes
1 answer

Are biological neurons organized in consecutive layers as well?

I'm now reading a book titled Hands-On Machine Learning with Scikit-Learn and TensorFlow and in the Chapter 10 of the book, the author writes the following: The architecture of biological neural networks (BNN)4 is still the subject of active…
Blaszard
  • 1,097
  • 4
  • 11
  • 25
10
votes
2 answers

Using AI to extend an imagine pattern

I have created some nice patterns using the MidJourney tool. I'd like to find a way to extend these patterns, and I was thinking about an AI tool that takes one of these patterns and extends it in all directions surrounding the original…
10
votes
1 answer

Is analogy necessary to artificial general intelligence?

Analogies are quite powerful in communication. They allow explaining complex concepts to people with no domain knowledge, just by mapping to a known domain. Hofstadter says they matter, whereas Dijkstra says they are dangerous. Anyway, analogies can…
Eric Platon
  • 1,510
  • 10
  • 22
10
votes
1 answer

What is the difference between the triplet loss and the contrastive loss?

What is the difference between the triplet loss and the contrastive loss? They look same to me. I don't understand the nuances between the two. I have the following queries: When to use what? What are the use cases and advantages or disadvantages…
10
votes
2 answers

Is Lisp still worth learning today in the particular context of Machine learning?

Lisp was originally created as a practical mathematical notation for computer programs, influenced by the notation of Alonzo Church's lambda calculus. It quickly became the favored programming language for artificial intelligence (AI) research,…
10
votes
2 answers

Is there a venue to publish negative results in AI/ML domain?

Negative results occur frequently in AI/ML research (and perhaps in other domains too). Most of the time, these results are not published. This is mostly because your typical AI/ML conference doesn't accept such papers. However, are there any venues…
SpiderRico
  • 1,040
  • 10
  • 18
10
votes
1 answer

Has anyone attempted to train an AI to learn all languages?

It seems that most projects attempt to teach the AI to learn individual, specific languages. It occurs to me that there are relations in written and spoken words and phrases across languages - most of use have a much easier time learning more…
mindplay.dk
  • 209
  • 1
  • 4
10
votes
3 answers

Do I need classification or regression to predict the availability of a user given some features?

While studying data mining methods I have come to understand that there are two main categories: Predictive methods: Classification Regression Descriptive methods: Clustering Association rules Since I want to predict the user availability…