Highest Voted 'video-classification' Questions - Artificial Intelligence Stack Exchange

7

votes

3 answers

How to classify human actions?

I'm quite new to machine learning (I followed Andrew Ng's Coursera course and am now starting the deeplearning.ai courses). I want to classify human actions in real time, such as: left arm bent, arm above shoulder, ... I first did some research for…

asked Dec 25 '19 at 09:56

user1007522

171
1

4

votes

1 answer

Why don't those developing AI Deepfake detectors use two detectors at once so as to catch deepfakes in one or the other?

Why don't those developing AI deepfake detectors use two differently trained detectors at once, so that if the deepfake was trained to fool one of the detectors, the other would catch it, and vice versa? To be clear, this is really a question of can…

machine-learning generative-adversarial-networks deepfakes video-classification

asked Jan 26 '21 at 01:55

Ethan

103
6

3

votes

1 answer

How can I determine whether a car in a video is moving or not?

How can I classify a given sequence of images (video) as either moving or staying still from the perspective of the person inside the car? Below is an example of the sequence of 12 images animated. Moving from the point of the person inside the…

machine-learning convolutional-neural-networks video-classification

asked Apr 15 '18 at 12:54

Naveen

153
4

3

votes

1 answer

How can I do video classification while taking into account the temporal dependencies of the frames?

I need to solve a video classification problem. While looking for solutions, I only found solutions that transform this problem into a series of simpler image classification tasks. However, this method has a downside: we ignore the temporal…

deep-learning convolutional-neural-networks reference-request video-classification action-recognition

asked Dec 22 '20 at 08:30

אבנר יעקב

33
2

2

votes

1 answer

Most suitable model for video classification with a fixed camera

Consider a fixed camera that records a given area. Three things can happen in this area: (1) no action; (2) people performing action A; (3) people performing action B. I want to train a model to detect when action B happens. A human observer could typically…

convolutional-neural-networks video-classification

asked Dec 12 '19 at 09:13

firion

269
1
7

1

vote

0 answers

Is AI good at detecting AI-generated content?

Are AI models good at detecting AI-generated image or video content like deep fakes? Which model can we use for the detection of AI-generated image content?

image-generation deepfakes video-classification

asked Nov 28 '23 at 12:41

Maciej Łoziński

111
4

1

vote

0 answers

Detecting cheats visually using AI

I really like to play my favorite 3D shooter game online. Unfortunately, it is really old and cheat protection isn't really common there, but cheaters are! It is very frustrating, because it really kills all the fun playing against…

classification training image-recognition game-ai video-classification

asked Feb 20 '22 at 00:18

harrow

111
1

1

vote

1 answer

Video Analysis: Providing a success score for a student carrying out a specific task

I have an AI/ML challenge in relation to video analysis and am unsure where to start. I am investigating an application that will grade students' performance of a task, based on analysis of a video of them carrying out the task. The…

training ai-design video-classification

asked Apr 22 '21 at 14:02

wilson_smyth

111
2

1

vote

0 answers

Can I use ML to discover via videos the best place to shoot in foosball?

I am a programmer, but am only now attempting to enter the world of ML. I'm eyeballing a potential project/problem related to foosball. Pro foosball is a thing, believe it or not, and I'm wondering if I can use decades' worth of game footage to determine…

machine-learning video-classification games-of-chance

asked Aug 04 '20 at 15:44

btd

111
1

1

vote

1 answer

I need to select the image from a predefined dataset that is closest to the input. Is this possible, and do I even need to use ML/AI?

So as the title states, I have a set of images and I want to process input images and select the image that "looks" the most like the input image. I know I've seen something similar where the code could guess whose face was in a picture, I…

classification tensorflow video-classification

asked Feb 14 '20 at 07:21

taracus

111
3

1

vote

0 answers

How to handle high-dimensional video data (a large number of frames per video) for training a video classification network

I have a video dataset as follows. Dataset size: 1k videos. Frames per video: 4k (average) and 8k (maximum). Labels: each video has one label. So the size of my input will be (N, 8000, 64, 64, 3), where 64 is the height and width of each frame. I use Keras. I am…
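
To give a rough sense of scale for the input shape quoted in this excerpt, here is a back-of-the-envelope sketch in plain Python. It takes N = 1000 from "1k videos" and pads every video to 8000 frames, as the excerpt describes. The two dtypes (uint8 and float32) and the frame stride of 50 are assumptions made purely for illustration; they do not come from the question.

```python
from math import prod


def tensor_gigabytes(shape, bytes_per_value):
    """Return the size in GB (10**9 bytes) of a dense tensor with this shape."""
    return prod(shape) * bytes_per_value / 1e9


# Shape from the excerpt: (N, frames, height, width, channels) with N = 1000.
full_shape = (1000, 8000, 64, 64, 3)

# Assumed storage types: raw 8-bit pixels vs. 32-bit floats.
print(f"uint8:   {tensor_gigabytes(full_shape, 1):.1f} GB")
print(f"float32: {tensor_gigabytes(full_shape, 4):.1f} GB")

# Hypothetical mitigation: keep only every 50th frame of each video.
stride = 50
sampled_shape = (1000, 8000 // stride, 64, 64, 3)
print(f"uint8, every {stride}th frame: {tensor_gigabytes(sampled_shape, 1):.2f} GB")
```

Under these assumptions the full padded tensor comes to roughly 98 GB as uint8 and roughly 393 GB as float32, so it is unlikely to fit in memory as a single array. Subsampling frames, or loading batches from disk with a generator, are common ways to reduce the footprint.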

deep-learning keras dimensionality video-classification

asked Dec 25 '19 at 08:26

manv

11
2

0

votes

0 answers

How to write a custom loss for multi-label video classification?

I am trying to train a multi-label video classification model. My dataset consists of just one video, sampled at 1fps. I have a total of 12k frames and 21 classes, and in a single frame multiple classes can be present. I added a simple…

objective-functions video-classification

asked Jan 09 '25 at 11:43

Berk Ali Çam

1
1

0

votes

1 answer

Is background segmentation effective for improving action recognition model on real-time human-object interaction videos?

I am working on an action recognition task involving human-object interactions using an I3D (3D CNN-based) model. The model was trained on pre-recorded videos, and it performed well during evaluation. However, when I applied it to unseen real-time…

computer-vision image-segmentation real-time video-classification action-recognition

asked Sep 24 '24 at 12:52

Renat Abdrakhmanov

23
3

0

votes

0 answers

What is the current state of the art in video transformers (mainly for tasks like classification) and what are the Top 5 papers from the last 2 years?

Is there a general consensus in the community regarding: the most effective video transformer architecture; which modalities to use, how to represent them, and the best methods for fusing them; the recommended training strategies for tasks like video…

computer-vision transformer video-classification

asked Sep 04 '24 at 13:50

user27192192

1

0

votes

0 answers

Generation of text describing moving objects in video

How might I generate text messages from live video describing how objects of significance are moving (left, right, away from me, in or out of a building, etc.) without using lidar or similar to assess the objects' movement?

deep-learning text-generation video-classification

asked Jul 18 '24 at 10:24

Nicholas Walton

1
