Questions tagged [video-classification]

18 questions
7
votes
3 answers

How to classify human actions?

I'm quite new to machine learning (I followed the Coursera course of Andrew Ng and now starting deeplearning.ai courses). I want to classify human actions real-time like: Left-arm bended Arm above shoulder ... I first did some research for…
4
votes
1 answer

Why don't those developing AI Deepfake detectors use two detectors at once so as to catch deepfakes in one or the other?

Why don't those developing AI Deepfake detectors use two differently trained detectors at once that way if the Deepfake was trained to fool one of the detectors the other would catch it and vice-versa? To be clear this is really a question of can…
3
votes
1 answer

How can I determine whether a car in a video is moving or not?

How can I classify a given sequence of images (video) as either moving or staying still from the perspective of the person inside the car? Below is an example of the sequence of 12 images animated. Moving from the point of the person inside the…
3
votes
1 answer

How can I do video classification while taking into account the temporal dependencies of the frames?

I need to solve a video classification problem. While looking for solutions, I only found solutions that transform this problem into a series of simpler image classification tasks. However, this method has a downside: we ignore the temporal…
2
votes
1 answer

Most suitable model for video classification with a fixed camera

Consider a fixed camera that records a given area. Three things can happen in this area: No action People performing action A People performing action B I want to train a model to detect when action B happens. A human observer could typically…
1
vote
0 answers

Is AI good at detecting AI-generated content?

Are AI models good at detecting AI-generated image or video content like deep fakes? Which model can we use for the detection of AI-generated image content?
1
vote
0 answers

Detecting cheats visually using AI

I really like to play my favorite 3D shooter game online. Unfortunately, it is really old and cheat protection isn't really common there, but cheaters are! It is very frustrating, because it really kills all the fun playing against…
1
vote
1 answer

Video Analysis: Providing a success score for a of a student carrying out a specific task

I have an AI/ML challenge in relation to video analysis and am unsure where to start. I am investigating an application that will grade students performance of carrying out a task, based on analysis of a video of them carrying out the task. The…
wilson_smyth
  • 111
  • 2
1
vote
0 answers

Can I use ML to discover via videos the best place to shoot in foosball?

I am a programmer, but just now attempting to enter the world of ML. I'm eyeballing a potential project/problem related to foosball. Pro foosball is a thing believe it or not and I'm wondering if I can use decades worth of game footage to determine…
1
vote
1 answer

I need to select the image from a predefined dataset that are the closest to the input, is this possible or do I even need to use ML/AI?

So as the title states, I have a set of images and I want to process input images and need to select the image that "looks" the most like the input image. I know I've seen something similar where the code could guess who's face was in a picture, I…
taracus
  • 111
  • 3
1
vote
0 answers

How to handle a high dimensional video (large number of frames per video) data for training a video classification network

I have a video dataset as follows. Dataset size: 1k videos Frames per video: 4k (average) and 8k (maximum) Labels: Each video has one label. So the size of my input will be (N, 8000, 64, 64, 3) 64 is height and width of video. I use keras. I am…
0
votes
0 answers

How to write a custom loss for multi-label video classification?

I am trying to train a multi-label video classification model. My dataset consists of just one video, sampled at 1fps. I have a total of 12k frames and 21 classes, and in a single frame multiple classes can be present. I added a simple…
0
votes
1 answer

Is background segmentation effective for improving action recognition model on real-time human-object interaction videos?

I am working on an action recognition task involving human-object interactions using an I3D (3D CNN-based) model. The model was trained on pre-recorded videos, and it performed well during evaluation. However, when I applied it to unseen real-time…
0
votes
0 answers

What is the current state of the art in video transformers (mainly for tasks like classification) and what are the Top 5 papers from the last 2 years?

Is there a general consensus in the community regarding the most effective video transformer architecture which modalities to use, how to represent them, and the best methods for fusing them the recommended training strategies for tasks like video…
0
votes
0 answers

Generation of text describing moving objects in video

How might I generate text messaging from live video describing how objects of significance are moving, left, right, away from me, in or out of a building etc., without using lidar or similar to assess the objects movement?
1
2