Questions tagged [video-classification]
18 questions
7
votes
3 answers
How to classify human actions?
I'm quite new to machine learning (I followed Andrew Ng's Coursera course and am now starting the deeplearning.ai courses).
I want to classify human actions in real time, such as:
Left arm bent
Arm above shoulder
...
I first did some research for…
user1007522
4
votes
1 answer
Why don't those developing AI Deepfake detectors use two detectors at once so as to catch deepfakes in one or the other?
Why don't those developing AI deepfake detectors use two differently trained detectors at once? That way, if the deepfake was trained to fool one of the detectors, the other would catch it, and vice versa.
To be clear this is really a question of can…
Ethan
3
votes
1 answer
How can I determine whether a car in a video is moving or not?
How can I classify a given sequence of images (video) as either moving or staying still from the perspective of the person inside the car?
Below is an animated example of a sequence of 12 images.
Moving from the point of the person inside the…
Naveen
3
votes
1 answer
How can I do video classification while taking into account the temporal dependencies of the frames?
I need to solve a video classification problem. While looking for solutions, I only found solutions that transform this problem into a series of simpler image classification tasks. However, this method has a downside: we ignore the temporal…
אבנר יעקב
2
votes
1 answer
Most suitable model for video classification with a fixed camera
Consider a fixed camera that records a given area. Three things can happen in this area:
No action
People performing action A
People performing action B
I want to train a model to detect when action B happens. A human observer could typically…
firion
1
vote
0 answers
Is AI good at detecting AI-generated content?
Are AI models good at detecting AI-generated image or video content such as deepfakes? Which model can we use to detect AI-generated image content?
Maciej Łoziński
1
vote
0 answers
Detecting cheats visually using AI
I really like to play my favorite 3D shooter game online. Unfortunately, it is really old and cheat protection isn't really common there, but cheaters are! It is very frustrating, because it really kills all the fun playing against…
harrow
1
vote
1 answer
Video Analysis: Providing a success score for a student carrying out a specific task
I have an AI/ML challenge in relation to video analysis and am unsure where to start.
I am investigating an application that will grade students' performance in carrying out a task, based on analysis of a video of them carrying out the task.
The…
wilson_smyth
1
vote
0 answers
Can I use ML to discover via videos the best place to shoot in foosball?
I am a programmer, but just now attempting to enter the world of ML. I'm eyeballing a potential project/problem related to foosball.
Pro foosball is a thing, believe it or not, and I'm wondering if I can use decades' worth of game footage to determine…
btd
1
vote
1 answer
I need to select the image from a predefined dataset that is the closest to the input. Is this possible, and do I even need to use ML/AI?
So, as the title states, I have a set of images, and I want to process input images and select the image that "looks" the most like the input image.
I know I've seen something similar where the code could guess whose face was in a picture, I…
taracus
1
vote
0 answers
How to handle high-dimensional video data (a large number of frames per video) for training a video classification network
I have a video dataset as follows.
Dataset size: 1k videos
Frames per video: 4k (average) and 8k (maximum)
Labels: Each video has one label.
So the size of my input will be (N, 8000, 64, 64, 3)
64 is the height and width of the video. I use Keras. I am…
manv
0
votes
0 answers
How to write a custom loss for multi-label video classification?
I am trying to train a multi-label video classification model. My dataset consists of just one video, sampled at 1fps. I have a total of 12k frames and 21 classes, and in a single frame multiple classes can be present. I added a simple…
Berk Ali Çam
0
votes
1 answer
Is background segmentation effective for improving an action recognition model on real-time human-object interaction videos?
I am working on an action recognition task involving human-object interactions using an I3D (3D CNN-based) model. The model was trained on pre-recorded videos, and it performed well during evaluation. However, when I applied it to unseen real-time…
Renat Abdrakhmanov
0
votes
0 answers
What is the current state of the art in video transformers (mainly for tasks like classification) and what are the Top 5 papers from the last 2 years?
Is there a general consensus in the community regarding
the most effective video transformer architecture
which modalities to use, how to represent them, and the best methods for fusing them
the recommended training strategies
for tasks like video…
0
votes
0 answers
Generation of text describing moving objects in video
How might I generate text messages from live video describing how objects of significance are moving (left, right, away from me, in or out of a building, etc.) without using lidar or similar to assess the objects' movement?