Highest Voted 'image-processing' Questions - Artificial Intelligence Stack Exchange

12

votes

2 answers

Is there any existing attempt to create a deep learning model which extracts vector paths from bitmaps?

I need an algorithm to trace simple bitmaps, which only contain paths with a given stroke width. Is there any existing attempt to create a deep learning model which extracts vector paths from bitmaps? It is obviously very easy to generate bitmaps…

asked Nov 20 '19 at 16:45

arthur.sw

171
1
8

8

votes

3 answers

Is it okay to use publicly available Instagram videos to train an AI?

Since I haven't found any good training data for my university project, I want to use pictures and videos from public Instagram profiles. Am I allowed to do that?

computer-vision training datasets research image-processing

asked Sep 22 '21 at 12:04

Bert Gayus

645
1
5
12

8

votes

2 answers

What are the main algorithms used in computer vision?

Nowadays, CV has really achieved great performance in many different areas. However, it is not clear what a CV algorithm is. What are some examples of CV algorithms that are commonly used nowadays and have achieved state-of-the-art performance?

computer-vision reference-request image-processing algorithm-request model-request

asked Jun 17 '20 at 15:12

Pluviophile

1,293
7
20
40

7

votes

3 answers

Does each filter in each convolution layer create a new image?

Say I have a CNN with this structure: input = 1 image (say, 30x30 RGB pixels) first convolution layer = 10 5x5 convolution filters second convolution layer = 5 3x3 convolution filters one dense layer with 1 output So a graph of the network will…

convolutional-neural-networks image-processing convolution hidden-layers convolution-arithmetic

asked Dec 09 '19 at 14:26

RocketNuts

205
2
6

5

votes

1 answer

In OCR, how should I deal with the warped text on the sides of oval objects?

Consider an image that contains one can (or bottle, or any similar oval object), which has texts all over it. In the image below, I have many bottles, but you can assume that each image only contains one such object. As we can see, in each can, the…

python image-processing data-preprocessing optical-character-recognition

asked Jan 06 '21 at 09:09

Red

175
6

5

votes

1 answer

Autoencoder produces repeated artifacts after convergence

As experiment, I have tried using an autoencoder to encode height data from the alps, however the decoded image is very pixellated after training for several hours as show in the image below. This repeating patter is larger than the final kernel…

convolutional-neural-networks autoencoders image-processing

asked Jan 08 '20 at 22:01

Yadeses

231
2
5

4

votes

5 answers

Can an AI generated image (such as pic of human face) be detected that it's AI generated?

AIs are getting better and better at creating images and art. Some of the stuff is almost impossible to be detected by the naked eye. But what about programs and algorithms? Instead of creating an image, can anything detect that this image was…

image-recognition image-processing human-like face-recognition face-detection

asked Jul 13 '22 at 05:53

No Name

141
2

4

votes

1 answer

What is the stride information of an image referring here?

In convolutional neural networks, the convolution and pooling operations have a parameter known as stride, which decides the amount of jump the kernel needs to do on the input image. You can get more information regarding stride from follows taken…

deep-learning convolutional-neural-networks terminology image-processing stride

asked Nov 26 '21 at 22:37

hanugm

4,102
3
29
63

4

votes

1 answer

What is the state-of-the-art algorithm for neural style transfer?

I've read the paper A Neural Algorithm of Artistic Style by Gatys et. al. and I find the application of neural style transfer very fun. I also read that Exploring the structure of a real-time, arbitrary neuralartistic stylization network by Ghiasi…

deep-learning computer-vision deep-neural-networks image-processing image-generation

asked Aug 06 '20 at 14:47

DeepNet

41
2

4

votes

1 answer

How to evaluate the performance of an autoencoder trained on image data?

I am training an autoencoder on (general) image data. I use binary crossentropy loss function, but it is not very informative when I want to evaluate the performance of my autoencoder. An obvious performance metric would be pixel-wise MSE, but it…

autoencoders image-processing

asked May 05 '20 at 14:49

nim.py

160
8

4

votes

1 answer

How to calculate the size of a 3d object from an image?

I am wondering how to calculate the size of a 3d object in an image without knowing the focal length of the camera but the distance from the camera to the object.

machine-learning computer-vision image-processing

asked Nov 14 '19 at 01:06

Blackreaved

43
5

4

votes

1 answer

Video engagement analysis with deep learning

I am trying to rank video scenes/frames based on how appealing they are for a viewer. Basically, how "interesting" or "attractive" a scene inside a video can be for a viewer. My final goal is to generate say a 10-second short summary given a video…

neural-networks deep-learning classification computer-vision image-processing

asked Aug 27 '19 at 03:39

Mary

993
6
13

4

votes

1 answer

Turn photos right-side up?

I'm looking for either an existing AI app or a pre-trained NN that will tell me if a photograph is right-side up or not. I want to use this to create an application that automatically rotates photos so they are right-side-up. This doesn't seem…

image-processing

asked Aug 24 '19 at 19:25

vy32

141
2

4

votes

1 answer

Why do we get a three-dimensional output after a convolutional layer?

In a convolutional neural network, when we apply the convolution on a $5 \times 5$ image with $3 \times 3$ kernel, with stride $1$, we should get only one $4 \times 4$ as output. In most of the CNN tutorials, we are having $4 \times 4 \times m$ as…

convolutional-neural-networks computer-vision image-processing convolution

asked Aug 16 '19 at 05:47

Prabu M

43
2

4

votes

1 answer

Aesthetics analysis with deep learning

I'm trying to score video scenes in terms of aesthetics and cinematography features. Basically, how "interesting" a scene or video frame can be for a viewer. Simpler, how attractive a scene is. My final goal is to tag intervals of video which can be…

neural-networks deep-learning computer-vision image-processing art-aesthetics

asked Aug 15 '19 at 21:45

Mary

993
6
13

Questions tagged [image-processing]