Highest Voted 'semantic-segmentation' Questions - Artificial Intelligence Stack Exchange

4

votes

1 answer

How does Mask R-CNN automatically output a different number of objects on the image?

Recently, I was reading Pytorch's official tutorial about Mask R-CNN. When I run the code on colab, it turned out that it automatically outputs a different number of channels during prediction. If the image has 2 people on it, it would output a mask…

asked May 08 '21 at 07:33

dato nefaridze

882
10
22

4

votes

3 answers

If I trained a model to perform semantic segmentation on images with only one object, would it also work on images with multiple objects?

I'm working on semantic segmentation tasks in the medical space using the U-Net. Let's say that I train a U-Net model on medical images with the goal of segmenting out, say, ligaments, from a medical image. If I train that model on images that…

computer-vision image-segmentation u-net instance-segmentation semantic-segmentation

asked Jan 12 '20 at 22:11

cmed123

141
2

2

votes

0 answers

How do I use a 2D image segmentation model on 3D medical imaging data?

I am trying to use a high-level semantic segmentation model (something like DeepLabv3), that takes in 2D RGB images, and then fine-tune it for my problem. However, I am working with brain MRI images which are grayscale 3D images. The obvious…

semantic-segmentation 3d-convolution

asked Dec 11 '23 at 18:53

Tarek Alkabbani

21
1

2

votes

0 answers

Why CNN inference works on larger images

I have been reading up on 'regular' CNN's such as Mask R-CNN, and as far as I understand it they rely on a fully connected layer in the end to classify pixels. FCN's (such as U-Net) which do not use these layers are able to effectively process…

convolutional-neural-networks semantic-segmentation

asked Feb 26 '22 at 12:01

Cartman123

121
2

2

votes

0 answers

What is meant by Hinton when he refers to "Part-Whole Hierarchies" in his GLOM framework

I was recently reading Hinton's GLOM idea How to represent part-whole hierarchies in a neural network, and I am simply unsure about what exactly he means when he says parsing images into "part-whole hierarchies". Moreover, wouldn't semantic…

neural-networks terminology papers semantic-segmentation

asked Jun 02 '21 at 01:16

ElectricMinimum58

21
1

2

votes

1 answer

What does the "number of channels" correspond to in U-Net?

I'm studying the U-Net CNN architecture. I'm new to CNNs and am confused regarding the "number of channels". Referring to the U-Net diagram, the input image is convolved with a 3x3 mask which generates a 570x570 output. This output image is then…

convolutional-neural-networks u-net semantic-segmentation

asked Apr 29 '21 at 02:56

Izzo

123
1
5

2

votes

0 answers

Semantic segmentation - background or ignore for non-target classes?

I am training a deep learning model for semantic segmentation. I am using the cityscapes dataset for training/evaluation. In cityscapes, there are 34 classes, and of which, we consider only 19 classes and the rest of the classes are ignored. For…

image-segmentation semantic-segmentation

asked Apr 14 '21 at 14:16

147956

33
2

2

votes

1 answer

What are the state-of-the-art Person-Detektion / Human-Segmentation?

I would like to use a deep learning approach to detect people in videos. I have found some freely accessible implementations like Human Segementation with Pytorch or BodyPix / DeepLab / Pixellib with Tensorflow. They all work well, but with many it…

convolutional-neural-networks object-detection image-segmentation semantic-segmentation

asked Feb 25 '21 at 11:17

miimi

21
1

1

vote

1 answer

How do I input multi-channel Numpy array to U-net for semantic segmentation

I had lidar 3D point cloud data from semantckitti. I want to perform Semantic Segmentation on the data using U-Net. I converted the 3d point cloud data into 2D using spherical conversion and saved the original point cloud data which was in (.bin…

neural-networks deep-learning tensorflow u-net semantic-segmentation

asked Mar 28 '24 at 12:20

Leibniz 24

11
1

1

vote

1 answer

Why do we do need compression in Semantic Segmentation?

When doing semantic segmentation, we often make use of FCN, which can be thought of in two parts: an encoder and decoder. As I understand, the encoder compresses the image into a spatially small, but high number of channels. The decoder then uses…

convolutional-neural-networks semantic-segmentation encoder-decoder fully-convolutional-networks

asked Nov 24 '23 at 21:25

Dude156

113
4

1

vote

0 answers

Resizing segmentation masks

I am trying to run a semantic segmentation model. The problem is, my data has different resolution for every image and for corresponding segmentation map (image and corresponding segmentation map have same aspect ratio. But aspect ratio changes for…

deep-learning image-processing semantic-segmentation

asked Apr 24 '23 at 20:47

programmer_04_03

73
8

1

vote

1 answer

How to identify and diferentiate several edge lines of an object?

I want to create an AI to detect and identify certain edge lines on my image. The input image is a locker key, and I want to know the exact position of certain edges. Sample input image: Sample output image. As output i have each red line…

neural-networks image-segmentation yolo semantic-segmentation edge-detection

asked Jun 06 '22 at 12:17

Lluis C

111
3

1

vote

1 answer

How does the classification head of EfficientDet work?

EfficientDet outputs classes and bounding boxes. My question is about both but specifically I am interested in the class prediction net part. In the paper's diagram it shows 2 conv layers. I don't understand the code and how it works. And what's the…

deep-learning computer-vision object-detection object-recognition semantic-segmentation

asked Dec 25 '21 at 14:41

ML Dev

11
3

1

vote

3 answers

What is the difference (if any) between semantic segmentation and multi-class, mutually exclusive classification?

Multi-class classification is simply assigning all data points into one of up to any finite number of mutually exclusive labels. I am new to the field(s) of AI/ML and I keep hearing people use the term "semantic segmentation." I want to "translate"…

classification semantic-segmentation semantics multiclass-classification labels

asked Jun 07 '21 at 14:54

tdMJN6B2JtUe

115
7

1

vote

0 answers

Pixel values of segmap in multi-class semantic segmentation

I'm preparing a dataset for a multiclass semantic segmentation using U-Net like architecture. To be precise, I've got it ready but a question came to my mind. How does pixel values of a segmentation map influence the training? Also, is it better to…

convolutional-neural-networks keras datasets u-net semantic-segmentation

asked May 20 '21 at 18:12

Nuwanda

11
2

Questions tagged [semantic-segmentation]