computer-vision conv-neural-network image-segmentation faster-rcnn detectron

Train Mask R-CNN with some classes having no masks but only bounding boxes

I want to train a model to detect three different types of defects. I have a training dataset where two of these classes contain segmentation masks, but one contains only bounding boxes. Can I train a shared model or do I need to separate the training dataset and train a Faster R-CNN and a Mask R-CNN?

(I only care about bounding box output for the class containing no masks in the training data.)

Solution

You can create 'weak' masks from those bounding boxes and then combine those two datasets. Something like below:

mask = np.zeros((256, 256), dtype=np.float32)
mask[y:y+h, x:x+w] = 255.

If the two datasets are small, combining them will yield better results. But if the datasets are big enough (>2000 images) then you can use FasterRCNN + MaskRCNN approach.

DeepSORT's Feature extractor cannot be used for Person ReIdentification
Implementing from scratch cv2.warpPerspective()
Lucas-Kanade optical flow - gradient calculation
function for sorting contours of a Sudoku, top to botton and left to right?
Why cv2 has no attribute destroyAllWindows?
Opencv: AttributeError: module 'cv2' has no attribute 'dnn'
Python - OpenCV - Is there a way to "Fill" an expected contour?
Issues with Detecting Object Borders Due to Color or Transparency in OpenCV and Feret Measurements
Yolo v8 can help to detect keypoints, but how to count the number of the keypoints that cross a specified line
OpenCV in Python cv2.solvePnP return wrong results
Image processing how to detect specific custom shape in image
How to properly calculate PSD plot (Power Spectrum Density Plot) for images in order to remove periodic noise?
How to use pt file
How to check if an image region contains text or not?
Obtain sigma of gaussian blur between two images
Horizontal Line detection and measure distance in between with OpenCV
OpenCV Undistort Cropped Image
Bilateral filter algorithm
OpenCV import error mac
After cropping a Point Cloud using pcl::CropBox filter, can't visualize that cropped Point Cloud
Albumentations intensity augmentations disrupt the image
Is it possible to load huggingface model which does not have config.json file?
How to remove the background from an image
scene change detection in video (moving cameras)
HALCON + C# - How can show selected objects with Halcon procedure in WinForms program?
Partial errors computing affine transform with OpenCV
OpenCV task of detect object in image
Measuring angle of a curve wire with OpenCV
How to verify CuDNN installation?
Does batch normalisation work with a small batch size?