Search code examples
pythonopencvimage-processingscikit-image

Detect rectangles in an image with Python


I have a large collection of binary images like these ones:

sample image 1 sample image 2 sample image 3 sample image 4

On each image I need to detect the white rectangle. The rectangles have different dimensions and orientations and sometimes they are interrupted by a black line (see image 2).

I think the problem is easy to solve if one could remove the noisy background. Thus, I tried first using OpenCV's filter2D function:

import cv2

img = cv2.imread(file_name)
mean_filter_kernel = np.ones((5,5),np.float32)/(5*5)
filtered_image = cv2.filter2D(image,-1,mean_filter_kernel)

But this doesn't seem to have any effect, probably because I'm not dealing with a gray scale image.

Next I thought about detecting contours and filling all contours black which have a small size:

import cv2

img = cv2.imread(file_name)
blurred = cv2.GaussianBlur(img, (5, 5), 0)
canny = cv2.Canny(blurred, 100, 50)
contours, _ = cv2.findContours(canny, cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)

for cnt in contours:
    area = cv2.contourArea(cnt)
    if area > 1000:
        continue
    cv2.drawContours(img, [cnt], -1, 0, -1)
    

This helps a bit to remove the noise but it is far from perfect.

Does anybody have an idea how to improve my approach or is there a way to directly detect rectangles without removing the background noise?


Solution

  • Applying multi filtering and edge detection and thresholding will give you an acceptable results, you can improve the results using morphology or some math to determine the corners and the angle of your rectangle.

    result

    import numpy as np
    import matplotlib.pyplot as plt
    from skimage.io import imread
    from skimage.filters import median, gaussian, threshold_otsu, sobel
    from skimage.morphology import binary_erosion
    
    orig = imread('4.png',as_gray=True)
    img = orig.copy()
    img[img<1] = 0
    
    gauss = gaussian(img, sigma=3)
    
    SE = np.ones((7,7))
    med = median(gauss, selem=SE)
    
    edges = sobel(med)
    
    thresh = threshold_otsu(edges)
    binary = edges > thresh
    
    SE2 = np.ones((3,3))
    result = binary_erosion(binary, selem=SE2)
    
    plt.subplot(121)
    plt.imshow(orig, cmap='gray')
    plt.axis('off')
    plt.subplot(122)
    plt.imshow(result, cmap='gray')
    plt.axis('off')
    plt.show()