Python文件使用OpenCV写入所有边界框坐标

Python文件使用OpenCV写入所有边界框坐标,python,python-3.x,opencv,machine-learning,computer-vision,Python,Python 3.x,Opencv,Machine Learning,Computer Vision,我的任务: 我的任务是提取下图的边界框坐标: 我有以下代码。我试图使用roi获得这些坐标,但我不确定如何获得它们 import cv2 import numpy as np large = cv2.imread('1.jpg') small = cv2.cvtColor(large, cv2.COLOR_BGR2GRAY) kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (3, 3)) grad = cv2.morphology

我的任务: 我的任务是提取下图的边界框坐标:

我有以下代码。我试图使用roi获得这些坐标,但我不确定如何获得它们

import cv2
import numpy as np

large = cv2.imread('1.jpg')

small = cv2.cvtColor(large, cv2.COLOR_BGR2GRAY)

kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (3, 3))
grad = cv2.morphologyEx(small, cv2.MORPH_GRADIENT, kernel)

_, bw = cv2.threshold(grad, 0.0, 255.0, cv2.THRESH_BINARY | cv2.THRESH_OTSU)

kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (9, 1))
connected = cv2.morphologyEx(bw, cv2.MORPH_CLOSE, kernel)

contours, hierarchy = cv2.findContours(connected.copy(), cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_NONE)

mask = np.zeros(bw.shape, dtype=np.uint8)

for idx in range(len(contours)):
    x, y, w, h = cv2.boundingRect(contours[idx])
    mask[y:y+h, x:x+w] = 0
    cv2.drawContours(mask, contours, idx, (255, 255, 255), -1)
    r = float(cv2.countNonZero(mask[y:y+h, x:x+w])) / (w * h)

    if r > 0.45 and w > 8 and h > 8:
        cv2.rectangle(large, (x, y), (x+w-1, y+h-1), (0, 255, 0), 1)
        roi=large[y:y+h, x:x+w]

print(roi)
结果应该是这样的:

1675,1335,2338,1338,2337,1455,1674,1452.  :Box1
3067,519,3604,521,3603,651,3066,648       :Box2
1017,721,1729,726,1728,857,1016,852       :Box3
我提到:
. 在这个链接上,当他们已经用矩形GUI作为输入的带注释的图像时,他们正在边界框内提取图像。我想将检测到的区域提取到文本文件中。如何操作?

x,y,w,h=cv2。boundingRect(轮廓[idx])
是您需要的坐标,然后将其写入txt文件:

...
with open("coords.txt","w+") as file:
    for idx in range(len(contours)):
        x, y, w, h = cv2.boundingRect(contours[idx])
        mask[y:y+h, x:x+w] = 0
        file.write("Box {0}: ({1},{2}), ({3},{4}), ({5},{6}), ({7},{8})".format(idx,x,y,x+w,y,x+w,y+h,x,y+h))
        cv2.drawContours(mask, contours, idx, (255, 255, 255), -1)
        r = float(cv2.countNonZero(mask[y:y+h, x:x+w])) / (w * h)
...
结果将为每个框包含4个点,如下所示

Box 0: (360,259), (364,259), (364,261), (360,261)
Box 1: (380,258), (385,258), (385,262), (380,262)
Box 2: (365,258), (370,258), (370,262), (365,262)
Box 3: (386,256), (393,256), (393,260), (386,260)
Box 4: (358,256), (361,256), (361,258), (358,258)