Image 去除历史文档中的噪声和污点，以便进行OCR识别_Image_Opencv_Image Processing_Ocr_Noise

Image 去除历史文档中的噪声和污点，以便进行OCR识别

image opencv image-processing

Image 去除历史文档中的噪声和污点，以便进行OCR识别,image,opencv,image-processing,ocr,noise,Image,Opencv,Image Processing,Ocr,Noise,嗨，我正在尽可能多地清除历史文献中的噪音这些文件有像整个文件中的小点一样的染色，影响OCR和手写识别。除了OpenCV中的图像去噪外，还有没有更有效的方法来清理这些图像一种可能的方法是自适应阈值，执行一些形态学操作，并使用纵横比+轮廓区域滤波去除噪声。从这里我们可以按位和生成的掩码以及输入图像来获得一个干净的图像。结果如下：因为您没有指定语言，所以我用Python实现了它 import cv2 import numpy as np # Load image, create blank

嗨，我正在尽可能多地清除历史文献中的噪音

这些文件有像整个文件中的小点一样的染色，影响OCR和手写识别。除了OpenCV中的图像去噪外，还有没有更有效的方法来清理这些图像

一种可能的方法是自适应阈值，执行一些形态学操作，并使用纵横比+轮廓区域滤波去除噪声。从这里我们可以按位和生成的掩码以及输入图像来获得一个干净的图像。结果如下：

因为您没有指定语言，所以我用Python实现了它

import cv2
import numpy as np

# Load image, create blank mask, convert to grayscale, Gaussian blur
# then adaptive threshold to obtain a binary image
image = cv2.imread('1.jpg')
mask = np.zeros(image.shape, dtype=np.uint8)
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
blur = cv2.GaussianBlur(gray, (7,7), 0)
thresh = cv2.adaptiveThreshold(blur,255,cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY_INV,51,9)

# Create horizontal kernel then dilate to connect text contours
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (5,2))
dilate = cv2.dilate(thresh, kernel, iterations=2)

# Find contours and filter out noise using contour approximation and area filtering
cnts = cv2.findContours(dilate, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
cnts = cnts[0] if len(cnts) == 2 else cnts[1]
for c in cnts:
    peri = cv2.arcLength(c, True)
    approx = cv2.approxPolyDP(c, 0.04 * peri, True)
    x,y,w,h = cv2.boundingRect(c)
    area = w * h
    ar = w / float(h)
    if area > 1200 and area < 50000 and ar < 6:
        cv2.drawContours(mask, [c], -1, (255,255,255), -1)

# Bitwise-and input image and mask to get result
mask = cv2.cvtColor(mask, cv2.COLOR_BGR2GRAY)
result = cv2.bitwise_and(image, image, mask=mask)
result[mask==0] = (255,255,255) # Color background white

cv2.imshow('thresh', thresh)
cv2.imshow('mask', mask)
cv2.imshow('result', result)
cv2.waitKey()

导入cv2
将numpy作为np导入
#加载图像、创建空白遮罩、转换为灰度、高斯模糊
#然后自适应阈值得到二值图像
image=cv2.imread（'1.jpg'）
掩码=np.zero（image.shape，dtype=np.uint8）
灰色=cv2.CVT颜色（图像，cv2.COLOR\u BGR2GRAY）
模糊=cv2.高斯模糊（灰色，（7,7），0）
thresh=cv2.自适应阈值（模糊，255，cv2.自适应阈值，高斯，cv2.阈值二进制，51,9）
#创建水平内核，然后展开以连接文本轮廓
kernel=cv2.getStructuringElement（cv2.morp_RECT，（5,2））
扩张=cv2。扩张（阈值、内核、迭代次数=2）
#使用轮廓近似和区域滤波查找轮廓并滤除噪声
cnts=cv2.找到的孔（扩张，cv2.外部翻新，cv2.链近似简单）
如果len（cnts）==2个其他cnts[1]，则cnts=cnts[0]
对于碳纳米管中的碳：
peri=cv2.弧长（c，真）
近似值=cv2.近似聚合度（c，0.04*peri，真）
x、 y，w，h=cv2.boundingRect（c）
面积=w*h
ar=w/浮子（h）
如果面积大于1200且面积小于50000且ar小于6：
cv2.绘制轮廓（掩模，[c]，-1，（255255），-1）
#按位输入图像和掩码以获得结果
遮罩=cv2.CVT颜色（遮罩，cv2.COLOR\u BGR2GRAY）
结果=cv2.按位_和（图像，图像，掩码=掩码）
结果[遮罩==0]=（255255）#颜色背景白色
cv2.imshow（'thresh'，thresh）
cv2.imshow（“面具”，面具）
cv2.imshow（“结果”，结果）
cv2.waitKey（）