Django 从表单中读取图像,然后使用tesseract提取标题
这是我在Django项目中的HTML:Django 从表单中读取图像,然后使用tesseract提取标题,django,forms,tesseract,Django,Forms,Tesseract,这是我在Django项目中的HTML: <form method="GET"> <div> <label for="image"></label> <input type="file" name="image" id="image" required/> &l
<form method="GET">
<div>
<label for="image"></label>
<input type="file" name="image" id="image" required/>
</div>
<input type="submit" value="Submit"/>
</form>
最后,这是提取标题的代码(我在Jupiter中测试了OCR,它对我有效):
但在那之后,我最终出现了错误,可能是因为我没有正确地获取图像
def home(request):
result = None
if request.method == 'GET':
data = request.GET
im = request.FILES.get('image')
medecine = ocr_title(im)
def ocr_title(im):
image = cv2.imread(im, 0)
img = cv2.resize(image, (500, 500))
img = cv2.GaussianBlur(img, (5, 5), 0)
img = cv2.adaptiveThreshold(img, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY, 21, 4)
himg, wimg = img.shape
maxw = 0
title = pytesseract.image_to_data(img, config='--psm 6 --oem 3')
for x, b in enumerate(title.splitlines()):
if x != 0:
b = b.split()
if len(b) == 12 and len(b[11]) >= 4:
if (int(b[8]) > maxw):
maxh = int(b[9])
maxx = int(b[6])
maxy = int(b[7])
maxw = int(b[8])
text = b[11]
return (text)