
Answer by jtbr for Getting the bounding box of the recognized words using python-tesseract

pytesseract can do this without writing to a file, using its image_to_boxes function, which returns one bounding box per recognized character:

import cv2
import pytesseract

filename = 'image.png'

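# read the image (cv2.imread returns None if the file cannot be read)
img = cv2.imread(filename)
if img is None:
    raise FileNotFoundError(filename)
h = img.shape[0]  # image height, used to flip the y-axis when drawing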

# run tesseract, returning the bounding boxes
boxes = pytesseract.image_to_boxes(img) # also include any config options you use

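# draw the bounding boxes on the image
# each line is "<char> <left> <bottom> <right> <top> <page>", with y measured
# from the bottom edge, so subtract from h to get OpenCV's top-left origin
for b in boxes.splitlines():
    b = b.split(' ')
    img = cv2.rectangle(img, (int(b[1]), h - int(b[2])), (int(b[3]), h - int(b[4])), (0, 255, 0), 2)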

# show annotated image and wait for keypress
cv2.imshow(filename, img)
cv2.waitKey(0)
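
If you need the coordinates themselves rather than a drawing, the string returned by image_to_boxes can be parsed on its own. The following is only a minimal sketch: `CharBox` and `parse_boxes` are hypothetical helper names (not part of pytesseract), the sample string is made up for illustration, and it assumes each line has the form `<char> <left> <bottom> <right> <top> <page>` with y measured from the bottom of the image.

```python
from typing import List, NamedTuple


class CharBox(NamedTuple):
    """Hypothetical container for one character box, top-left origin."""
    char: str
    left: int
    top: int      # measured from the top edge, as OpenCV expects
    right: int
    bottom: int


def parse_boxes(box_text: str, image_height: int) -> List[CharBox]:
    """Convert image_to_boxes-style text into top-left-origin rectangles."""
    result = []
    for line in box_text.splitlines():
        # Assumed line format: <char> <left> <bottom> <right> <top> <page>
        parts = line.rsplit(' ', 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        char, left, bottom, right, top, _page = parts
        result.append(CharBox(
            char=char,
            left=int(left),
            top=image_height - int(top),        # flip the y-axis
            right=int(right),
            bottom=image_height - int(bottom),  # flip the y-axis
        ))
    return result


# Made-up sample in the assumed format, for a 100-pixel-tall image
sample = "H 10 70 30 95 0\ni 35 70 42 95 0"
boxes = parse_boxes(sample, image_height=100)
```

In the script above you would pass the real output instead, e.g. `parse_boxes(pytesseract.image_to_boxes(img), h)`, and then use `(box.left, box.top)` and `(box.right, box.bottom)` as the two corners for cv2.rectangle.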
