why can't it read the single character on the picture? #273

umutozgur · 2021-09-10T06:37:40Z

Why can't it read the single character in the picture?

ivanstepanovftw · 2023-11-26T12:32:31Z

Try this:

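import tesserocr
from tesserocr import PyTessBaseAPI

TESSDATA_PREFIX = "/usr/share/tesseract/tessdata"
tesserocr_languages = ["eng", "ara"]
api = PyTessBaseAPI(path=TESSDATA_PREFIX, lang="+".join(tesserocr_languages))

# `pixmap` is assumed to be a PyMuPDF Pixmap holding the rendered image.
bpp = pixmap.n  # assumption: bytes per pixel equals the number of components
api.SetImageBytes(
    imagedata=pixmap.samples,
    width=pixmap.w,
    height=pixmap.h,
    bytes_per_pixel=bpp,
    bytes_per_line=pixmap.stride,
)
# Treat the whole image as one character. <- important
api.SetPageSegMode(tesserocr.PSM.SINGLE_CHAR)
api.Recognize()
ocr_text = api.GetUTF8Text()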
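
When passing raw pixel data, it is easy to get the buffer length and the geometry arguments out of sync. A minimal sketch of a sanity check that could run before SetImageBytes is shown here. Note that check_buffer_geometry is a hypothetical helper, not part of tesserocr, and it assumes rows may carry padding bytes:

```python
def check_buffer_geometry(imagedata, width, height, bytes_per_pixel, bytes_per_line):
    """Return True if a raw pixel buffer is consistent with the stated geometry.

    Hypothetical helper for illustration; it is not part of tesserocr.
    """
    if width <= 0 or height <= 0 or bytes_per_pixel <= 0:
        return False
    # Each row must hold at least width * bytes_per_pixel bytes (padding allowed).
    if bytes_per_line < width * bytes_per_pixel:
        return False
    # The buffer must contain at least `height` rows of `bytes_per_line` bytes.
    return len(imagedata) >= bytes_per_line * height
```

With the snippet above, this could be called as check_buffer_geometry(pixmap.samples, pixmap.w, pixmap.h, bpp, pixmap.stride) before handing the data to the API.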

zdenop · 2023-11-26T14:08:03Z

Without providing an input image, you are alone with your problem...

ivanstepanovftw · 2024-03-21T08:45:32Z

Sure!

I think this issue should be closed as resolved.

If the image contains a single character, read this:

Tesseract works best on images which have a DPI of at least 300 dpi, so it may be beneficial to resize images.

Here is also a plot for you:
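
As a rough sketch of what that resizing could look like: the helper names below (size_for_target_dpi, upscale_for_ocr) are made up for illustration, the image's current DPI is assumed to be known, and Pillow is assumed to be installed for the resize step.

```python
def size_for_target_dpi(width, height, current_dpi, target_dpi=300):
    """Return the (width, height) needed to reach target_dpi; never downscale."""
    if current_dpi <= 0:
        raise ValueError("current_dpi must be positive")
    scale = max(1.0, target_dpi / current_dpi)
    return round(width * scale), round(height * scale)


def upscale_for_ocr(image, current_dpi, target_dpi=300):
    """Upscale a PIL image so its effective resolution is about target_dpi."""
    from PIL import Image  # imported lazily; requires Pillow

    new_size = size_for_target_dpi(image.width, image.height, current_dpi, target_dpi)
    if new_size == image.size:
        return image
    # LANCZOS is one reasonable resampling choice; others may suit some glyphs better.
    return image.resize(new_size, Image.LANCZOS)
```

For example, a 100x50 crop scanned at 72 dpi would be resized to 417x208 before being passed to Tesseract.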
