Stack Overflow
2 votes
1 answer
133 views

I am including OCR with Leptonica and Tesseract capi.h in my C project. For regular images loaded as Pix, all is good, but for multipage TIFFs loaded as Pixa, I get the following compiler error: ocr.c:...
0 votes
0 answers
23 views

I want to detect all the words in my image. In the original, it detects all 4 lines but not the word "Juliet", which was expected. Then I inverted the image in order to detect only &...
0 votes
1 answer
153 views

I am using docling and trying to get images with scanned text to parse with Tesseract OCR (could be any OCR, but tesseract is preferred if possible). My code is: pipeline_options = PdfPipelineOptions()...
-1 votes
2 answers
187 views

The image contains a single document printed on white paper. The background of the image can vary. I tried to get the document using code from https://scanbot.io/techblog/document-edge-detection-with-opencv/ with ...
0 votes
2 answers
77 views

Pytesseract cannot understand very simple and clear text. I've tried nearest neighbor, bilinear, gaussian blur, and everything else and cannot get tesseract to read the text consistently, the best I ...
1 vote
1 answer
90 views

I am struggling with the tesseract package (version 5.3.2) for R, trying to get ALTO XML as the output of the ocr() function. I read the documentation, which states that this has something to do with the ...
-1 votes
1 answer
149 views

The receipt clip contains a structured background. I tried to remove it using the textcleaner ImageMagick wrapper script from the answer to Remove receipt image border using ImageMagick. I used code from the answer How to use ...
2 votes
1 answer
220 views

I am trying to extract numbers from dotted LED-style digits (0–9) using Tesseract OCR in a MAUI/Xamarin app on Android and iOS, fully offline. My boss wants a local solution that works on mobile ...
1 vote
3 answers
176 views

My code works as a Python file, but I am struggling to make it work using PyScript. I am sharing the code that I tried. main.py import pytesseract pytesseract.pytesseract.tesseract_cmd = r"...
1 vote
0 answers
201 views

I’m using Docling to OCR scanned PDFs. I want to control Tesseract’s page-segmentation mode (PSM), e.g. --psm 6. Docling exposes both TesseractOcrOptions and TesseractCliOcrOptions, but neither ...
2 votes
1 answer
76 views

I'm attempting to perform OCR on a set of single letters inside an image using Python. I'm new to this so apologies if I get the terminology wrong, but I've filtered and have obtained (I think) quite ...
0 votes
1 answer
177 views

I tried to use the https://github.com/Sicos1977/TesseractOCR NuGet package on Debian 12. It looks like it requires a new version of Leptonica, libleptonica-1.85.0.dll.so, which is not available in Debian: #apt ...
0 votes
1 answer
83 views

I am trying to OCR a specific area of a PDF page in a multi-page document (total page count varies between 600-10,000 pages). I initially receive the data as .pcl files in batches of 500 records, ...
1 vote
1 answer
89 views

I am currently using tesseract 5.0 and am training a model. I have generated the png, box and the ground truth files for a thousand images. However, when I run the command: make training MODEL_NAME=...
0 votes
0 answers
94 views

I followed the steps for fine-tuning Tesseract for handwriting recognition. I have the character images and the corresponding box files. Then I generated the .lstmf files, followed by the lstm_train....
