PDF OCR

by schorschii

Score 8

Issues Website Download

UUID: pdf-ocr@schorschii

Last edited:

7 months ago 2025-11-15, 08:26

Last commit: [25434397] Add Japanese translations (#746)

Quickly recognize text in images of PDF files via OCR

README

PDF OCR

Does Optical Character Recognition (OCR) on the selected PDF files.

DESCRIPTION

After scanning, a PDF file only contains an image of your scanned text. With OCR, you can add text information so that you can copy it from your PDF. The resulting PDF will be saved with the suffix ".ocr.pdf".

DEPENDENCIES

The following programs must be installed and available:

ocrmypdf for PDF processing
zenity for progress dialog

CHANGELOG

Open

Log In To Comment!

4 Comments

Douglas Wade Goodall-1 year ago

I am using LMDE and although I have installed tesseract-ocr, it can't be found so things don't work

uditarenos-2 years ago

Thank you for your work! I've used it before. There might be a problem with space characters in file names. The dialogue gives me ("Date Example.pdf"): ERROR - InputFileError: File not found - Date ERROR - InputFileError: File not found - Example.pdf

Nassos Kranidiotis-2 years ago

Thank you for the nice action. Please consider including support for Greek language.

Julian Groß -2 years ago

Should work now if you update. https://github.com/linuxmint/cinnamon-spices-actions/pull/413 You just need the relevant tesseract package installed. E.g. tesseract-ocr-ell.

PDF OCR

README

PDF OCR

DESCRIPTION

DEPENDENCIES

CHANGELOG

1.0

Log In To Comment!

4 Comments