File Exchange

image thumbnail

How to detect & localize a text in pdf using OCR in MATLAB

version 1.0.3 (547 KB) by Kevin Chng
Using Python package - pdf2image convert the the pdf to image Using OCR in MATLAB to detect & localize the text

15 Downloads

Updated 16 Jul 2019

View License

Using OCR to detect and localize text is simple in MATLAB. However, it is only workable if your input is image format (jpg,png) but not pdf. Hence, we are going to convert the pdf to image. However, up to MATLAB version R2019a, It don't have any built-in function to convert pdf to image. For this example, i am going to use a python package pdf2image help us to convert pdf to image. There are no conflicts using MATLAB or Python. If there is something working better in Python, we can collaborate both platform (MATLAB and Python) through MATLAB Api to complete our objective.

Highlights :
Execute python user-defined function from MATLAB
Detect and Localize a text in pdf

Product Focus :
MATLAB
Computer Vision Toolbox

Written at 16 July 2019

Cite As

Kevin Chng (2020). How to detect & localize a text in pdf using OCR in MATLAB (https://www.mathworks.com/matlabcentral/fileexchange/72156-how-to-detect-localize-a-text-in-pdf-using-ocr-in-matlab), MATLAB Central File Exchange. Retrieved .

Comments and Ratings (0)

Updates

1.0.3

modify description

1.0.2

Change description

1.0.1

*change description

MATLAB Release Compatibility
Created with R2019a
Compatible with any release
Platform Compatibility
Windows macOS Linux
Tags Add Tags

OCRforPDF