Overview

gcv2hocr is a repository that converts Google Cloud Vision OCR output to hOCR format and creates searchable PDFs.

I created a notebook to run the above repository on Google Colab.

As shown below, you can create searchable PDF files.

How to Use

Access the following notebook.

First, obtain an API key to use the Google Cloud Vision API. The following article may be helpful.

After entering the API key, press the three play buttons for the initial setup shown below.

Then, select the appropriate option from the execution options shown below.

For example, to specify an “Image URL,” press the two play buttons labeled “Settings” and “Run” shown below.

After execution, the PDF file will be downloaded. The path where the recognition results and other outputs are saved will also be displayed.

I would like to thank the developers of useful tools such as gcv2hocr and hocr-tools.