Overview

I created an NDL “Classical Text” OCR application using Google Colab. You can try it at the following URL.

https://colab.research.google.com/github/nakamura196/ndl_ocr/blob/main/NDL古典籍OCRの実行例.ipynb

The description of NDL Classical Text OCR is as follows.

https://github.com/ndl-lab/ndlkotenocr_cli

The notebook was created with reference to @blue0620’s notebook. Thank you!

https://twitter.com/blue0620/status/1617888733323485184

In the notebook I created, I added support for additional input formats and a feature to save to Google Drive.

How to Use

The usage is almost the same as the NDLOCR application. Please refer to the following video.

https://youtu.be/46p7ZZSul0o

Features

As a notable feature, I converted the recognition results into an IIIF manifest format, making them viewable in the Mirador viewer. Specifically, the following output is produced.

By clicking the latter link, the following Mirador viewer is displayed, allowing you to review the recognition results.

This IIIF manifest file is also stored in Google Drive.

References

For the NDLOCR tutorial, please refer to the following.