Overview
I have been maintaining a Google Colab tutorial notebook for the NDL Classical Book OCR application, as introduced in the following article.
This update makes the following changes.
- Added terms of use
- Fixed bugs
- Added support for IIIF Presentation API v3 manifest file input
The updated notebook can be accessed at the same URL as before.
https://colab.research.google.com/github/nakamura196/ndl_ocr/blob/main/NDL古典籍OCRの実行例.ipynb
Terms of Use
The notebook itself may be used under CC0. However, the “NDL Classical Book OCR Application” is released by the National Diet Library under the CC BY 4.0 license, so please include the appropriate credit when using it. Please also check the terms of use of the materials you apply OCR to.
Bug Fixes
Two issues have been fixed: OCR results were not being output, and library installation had started taking a long time.
Support for IIIF Presentation API v3 Manifest File Input
Previously, only IIIF Presentation API v2 manifest files were accepted as input.
This update adds support for v3. The IIIF Presentation API v3 manifest file introduced in the following article was used to verify it.
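To illustrate why the two versions need separate handling, here is a minimal Python sketch of collecting image URLs from either kind of manifest. It is not the notebook's actual code: the function name `extract_image_urls` and the sample manifests are hypothetical, and it assumes the common layouts (v2: `sequences` > `canvases` > `images` > `resource`; v3: nested `items` with a painting annotation `body`).

```python
from typing import List


def extract_image_urls(manifest: dict) -> List[str]:
    """Collect image URLs from a IIIF Presentation API v2 or v3 manifest.

    Hypothetical helper for illustration; version detection relies on the
    top-level key ("sequences" for v2, "items" for v3).
    """
    urls: List[str] = []
    if "sequences" in manifest:
        # v2: sequences -> canvases -> images -> resource["@id"]
        for sequence in manifest["sequences"]:
            for canvas in sequence.get("canvases", []):
                for image in canvas.get("images", []):
                    urls.append(image["resource"]["@id"])
    elif "items" in manifest:
        # v3: items (Canvas) -> items (AnnotationPage) -> items (Annotation) -> body["id"]
        for canvas in manifest["items"]:
            for page in canvas.get("items", []):
                for annotation in page.get("items", []):
                    body = annotation.get("body")
                    # Only a single image body is handled here; other body
                    # shapes (e.g. a Choice) are skipped in this sketch.
                    if isinstance(body, dict) and "id" in body:
                        urls.append(body["id"])
    else:
        raise ValueError("Not a recognizable IIIF v2 or v3 manifest")
    return urls


# Tiny made-up manifests (example.org URLs are placeholders).
v2_sample = {
    "sequences": [{"canvases": [{"images": [
        {"resource": {"@id": "https://example.org/v2/page1.jpg"}}
    ]}]}]
}
v3_sample = {
    "items": [{"type": "Canvas", "items": [{"type": "AnnotationPage", "items": [
        {"type": "Annotation", "motivation": "painting",
         "body": {"id": "https://example.org/v3/page1.jpg", "type": "Image"}}
    ]}]}]
}

print(extract_image_urls(v2_sample))
print(extract_image_urls(v3_sample))
```

In practice the manifest would be loaded from its URL as JSON before being passed to such a function. Checking the `@context` value is another way to detect the version.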
Summary
I apologize for the inconvenience caused by these bugs. I hope this notebook helps you make use of NDL Classical Book OCR.