Overview
This article introduces a Gradio app that creates TEI/XML files from IIIF manifest files using NDL Kotenseki OCR-Lite.
It can be accessed at the following URL:
https://nakamura196-ndlkotenocr-lite-iiif.hf.space/
Background
This is a continuation of the following articles:
Previously, two separate apps were needed, but with this update, the entire conversion process can be completed within a single Gradio app.
Additionally, issues such as difficulty tracking progress when processing manifest files with many image pages, and the inability to copy processing results, have been fixed.
Screen Example
As shown below, progress is now displayed, such as “Processing page 111/129… - 79.7%”:

The implementation uses Progress:
https://www.gradio.app/docs/gradio/progress
After OCR is complete, a link for downloading the TEI/XML file is now displayed:

Summary
Due to the limitations of the free CPU available on Hugging Face, OCR may take some time, but I hope this serves as a useful reference.