Overview

This is a note on how to describe multiple VTT files for Audio/Visual materials using IIIF.

Here, we describe transcription text in both Japanese and English as shown below.

https://ramp.avalonmediasystem.org/?iiif-content=https://nakamura196.github.io/ramp_data/demo/3571280/manifest.json

Manifest File Description

An example is stored at the following location.

https://github.com/nakamura196/ramp_data/blob/main/docs/demo/3571280/manifest.json

Please also refer to the following article.

Specifically, by describing them as multiple annotations as shown below, they were correctly processed by the Ramp viewer.

"annota]ti{}on"""]siti"dyt{}{}:"pe,:em"""}""}""""}""}"["sitl,mb,titl,mb,t":"dya"]oo""""}adya"]oo""""}ah:"pbjtditflr"pbjtditflrt":eea"iydyoa"]g:eea"iydyoa"]gtA["l"v""prbje"l"Ev""prbjepn":":a::emea"t":":na::emea"tsnh:t"al""h:gt"al"E":t"[i{":t"::t"[li{":t":n:ttA{(oh":tA{ioh":g/apnmnt":["pnsnt":[l"ntsna"tT{(hsnh"tT{ihai:c:pe"mt::pe"stkothsxtatt(sxthtan/ai":tecp/am":tepmPntns/"xhsntas/"x(suaaieu/,ti:aicu/,tm:rgko-pn/n/kohpn/a/aeangpave/anipavc/1"m"elkt-nm"nlkthn9,u,neatgau,eeatia6remm"ekr-mm"nk.areu,naageu,eag1anrem1enr-mi9ttaru9ntagut6ei1ar6ei1erh.dn9ta.rn9naug)g6e1gag6e1bi"".d9it".r9.t,g)6te,ga6ihi".hdit.outgu)teg/bhib"hdir.ut.u)taibhib"hmo.uo.up/ib/ib_ro.ro.da/ia/iamromrotpa/pa/a_mr_mr/dpadpada_ma_metdptdpmaa_aa_o/td/td/daadaa3e/te/t5mdamda7oe/oe/1/md/md23oe3oe85/m5/m073o73o/15/15/c273273a815815n027027v/81/81ac02c02sa/8a/8/n30n30pv5/v5/aa7ca7cgs1as1ae/2n/2n/a8va8v2n0an0a"n.sn_s,ov"oe"tttnata.t"tvi,itootnn"//,wweebbvvtttt"/,2",

Note that in Clover, the two transcription texts were displayed consecutively.

https://samvera-labs.github.io/clover-iiif/docs/viewer/demo?iiif-content=https://nakamura196.github.io/ramp_data/demo/3571280/manifest.json

(Reference) Creating English Transcription Text

For creating the English transcription text, the following program was used. This is an example using the GitHub version of Whisper.

https://github.com/openai/whisper

dddeeeffff"hmsrwwtmrwro"oieeriroerer"unctitadsitmCruouthneutuaostnreffsllertnedn_oiolt_n_v=ssvplra=vtefteet=triri=="tn.isetfewtemtn{((w,tnei(hm(sestishtfradxliiorus(neoriisrtensdelttstcualtet=.ppestaie(orneeg=wuelummc(nss_(m=frtr.lpeosd:cp"eosi_.tt(nes0raWnfretplr,sidc2itEtomgeaoaenso%}phBram(tanocn:t,Vimtefhdsuosd6{iTna_n",_ctnes0mo'Tttt{mrpdcinwe_i[sooiuso3%n,'nntm'tudbt)n6u)uietatee_:d03tfnmmserpl(ps06eia"eetxtu(ia)0sls)rsat}t'ntt0:eatm'_mpho)0_ftap]peu)2piem(.adta}al(pssti_:tet(ethupf{h:rsgr{,mao6s)aemie'tr0e:ngepnv)hm)csmn(de,aocet)}rtnrn[\bvtdit'noeesp[e{srd:t'nteb0isde=os6ot'xFst.na]taer3[r)}l=if't\svn}s'neeg"e]\)rg)n:b'm"oHe)sHne:t#,MsM'{l:]iaS)nS:+g.um1am}gm\e'n=.""j"a"",task="translate")

Initially, I tried translation using the API version of Whisper as follows, but it output in Japanese and I was unable to successfully create English text.

transmfrcoierdlsieeppl=ot=an"us=wdehi_ciofls_oipfreeimnrlat-et.1,=a""u,vdtito".,translations.create(

Summary

I hope this is helpful for describing multiple transcription text and subtitle files.