Overview

I had the opportunity to try rico-converter, so here are my notes.

https://github.com/ArchivesNationalesFR/rico-converter

It is described as follows.

A tool to convert EAC-CPF and EAD 2002 XML files to RDF datasets conforming to Records in Contexts Ontology (RiC-O)

Converting

Explanation is available at the following link.

https://archivesnationalesfr.github.io/rico-converter/en/GettingStarted.html

First, download the latest zip file from the following and extract it.

https://github.com/ArchivesNationalesFR/rico-converter/releases/latest

There are sample data in input-eac and input-ead, so we will convert these to RDF.

input-eac

Here is ChatGPT’s explanation of input-eac/FRAN_NP_051151.xml in this folder.

!

This XML file is written in the EAC-CPF (Encoded Archival Context for Corporate Bodies, Persons, and Families) format, and systematically organizes information about organizations such as the French Ministry of Culture (Ministere de la Culture). The main elements are as follows:

  • Control section: Contains metadata about the record, including the record ID, language declarations, revision history, and sources of materials used.
  • Identity section: Contains basic information about the corporate body (here, the French Ministry of Culture). It includes multiple names showing how the ministry’s name has changed over time, with detailed descriptions of the periods during which each name was used.
  • Description section: Contains detailed descriptions of the ministry’s period of existence, legal status, main functions, missions, and historical changes. For example, it explains how the ministry was established in 1959, how it has been operated based on decrees, and how its organizational structure has changed.
  • Relations section: Shows relationships with other organizations and individuals. This includes detailed descriptions of relationships with related agencies and educational institutions in France, as well as successive Ministers of Culture, along with links to external resources (archives and online information).

This XML is intended to describe information about archives and organizations in a standardized format and to enable interoperability with other databases and systems. It is particularly useful for tracking organizational history and relationships.

Execute the following.

shricoconverter.sh

Running with default settings creates a folder like output-eac-20241005, and RDF files are output inside it.

input-ead

Here is ChatGPT’s explanation of input-ead/FRAN_IR_003500.xml in this folder.

!

This XML file is a document written in EAD (Encoded Archival Description) format that describes archival materials. This specific document describes an archival collection called “Fonds Vitet” from France, spanning from the 19th to the 20th century. The content includes an overview of the document, the creator, publication information, and the detailed structure of the archival materials.

Main elements:

  1. eadheader: This section contains metadata about the materials (language, dates, repository information). For example, it includes the material’s ID (FRAN_IR_003500) and a summary of the archive.
  2. filedesc: Contains the title of the material, the creator, publisher, etc. In this example, it has the title “Fonds Vitet,” with C. Sibille as the creator, published by the Archives Nationales (French National Archives).
  3. archdesc: Describes the contents of the archive in detail. The unit IDs, titles, creation dates, and provenance (e.g., who created them) of each item in the “Fonds Vitet” are described in a structured manner.
  4. dsc: A detailed inventory of the archive. Individual materials are organized hierarchically, with detailed contents of individual documents, letters, photo albums, etc. listed.

This document is a detailed record of the archive, organized and hierarchically structured to make it easy for researchers and archive users to reference the materials.

As before, execute the following.

shricoconverter.sh

At this point, select convert_ead at the Enter command to execute prompt.

An execution example is as follows.

s:EEjR0---LPh:nnai7irttvC:NNNsNorWeea-2uuutocierrO9Emmmnecl-:AbbboesoccpXC4Deeefscooamo9rrr!ommrxn.Cftnema1v8ooooiovam2e6nfffloetne0r2vekrodt0tefffsteMeIriii0eRtrrNsllli:rio-Fieeen0.cfX1Oosss0s-eim0ne:hOxls0tiir0ee1%fRonnr1Cc2.eooul0[gppEsr(nto0=.orRussvecM=croRc:tea=.tcOcar(t-=aeRertcij=.ssteooa=rssernnr=.:::dv=e2e[r=.101a.rpi=c77t0trc=.._eo=E22esc=a0aso=d2:cn=24:,Ev=R-ne=i1ctr=c0oet=o-nre=C0vr=o5ef-=nroc=v0trl=e7_i=r:e'-=t2ap2=e9ca.=r:_r0=R4ra.=e8am2=p,we-=o,to=reen=tncre=Ldosj=ien/a=sdcr=teo.=earnj=nttva=e_er=r2er=0atc=-2d_o=4,en=-av=1tde]0e.r-spt10tr_75_oe/epa10aed77cr:,t@(2ip09tea::esr04s'a09t]m:)_:e0et1aedr/,s/0vc:eo0rn0sv:ie0or0nt),_ehaedl.pp)ro[pperretsisesEnterfor'convert_eac']:convert_ead

As a result, a folder like output-ead-20241005 is created, and RDF files are output inside it.

Registering in a SPARQL Endpoint

Referring to the following, I bulk registered the output RDF files to Dydra.

An example registration script is as follows.

ffffec#frrrinlooooldicrmmmepelsonifcgdtiteillyq=nnlioddt=teebrmg,.nalDciti_ioayln.mpmbpdeipyp(iratmo.o"_arqprar.kC(dotpt/el)mriryi(tgtief_liqc=nibomdotlybpmcD(e_ooyesfrndn)itvrd:leapeDrCo(ytlifdeinirretla-n,eC2t,l..ai0lp"e.oixn2a_mt/dklo_e"uey)tn)pvu(t"-.e/a.ce-n2v0"2)41005/**/*.rdf",recursive=True)

Additionally, since rico appeared to use the following Namespaces, I registered them in Snorql.

Let me try various things below.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2FrecordResource%2Ftop-003500

!

This describes a collection of materials stored at the Archives nationales de France called “Fonds Vitet.” The collection contains the following information:

  • rdf:type: This resource is classified as a “RecordResource” and a “RecordSet.”
  • rdfs:label: The title of this collection is “Fonds Vitet.”
  • beginningDate: The start date of the collection is January 1, 1801.
  • date: The collection spans from the 19th to the 20th century.
  • endDate: The end date of the collection is December 31, 2000.
  • hasInstantiation: This collection has a concrete instantiation (actual materials), with the URI http://data.archives-nationales.culture.gouv.fr/instantiation/top-003500-i1.
  • hasOrHadHolder: The holder of this collection is the Archives nationales (agent/005061).
  • hasProvenance: The provenance of this collection is associated with two agents (050218 and 052986).
  • hasRecordSetType: The type of this collection is “Fonds” (archival collection).
  • includesOrIncluded: This collection includes multiple record resources (003500-d_1, 003500-d_2, 003500-d_3, 003500-d_4).
  • isOrWasDescribedBy: This collection is described by a record (record/003500).
  • title: The official title of the collection is “Fonds Vitet.”

This data describes a historical record group called “Fonds Vitet” stored at the Archives nationales de France, containing materials from the 19th to the 20th century.

Multiple resources are linked with rico:includesOrIncluded.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2FrecordResource%2F003500-d_1

!

This data describes “LUDOVIC VITET (1802-1873)” stored at the Archives nationales de France. This resource represents a record resource about the materials of Ludovic Vitet, containing the following information:

  • rdf:type: This resource is classified as a “RecordResource.”
  • rdfs:label: The title is “LUDOVIC VITET (1802-1873).”
  • hasInstantiation: The instantiation (concrete materials) of this record exists at http://data.archives-nationales.culture.gouv.fr/instantiation/003500-d_1-i1.
  • hasOrHadPart: This record resource is divided into multiple parts, represented by the following URIs:
    • 003500-d_1_1
    • 003500-d_1_2
    • 003500-d_1_3
    • 003500-d_1_4
    • 003500-d_1_5
  • hasProvenance: The provenance of this record resource is associated with http://data.archives-nationales.culture.gouv.fr/agent/051234.
  • isOrWasIncludedIn: This resource is included in the higher-level record set “Fonds Vitet” (top-003500).

This information provides details about materials related to Ludovic Vitet, a 19th-century French historian and politician.

It is linked to an agent with rico:hasProvenance.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2Fagent%2F051234

!

This data is the record of an agent (person) named “Ludovic Vitet (1802-1873)” stored at the Archives nationales de France. Ludovic Vitet was known as a French historian, archaeologist, and politician. This record contains the following information:

  • rdf:type: This agent is classified as a “Person.”
  • rdfs:label: The title is “Vitet, Ludovic (1802-1873).”
  • owl:sameAs: Links to other data sources about Ludovic Vitet are provided (e.g., DBpedia, ISNI).
  • agentIsConnectedToAgentRelation: Links showing relationships with other agents.
  • birthDate: Indicates that he was born on October 18, 1802.
  • deathDate: Indicates that he died on June 5, 1873.
  • descriptiveNote: Details about Ludovic Vitet’s family background, historical context, and achievements are described. For example, he was interested in archaeology and history, was active in journalism and literature, and was particularly involved in the preservation of French monuments, contributing to the establishment of the Commission des Monuments Historiques.

This record provides extensive information about the life and achievements of Ludovic Vitet, detailing his family lineage and his efforts in French cultural heritage preservation.

This is linked to the following Record with rico:isOrWasDescribedBy.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2Frecord%2F051234

!

This data represents a record about “Ludovic Vitet (1802-1873)” at the Archives nationales de France. This record contains the following information:

  • rdf:type: Classified as a “Record.”
  • rdfs:seeAlso: Contains links to other reference materials about Ludovic Vitet (e.g., Wikipedia, ISNI).
  • creationDate: The record was created on December 3, 2015.
  • describesOrDescribed: This record describes the agent (person) Ludovic Vitet.
  • hasCreator: The creator of this record is the “Archives nationales de France (agent/005061).”
  • hasDocumentaryFormType: The format of the record is “Authority Record.”
  • hasInstantiation: Contains links to instances related to this record.
  • hasOrHadLanguage: The language of the record is French.
  • isOrWasRegulatedBy: Contains links to rules related to this record.
  • lastModificationDate: The record was last modified on October 25, 2019.
  • source: Contains links to various information sources about Ludovic Vitet (e.g., BnF, Academie francaise, Wikipedia).

This record provides detailed information about the life and achievements of Ludovic Vitet, indicating reliable information sources about him.

This is linked to the following agent with rico:hasCreator.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2Fagent%2F005061

!

This data presents an overview of the Archives nationales de France (French National Archives). It includes the following details:

  • rdf:type: Classified as “rico:Agent” and “rico:CorporateBody.”
  • rdfs:label: Named “Archives nationales (France; 1790-….).”
  • owl:sameAs: Links to other databases about the Archives nationales de France (e.g., DBpedia, BnF, ISNI).
  • rico:agentIsConnectedToAgentRelation: Contains URIs showing relationships with other agents and periods of activity.
  • rico:beginningDate: The founding date is shown as “1790-01-01.”
  • rico:descriptiveNote: Description of the internal organization and history. The archives are composed of three main sites:
    • Paris site: Old Regime archives, Paris notarial records
    • Pierrefitte-sur-Seine site: Post-Revolution public archives and private documents
    • Fontainebleau site: Specific public archives (e.g., naturalization applications, Legion of Honor records)

Furthermore, it is explained that the archives are operated by four main departments (Public Affairs, Collections, Scientific Support, and Administration).

  • rico:groupIsTargetOfGroupSubdivisionRelation: Shows the group subdivision relationships within the archives.
  • rico:history: Provides a detailed description of the historical background of the archives, including its founding in 1790, the preservation of historical public records, the centralization of archives throughout France, and the principle of public access. It also details the evolution and expansion of the archives’ functions through various eras.

This record provides detailed information about the organizational structure and historical background of the Archives nationales de France, explaining the evolution of its functions and roles through different eras.

This is linked to the following agent with rico:isOrWasSubdivisionOf.

https://nakamura196.github.io/snorql_examples/rico/?describe=http%3A%2F%2Fdata.archives-nationales.culture.gouv.fr%2Fagent%2F000005

!

This data contains information about France’s “Ministere de la Culture et de la Communication” (Ministry of Culture and Communication). Here is an overview:

  • rdf:type: Classified as “rico:Agent” and “rico:CorporateBody.”
  • rdfs:label: Named “France. Ministere de la Culture et de la Communication (1959-….).”
  • owl:sameAs: This agent is linked to other databases, including DBpedia, BnF (Bibliotheque nationale de France), and ISNI.
  • rico:agentIsConnectedToAgentRelation: Shows relationships and periods of interaction with other institutions.
  • rico:beginningDate: The founding date of the Ministry of Culture and Communication is recorded as “1959-01-08.”
  • rico:descriptiveNote: Contains detailed descriptions of the ministry’s organizational structure and history. Established in 1959, the Ministry of Culture and Communication was composed of various departments including architecture, the Archives nationales de France, and the general directorate of arts and literature. Over time, structural changes and separations progressed. For example, the General Directorate of Arts and Literature was abolished in 1969, the Music, Dance, and Opera department became independent in 1970, and under the 2009 public policy reform, it was reorganized into three main departments (Cultural Heritage, Artistic Creation, Media and Cultural Industries).
  • rico:agentIsTargetOfAgentOriginationRelation and rico:agentIsTargetOfPerformanceRelation: Various relationships and activity histories are indicated by URIs.
  • rico:history: Description of the ministry’s historical development. It was established by Charles de Gaulle in 1959, with Andre Malraux appointed as the first Minister of Culture. Subsequently, multiple ministers shaped the direction of cultural policy, and from the 2000s onward, new areas focusing on digital technology and the internet were also addressed.

This record provides a detailed account of the development of the Ministry of Culture and Communication since its founding and its connections with related institutions.

As shown above, we confirmed that various resources are linked through RDF.

Summary

I thought I would like to study Records in Contexts Ontology (RiC-O) thoroughly.

I hope this serves as a useful reference.