Introduction

Odeuropa is a unique project that extracts descriptions of “scent” from European historical documents and structures them as Linked Data. In this article, we explore the actual data through the SPARQL endpoint, revealing its structure and design philosophy.

What is Odeuropa?

Overall Data Model

Odeuropa uses an extended ontology specialized for scent, based on CIDOC-CRM (Conceptual Reference Model for Cultural Heritage).

Key Concepts and Relationships

SForuargcPmPe1e6ESE0n7mmx(6t_iepD_rsleoi(eslrcsTfiFFiFu_eeo31(e2mcxrn__Sn_eotshgccpnm_(aeeeetpftEdnnr)orom_et(csaisr)Eeegsoaxidmsutpv_eireeeonocdrdftnei)eenvScSeOmemnbeetjlel)elvlcet(n(StS(c)cSeeConneuttnr))tcrea)lhub

Key points:

  • Fragment directly references Emission, Smell, and Experience
  • Object is accessed via Emission (Fragment -> Emission -> Object)
  • Emission plays the central role of causally connecting Object and Smell

Learning Data Structure Through Examples

Let’s examine the data structure using the 1810 German agricultural book “Grundsatze der rationellen Landwirthschaft” (Principles of Rational Agriculture) as an example.

1. Source (Document)

An entity that stores basic information about the document.

SW}EHLE?ERsCETar<<<{dhhh?<ftttshstttt:ppp?tlsssapa:::u:b/t/e/h/lsssoecccrr"hhhlGeee?armmmdnuaaaagn...tedoooensrrr-äggg?ct///lrzadiameuann.ttLgodheaureoCnagrrrgg/>euecraaua?tgrtaeeriud>eot>nnh?teo?l/lrdaElan3e;tg3neu_aLL;giaenngd.uwiisrttihcs_cOhbajfetc"t@>de;;

Run query

Key properties:

  • rdfs:label: Title
  • schema:author: Author (Albrecht Daniel Thaer)
  • schema:dateCreated: Creation year (1810)
  • schema:inLanguage: Language (de)
  • schema:genre: Genre (Household texts & recipes)
  • schema:locationCreated: Place of creation
  • P106_is_composed_of: Contained fragments

2. Fragment (Text fragment)

A portion of text containing scent-related descriptions.

<frrsPPPadc161gfh076m:e6_5evm_rinaaie_tl:sfi/up_es8eocr_dsosi7"im_ncSgdtptc4ieoiooo9ebcosr2ehne<paſndeo-im4_mrcneeoia8dih;fstfnrse6de"id-anzSo_5heinibeTre/n4rhr"eeoe,8<-ani5sangb"bo1gelTfu2erih4r5fucobc-echn6eeuheg>/9cne,04hr00toKu<3cenlcs9dtuhmc9ſm"efdſipl80cce;l9fhhn/>6l,2cuz4>ͤuufpnſffdadrmbitm8grebon>uc.,nk"dn<e;ednxephzenurbiafereneſcrte,e/n037de532>;

View data

Meaning of the text: “They (clays) become slippery and more elastic when moistened, emit a clay smell, and dry into solid but more crumbly clumps.”

Key properties:

  • rdf:value: Actual text content
  • schema:position: Position within the document (4th fragment)
  • P106_is_composed_of: Important words contained (“Sie”, “Thongeruch”)
  • P67_refers_to: Referenced concepts (Emission, Smell, Experience)
  • P165i_is_incorporated_in: Parent document (Source)

3. Emission (Scent emission event)

Represents an event where a scent is generated. Emission is referenced from Fragment and connects Object and Smell.

<emaFFPPtPi3191i6s<__22m7shhg__eiitaebo:_otdnrchinp_eocas/:sruus_e/agrTr8uthrie5dretemfbacd_deeftei_r4a<ni<rb.stnte6oomo_id-dbe_tm_eejlehet1uelxe/oarc/i_8_6ot2sp3b-p/4tr4y5a2feed2.6fns8<7ebdceaf2u7ben0r-/98c7abo7b<e>g7n9>s_mft0mo;e6o>;efn-llt1o;l/6g/o8ey2bd9/4j71Lfec31fc402dt91_b/22S82afmb6>1e>bdl7>l;9_7E9m0i>s,si<osnm>el;l/24ffdb8b>;

View data

Key properties:

  • F3_had_source: Source of the scent (Object “Sie”)
  • F1_generated: Generated smell (Smell “Thongeruch”)
  • P92_brought_into_existence: Smell brought into existence
  • P12_occurred_in_the_presence_of: Things present at the event (Object and Smell)
  • time:hasTime: Time of occurrence (1810)
  • P67i_is_referred_to_by: Fragment referencing this Emission

Role of Emission: Emission is the central event expressing the causal relationship of “which Object (source) generated which Smell (scent), and when.”

4. Object (Scent source)

The object or substance that emits the scent. Object is referenced from Emission.

obarPjd1e<f2chsitt:_/tlw2paa6:bsb/e_7/lp9wr7w"e9wSs0.ie-ienfc"tas_d.;a0ft-o5r<6tefhm0.i-gsarsc/idio6sn-l/a/ebC84R52Mb3sfac44ib6/60S>d1d03_>Material_Substantial>;

In this example, “Sie” (they) is a pronoun referring to clay or soil.

Types:

  • S10_Material_Substantial: Material substance
  • S15_Observable_Entity: Observable entity

Key properties:

  • rdfs:label: Object name (“Sie”)
  • P12i_was_present_at: Event where this object was present (Emission)

Connection path:

Fragment(P67_refers_to)Emission(F3_had_source)Object

View data

5. Smell (Scent)

The central concept representing the scent itself.

<smrPPed91lf24lsi0/:_i2lw_4aawfbsafe_sdlb_bra8"otbTut-hgraohidntbcg_ubeit-rne5utdcco_8h_bb"ey-x8;i<dse3txaep-neecrbei3_ecbn4yc5e3</3e0fm39i73sdcse>i5o3n2/>e85bf4b6>;

Key properties:

  • rdfs:label: Name of the scent (Thongeruch = clay smell)
  • P92i_was_brought_into_existence_by: Emission that generated this smell
  • P140i_was_attributed_by: Experience that recognized this smell

View data

6. Experience (Scent experience event)

An event where a person perceives or experiences a scent.

<exFOPtp281ie__4mrpo0eieb_:ersahncesacersseiviT/vegi0ednm3dee7<dd<s_<esmat5meti3eltm2llre-li/92b8524u374ft47ffed-fd_85dbta0b8o048b7fb><>->s9;m0;e4l4l-/f2c4bf7f5d9b485b9>ba;2>

Key properties:

  • F2_perceived: Perceived smell
  • O8_observed: Observed smell
  • P140_assigned_attribute_to: Target to which attributes were assigned
  • P14_carried_out_by: Experiencer (Actor)

View data

Data Flow: The Complete Story

1810,📖📄💨(EGDT"EmeoeSmircxiismutessam.sinePf.Pioyn1r.6onFFt0ag7n)316ge___"_mbrhgPieeeaersnnf👃(dni_teS_encerSmsrco(ismeoaimFn_elutppretllreloanol)cdesgesemTdeho_nofotnf)g(🏺👃Re👁EarxMS👁👨tupamicEete🌾ohxrelEnpirlxOaeeipblorna(esniclSreAeemFirgsn)(e2evricOl_neiceblpcrchj)ee:u.e:rl.cc(Tt.t"eEhu")Tixar:hvpeeoeer""ndrSgi(ieeSerno"ucucer=h)c"ec)lay

Data flow explanation:

  1. Fragment directly references three concepts (Emission, Smell, Experience)
  2. Emission is the center of causal relationships:
    • From Object (source)
    • Generates Smell (scent)
  3. Experience perceives Smell
  4. Object is indirectly connected to Fragment via Emission

SPARQL Query Examples

Searching by Language

When searching by German label:

SW}EHLE?ERsCETrr{dd?ffsss::?lllaaabbbeeelll"?Glraubnedlsä.tzederrationellenLandwirthschaft"@de;

Retrieving Visual Items with Images

Avoiding duplicates when multiple images exist:

SW}EHLE?ERsCETar<<<{dhhh?<ftttshstttt:ppp?tlsssapa:::u:b/t/e/h/lsssoecccrr"hhhlGeee?armmmdnuaaaagn...tedoooensrrr-äggg?ct///lrzadiameuann.ttLgodheaureoCnagrrrgg/>euecraaua?tgrtaeeriud>eot>nnh?teo?l/lrdaElan3e;tg3neu_aLL;giaenngd.uwiisrttihcs_cOhbajfetc"t@>de;;

0

Retrieving Smells and Their Sources

SW}EHLE?ERsCETar<<<{dhhh?<ftttshstttt:ppp?tlsssapa:::u:b/t/e/h/lsssoecccrr"hhhlGeee?armmmdnuaaaagn...tedoooensrrr-äggg?ct///lrzadiameuann.ttLgodheaureoCnagrrrgg/>euecraaua?tgrtaeeriud>eot>nnh?teo?l/lrdaElan3e;tg3neu_aLL;giaenngd.uwiisrttihcs_cOhbajfetc"t@>de;;

1

Ontologies Used

CIDOC-CRM

  • E33_Linguistic_Object: Linguistic object (document)
  • E36_Visual_Item: Visual item
  • E39_Actor: Person (author, observer)
  • E53_Place: Place
  • E77_Persistent_Item: Persistent item
  • P67_refers_to: Refers to
  • P106_is_composed_of: Is composed of
  • P140_assigned_attribute_to: Assigned attribute to

CRMsci (Scientific Observation Extension)

  • S10_Material_Substantial: Material substance
  • S15_Observable_Entity: Observable entity
  • O8_observed: Observed

Odeuropa Custom Extensions

  • L12_Smell_Emission: Smell emission
  • F1_generated: Generated
  • F2_perceived: Perceived
  • F3_had_source: Had source

Schema.org

  • schema:author: Author
  • schema:dateCreated: Creation date
  • schema:inLanguage: Language
  • schema:genre: Genre
  • schema:image: Image
  • schema:position: Position

Significance of the Project

The Odeuropa project is groundbreaking in the following ways:

  1. Digitization of sensory data: Structures “scent,” sensory information that was previously difficult to digitize
  2. Application to historical research: Enables analysis of what past people perceived as scent and how
  3. Linked Data in practice: Implementation of advanced Semantic Web technology using CIDOC-CRM
  4. Interdisciplinary approach: Fusion of history, information science, and sensory studies

Summary

The Odeuropa database is an ambitious project that uses text mining, ontology design, and Linked Data technologies to extract and structure the abstract concept of “scent” from historical documents.

While based on the established cultural heritage ontology CIDOC-CRM, it achieves a reusable and extensible data model by adding scent-specific concepts (Emission, Experience).

This approach can be applied to the digitization of other sensory information (sound, taste, touch, etc.), demonstrating new possibilities for digital humanities.

References