Main Page: Difference between revisions

From FinnoUgric Dataspace
Jump to navigation Jump to search
 
(19 intermediate revisions by 3 users not shown)
Line 1: Line 1:
[[File:Dreams DNBH2025 Poster.jpg|thumb|You can download our poster in a higher resoultion on our [https://reprex.nl/event/2025-03-07_dreams/ event] page.]]
<strong>We started experimenting with the legal, organisational, semantic and technical challenges of creating a genuinely trustworthy, AI-supported data-sharing space that can find and connect tangible and intangible elements of the Finno-Ugric cultural universes. We were also seeking a better governance model for oversight for the custodians of these endangered, shrinking universes in their language and with little technical knowledge, partly as alternatives to the established Wikipedia to the open knowledge incubation method for small linguistic minorities.</strong> See further details on the [https://reprex.nl/project/finnougricdataspace/ project description page].


This experimental platform reimagines how small and endangered language communities—like Võro, Seto, Livonian, and Latgalian—can engage with the Wikimedia ecosystem. Rather than relying on traditional Wikipedia editing models, we use structured data (Wikibase, Lexemes, SPARQL) to narrate cultural heritage through multilingual exhibitions, oral histories, traditional dress, and music. Our approach supports community-led storytelling and data stewardship, building a sustainable foundation for knowledge sharing rooted in local voices and values.
<div style="text-align:center;">
  <strong><span style="font-size: 1.5em;">Finno-Ugric Data Sharing Space</span></strong> 
</div>


Explore how museums, researchers, and communities are collaboratively curating and publishing open datasets, Commons media, and Wiktionary entries that reflect the rich traditions and languages of Finno-Ugric peoples. This work is part of a broader effort to create inclusive and interoperable pathways for cultural and linguistic representation online.
<strong>Finno-Ugric Data Sharing Space (FUDSS) is a multilingual cultural heritage infrastructure connecting museums, archives, libraries, community collections, music archives, photographs, films, and linguistic resources across Finno-Ugric communities. Built on Wikibase, Apache Jena Fuseki, Sampo-UI, and WikiMuseum technologies, it combines structured knowledge graphs, semantic browsing, virtual exhibitions, and linked open data services.</strong>


== Collections ==
[[File:Finno Ugric Data Sharing Space example garments.png|center|800px|alt=Our collection is best view via our Sampo semantic browser via finnougric.net|Our collection is best view via our Sampo semantic browser via [https://finnougric.net/en/ finnougric.net].]]
==== Musical works ====
The following collections contain musical works that are almost always sung in the given language, in some cases, they belong to the musical tradition of these communities without lyrics.


[[Item:Q123|Khanty Mansi Musical Works Collection]]; [[Item:Q136|Samoyedic Musical Works Collection]]; [[Item:Q194|Livonian Musical Works Collection]]; [[Item:Q266|Veps Musical Works Collection]] ;[[Item:Q324|Saami Musical Works Collection]]; [[Item:Q2498|Komi Musical Works Collection]]; [[Item:Q3721|Hungarian Musical Works Collection]]; [[Item:Q3770|Finnish Musical Works Collection]]; [[Item:Q2836|Mari Musical Works Collection]]; [[Item:Q3090|Udmurt Musical Works Collection]]; [[Item:Q3680|Estonian Musical Works Collection]]; [[Item:Q2632|Erzya Moksha Musical Works Collection]]
The platform enables the discovery, enrichment, and publication of tangible and intangible cultural heritage while supporting multilingual access, community participation, and interoperability with Wikimedia, Europeana, and the emerging European Collaborative Cloud for Cultural Heritage (ECCCH).


==== Sound recordings ====
== 🌍 What is FUDSS? ==
The following collections as playlist contain sound recordings of the musical works, you can listen to them on Spotify. We will keep adding other listening options on YouTube, Bandcamp or other licensed players. [[Item:Q117|Khanti Mansi Playlist]]; [[Item:Q118|Samoyedic Playlist]]; [[Item:Q193|Livonian Playlist]]; [[Item:Q265|Veps Playlist]]; [[Item:Q323|Saami Playlist]]; [[Item:Q3720|Hungarian Playlist]]; [[Item:Q3769|Finnish Playlist]]; [[Item:Q2835|Mari Playlist]]; [[Item:Q3089|Udmurt Playlist]]; [[Item:Q3679|Estonian Playlist]]; [[Item:Q2497|Komi Playlist]]; [[Item:Q2631|Erzya Moksha Playlist]].
*We are inviting new curators knowledgeable about the traditional or contemporary music of these peoples and languages.


==== Photographs ====
FUDSS is a federated cultural heritage data space, a multi-layer cultural heritage infrastructure combining knowledge graphs, linked open data, semantic browsing, virtual exhibitions, and multilingual community curation.
[[File:Pedestrian Path to Cape Kolka and the Baltic Sea thumbnail.jpg|left|thumb|180x180px|The mission of the collection is to connect historical and contemporary photographs in various private and public collections.]]
* [[Livonian Photography Collection (21st century)]]: This is a growing collection of contemporary photographs documenting Livonian culture, now available under a Creative Commons license on Wikimedia Commons. Each image is tagged using the emerging ISCC standard for digital media identification, ensuring long-term traceability and interoperability.([[Item:Q896|Livonian Photography Collection (21st century; database link)]].
* [[Seto Historical Photography Collection]]: Curated by Daniel Antal and Dr. Ieva Pigozne, this virtual collection explores the rich cultural history of Setomaa and the Seto people through historical photographs. The images are enhanced with multilingual metadata and integrated into our Finno-Ugric Dataspace.  [[Item:Q3906| Explore the collection in the database →]]] We warmly invite new curators with knowledge of Setomaa to contribute. We are also happy to start similar collection with Võro, Mari, Udmurt, Livonian or other Finno-Ugric communities.
* Where possible, we present both the digitized original and carefully enhanced versions of the photographs. Our post-processing includes respectful restoration—removing visible damage, correcting lighting issues, or cropping to highlight meaningful details such as traditional garments. These improvements support research (e.g., textile analysis) while remaining faithful to the original artifact. In cases where sharpening or exposure adjustments are made, these are considered standard archival practices rather than derivative works, ensuring clarity and usability without altering the photograph’s historical integrity.


==== Garments ====
* A Wikibase knowledge graph containing structured descriptions of people, places, languages, artefacts, recordings, films, and cultural practices.
* [[Item:Q1099|Kurzeme Region Dress History Collection]]: A growing archive of traditional garments from the Kurzeme region of Latvia, showcasing regional styles and textile craftsmanship across time.
* A Fuseki RDF triplestore supporting advanced SPARQL queries and linked open data publication.
* [[Seto Garment Collection]] - A curated selection of historical and contemporary textiles and clothing from Setomaa. This collection highlights distinctive Seto designs, often linked to specific customs, ceremonies, and everyday life.
* A Sampo-UI semantic browser enabling exploratory search, filtering, visualisation, and contextual discovery.
* WikiMuseum exhibitions presenting curated narratives built from the same underlying knowledge graph.
* Multilingual lexicographic resources based on Wikibase Lexemes.
* Connections to Wikimedia projects, Wikidata, Europeana, and ECCCH-aligned infrastructures.


==== Films ====
For more context, see our graphical browser or subscribe to our [https://open.substack.com/pub/finnougric/p/mari-clothing-heritage-across-collections?utm_campaign=post-expanded-share&utm_medium=web blog] or just check out our [https://www.instagram.com/finnougricnet/ instagram account] for a first impression. For a more professional, digital heritage-based presentation read our [https://doi.org/10.5281/zenodo.17938081 publication preprint], visit the [https://reprex.nl/project/finnougricdataspace/ project page] or check out [https://reprex.nl/slides/20250306_dreams/ slide presentation].


[[Finno-Ugric Film Database| Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas)]]: We have recently begun enriching and semantically reprocessing this foundational database, which documents films made by Finno-Ugric creators or in Finno-Ugric languages. Managed by the Finno-Ugric Film Foundation (FUFF), the database offers detailed metadata on production, language, creators, and availability. Examples: [[Item:Q4079|Celestial Wives of the Meadow Mari]]: A Russian drama directed by Aleksei Fedorchenko, filmed in the Mari language and inspired by traditional folklore (2012); [[Item:Q4088|The Land of Love]]: A Nenets-language documentary by Liivo Niglas (2014); [[Item:Q4096|The Afflicted Animal]]: A Northern Sámi short film by Egil Pedersen (2016); [[Item:Q4100|To Speak is to Resist]]: A Komi-language short film by Taawetti Erkinpoika Myrskyvalkea (2024); * [[Item:Q4101|Tõnis Day in Setumaal]]: An Estonian language television documentary by Maie Maasikas (1993).
=== Services ===


== Linguistics (Lexeme) ==
[[File:D Antal collective-management-AI 11.png|center|800px|alt=We reconcile data and metadata across public institutions, community archives, and commercial platforms such as Spotify and YouTube. The same methods support rights clearance, provenance research, and multilingual discovery, enabling overlooked Finno-Ugric heritage—including Livonian and Latvian folk songs—to become visible and accessible to new audiences.|We reconcile data and metadata across public institutions, community archives, and commercial platforms such as Spotify and YouTube. The same methods support rights clearance, provenance research, and multilingual discovery, enabling overlooked Finno-Ugric heritage—including Livonian and Latvian folk songs—to become visible and accessible to new audiences.]]
[[File:Dreams playlist for DHNB2025..png|left|frameless|A playlist curated by Hõimulõimed with songs in the Dream Playlist Dhnb by Hõimulõimed in various Finno-Ugric languages]]
The concept of a [[Item:Q2|dream]] is independent from languages, in the sense "imaginary events seen in the mind while sleeping" it can be expressed with the English [[Lexeme:L1|dream]], the Hungarian [[Lexeme:L2|álom]] or the Estonian [[Lexeme:L3|unenägu]] or the Dutch [[d:Lexeme:L1149861|droom]] nouns. These lexemes behave differently. In Dutch, they it has a masculine grammatical gender. The Hungarian and Estonian lexemes change their forms when you say in your dream. The identification of a dream as a subject of a song lyrics requires the understanding how dream becomes dreams in English, dromen in Dutch, and álmok in Hungarian. If you want to describe the lyrics in statements, you must know that *álmodban* is a form of [[Lexeme:L2|álom]], which refers translates to *in your dream*, i.e., expressing both an adjective and clarifying the owner of the subject, too.


Connecting the senses of lexemes with their translations, we can carry out language independent queries for songs that are about dreams. See our [[Item:Q3714|Dream Playlist DHNB2025]]!
* '''Cultural catalogue export and interoperability''' (Europeana, ECCCH, Wikidata, Wikimedia Commons, RDF and linked open data publication)
* '''Co-curation of exhibitions and collections from dispersed heritage''' (e.g. [https://et.wikipedia.org/wiki/Vikipeedia:GLAM/Traditional_Livonian_Clothing/en/1 Livonian], [https://et.wikipedia.org/wiki/Vikipeedia:GLAM/Traditional_Seto_Clothing Seto], [https://et.wikipedia.org/wiki/Vikipeedia:GLAM/Traditional_Mari_Clothing_in_the_Collection_of_the_Estonian_National_Museum/en/1 Mari] and [https://commons.wikimedia.org/wiki/Category:Udmurt_ethnographic_clothing_from_museum_collections Udmurt] cultural heritage)
* '''Knowledge graph and namespace development for multilingual cultural heritage documentation'''
* '''Metadata enrichment and semantic harmonisation of catalogues''', inventories and collection databases
* '''Publication of high-value datasets''' following FAIR, CARE and linked open data principles
Virtual exhibitions and semantic browsing interfaces using WikiMuseum and Sampo-style technologies
* '''Multilingual lexicographic and terminology resources''' for endangered and minority languages
* '''Rights-aware metadata repair and provenance reconstruction''' for heritage collections and audiovisual materials
* '''Community-based heritage documentation''' and co-curation involving scholars, memory institutions and cultural communities
* '''Heritage Digital Twin''' development for connecting tangible and intangible cultural heritage
* Research infrastructure prototyping and data-space development for small and underrepresented cultural communities


== 🎵 Collections by Type ==
Databases


== Getting started ==
LīvMDb - the [[Item:Q4747|Livonian Music Database]]: a collection of musical works, songs, print music and sound recordings connected to the Livonian language or the Livonian Coast.
* [[mediawikiwiki:Special:MyLanguage/Manual:FAQ|MediaWiki FAQ]]
 
=== Musical Works === 
Explore works sung in Finno-Ugric languages or belonging to their musical traditions: 
[[Item:Q194|Livonian]] • [[Item:Q2836|Mari]] • [[Item:Q3770|Finnish]] • [[Item:Q3721|Hungarian]] • [[Item:Q2498|Komi]] • [[Item:Q3090|Udmurt]] • [[Item:Q3680|Estonian]] • [[Item:Q2632|Erzya Moksha]] • [[Item:Q123|Khanty Mansi]] • [[Item:Q136|Samoyedic]] • [[Item:Q266|Veps]] • [[Item:Q324|Saami]]
 
=== Sound Recordings (Spotify Playlists) === 
Curated playlists of Finno-Ugric music—traditional and contemporary: 
[[Item:Q193|Livonian]] • [[Item:Q2835|Mari]] • [[Item:Q3769|Finnish]] • [[Item:Q3720|Hungarian]] • [[Item:Q2497|Komi]] • [[Item:Q3089|Udmurt]] • [[Item:Q3679|Estonian]] • [[Item:Q2631|Erzya Moksha]] • [[Item:Q117|Khanty Mansi]] • [[Item:Q118|Samoyedic]] • [[Item:Q265|Veps]] • [[Item:Q323|Saami]] • [[Item:Q6775|Karelian]]
 
''We are inviting new curators knowledgeable about the traditional or contemporary music of these communities.''
 
== 📸 Photographs ==
 
[[File:Pedestrian Path to Cape Kolka and the Baltic Sea thumbnail.jpg|left|thumb|180x180px|Connecting past and present through photographic metadata.]]
 
* '''[[Item:Q896|Livonian Photography Collection (21st century)]]''': 
Contemporary photographs under Creative Commons licenses, each tagged with ISCC codes for digital traceability.
 
* '''[[Item:Q4665|Livonian Historical Photography Collection]]''': 
Historical photography focused on Livonian speakers in Northern Courland and the Livonian Coast.
 
* '''[[Seto Historical Photography Collection]]''': 
Curated by Daniel Antal and Dr. Ieva Pigozne. Includes multilingual metadata and community annotations. May be used as a primary photographic collection, or as secondary source for other areas, including 🧵''garments'' (see below.)
 
Where possible, we present both original and enhanced versions of images. Post-processing supports textile research and educational use without compromising historical integrity.
 
== 🧵 Garments ==
[[File:FUDS Q4633 preview.jpg|thumb|[[Item:Q4633|Seto female festive shirt (SU4106:97)]]]]
* '''[[Item:Q1099|Traditional Livonian Clothing Collection]]''': 
Traditional garments worn by Livonians from from the Northern Kurzeme region (Latvia), showcasing weaving, color, and cut diversity.
 
* '''[[Traditional Seto Clothing Collection]]''': 
Historical and ceremonial garments from Setomaa with detailed annotations. Includes primary sources (see [[Item:Q4370| kitasnik pinafore dress (STM SSM 782 E)]] and secondary sources (see the original ethnographic photograph [[Item:Q4044|Seto men in the village of Võmmorski in Setomaa municipality (ERM Fk 213:146)]] and the derivative work: [[Item:Q4051|Trousers of a Seto man in the village of Võmmorski in Setomaa municipality (detail)]])
 
== 🎬 Films ==
'''[[Finno-Ugric Film Database| Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas)]]''':
We are enhancing this foundational database with semantic metadata. 
Examples include: 
==== Feature films ====
* [[Item:Q4079|Celestial Wives of the Meadow Mari]]: A Russian drama directed by Aleksei Fedorchenko, filmed in the Mari language and inspired by traditional folklore (2012).
 
==== Documentaries ====
[[Item:Q4108|Browse the documentary collection →]]
 
* [[Item:Q4088|The Land of Love]]: A Nenets-language documentary by Liivo Niglas (2014).
* [[Item:Q4101|Tõnis Day in Setumaal]]: An Estonian language television documentary by Maie Maasikas (1993).
 
==== Short films ====
[[Item:Q4109|Browse the short film collection →]]
 
* [[Item:Q4096|The Afflicted Animal]]: A Northern Sámi short film by Egil Pedersen (2016).
* [[Item:Q4100|To Speak is to Resist]]: A Komi-language short film by Taawetti Erkinpoika Myrskyvalkea (2024).
 
==== Newsreels & Archives ====
[[Item:Q4106|Browse the audiovisual archive collection →]]
 
== 🗣️ Language & Lexemes ==
[[File:Dreams playlist for DHNB2025..png|right|frameless|A playlist curated by Hõimulõimed with songs in the Dream Playlist Dhnb by Hõimulõimed in various Finno-Ugric languages]]
 
Can machines understand dreams across languages?
 
* English: [[Lexeme:L1|dream]] 
* Hungarian: [[Lexeme:L2|álom]] 
* Estonian: [[Lexeme:L3|unenägu]] 
* Dutch: [https://www.wikidata.org/wiki/Lexeme:L1149861 droom]
 
By connecting senses of lexemes with their grammatical and semantic behavior, we model language-independent queries. 
See the [[Item:Q3714|Dream Playlist DHNB2025]]!
 
== 🧪 Try It, Change It ==
 
This is a sandbox for metadata innovation.
 
* Test metadata enrichment on real items.
* Experiment with multilingual reconnection.
* Prototype governance workflows.
 
''We invite GLAM institutions, researchers, and community curators to explore new models for cultural preservation.''
 
> Real innovation comes not from new software—but from new participation.
 
== 🧭 Start Here ==
[[File:Dreams DNBH2025 Poster.jpg|right|thumb|You can download our poster in high resolution on our [https://reprex.nl/event/2025-03-07_dreams/ event page].]]
 
* [https://reprex.nl/project/finnougricdataspace/ Project Overview] 
* [https://reprex.nl/project/finnougricdataspace/ Digital Dreams And Practices-DNBH 2025 presentation]  and [https://reprex.nl/slides/20250306_dreams/ Slide Presentation]
* [[mediawikiwiki:Special:MyLanguage/Manual:FAQ|MediaWiki FAQ]] 
* [[Special:Search|Search the database]] 
* [[Special:AllPages|Explore all items]]

Latest revision as of 11:57, 5 June 2026

 Finno-Ugric Data Sharing Space  

Finno-Ugric Data Sharing Space (FUDSS) is a multilingual cultural heritage infrastructure connecting museums, archives, libraries, community collections, music archives, photographs, films, and linguistic resources across Finno-Ugric communities. Built on Wikibase, Apache Jena Fuseki, Sampo-UI, and WikiMuseum technologies, it combines structured knowledge graphs, semantic browsing, virtual exhibitions, and linked open data services.

Our collection is best view via our Sampo semantic browser via finnougric.net
Our collection is best view via our Sampo semantic browser via finnougric.net.

The platform enables the discovery, enrichment, and publication of tangible and intangible cultural heritage while supporting multilingual access, community participation, and interoperability with Wikimedia, Europeana, and the emerging European Collaborative Cloud for Cultural Heritage (ECCCH).

🌍 What is FUDSS?

FUDSS is a federated cultural heritage data space, a multi-layer cultural heritage infrastructure combining knowledge graphs, linked open data, semantic browsing, virtual exhibitions, and multilingual community curation.

  • A Wikibase knowledge graph containing structured descriptions of people, places, languages, artefacts, recordings, films, and cultural practices.
  • A Fuseki RDF triplestore supporting advanced SPARQL queries and linked open data publication.
  • A Sampo-UI semantic browser enabling exploratory search, filtering, visualisation, and contextual discovery.
  • WikiMuseum exhibitions presenting curated narratives built from the same underlying knowledge graph.
  • Multilingual lexicographic resources based on Wikibase Lexemes.
  • Connections to Wikimedia projects, Wikidata, Europeana, and ECCCH-aligned infrastructures.

For more context, see our graphical browser or subscribe to our blog or just check out our instagram account for a first impression. For a more professional, digital heritage-based presentation read our publication preprint, visit the project page or check out slide presentation.

Services

We reconcile data and metadata across public institutions, community archives, and commercial platforms such as Spotify and YouTube. The same methods support rights clearance, provenance research, and multilingual discovery, enabling overlooked Finno-Ugric heritage—including Livonian and Latvian folk songs—to become visible and accessible to new audiences.
We reconcile data and metadata across public institutions, community archives, and commercial platforms such as Spotify and YouTube. The same methods support rights clearance, provenance research, and multilingual discovery, enabling overlooked Finno-Ugric heritage—including Livonian and Latvian folk songs—to become visible and accessible to new audiences.
  • Cultural catalogue export and interoperability (Europeana, ECCCH, Wikidata, Wikimedia Commons, RDF and linked open data publication)
  • Co-curation of exhibitions and collections from dispersed heritage (e.g. Livonian, Seto, Mari and Udmurt cultural heritage)
  • Knowledge graph and namespace development for multilingual cultural heritage documentation
  • Metadata enrichment and semantic harmonisation of catalogues, inventories and collection databases
  • Publication of high-value datasets following FAIR, CARE and linked open data principles

Virtual exhibitions and semantic browsing interfaces using WikiMuseum and Sampo-style technologies

  • Multilingual lexicographic and terminology resources for endangered and minority languages
  • Rights-aware metadata repair and provenance reconstruction for heritage collections and audiovisual materials
  • Community-based heritage documentation and co-curation involving scholars, memory institutions and cultural communities
  • Heritage Digital Twin development for connecting tangible and intangible cultural heritage
  • Research infrastructure prototyping and data-space development for small and underrepresented cultural communities

🎵 Collections by Type

Databases

LīvMDb - the Livonian Music Database: a collection of musical works, songs, print music and sound recordings connected to the Livonian language or the Livonian Coast.

Musical Works

Explore works sung in Finno-Ugric languages or belonging to their musical traditions: LivonianMariFinnishHungarianKomiUdmurtEstonianErzya MokshaKhanty MansiSamoyedicVepsSaami

Sound Recordings (Spotify Playlists)

Curated playlists of Finno-Ugric music—traditional and contemporary: LivonianMariFinnishHungarianKomiUdmurtEstonianErzya MokshaKhanty MansiSamoyedicVepsSaamiKarelian

We are inviting new curators knowledgeable about the traditional or contemporary music of these communities.

📸 Photographs

Connecting past and present through photographic metadata.

Contemporary photographs under Creative Commons licenses, each tagged with ISCC codes for digital traceability.

Historical photography focused on Livonian speakers in Northern Courland and the Livonian Coast.

Curated by Daniel Antal and Dr. Ieva Pigozne. Includes multilingual metadata and community annotations. May be used as a primary photographic collection, or as secondary source for other areas, including 🧵garments (see below.)

Where possible, we present both original and enhanced versions of images. Post-processing supports textile research and educational use without compromising historical integrity.

🧵 Garments

Seto female festive shirt (SU4106:97)

Traditional garments worn by Livonians from from the Northern Kurzeme region (Latvia), showcasing weaving, color, and cut diversity.

Historical and ceremonial garments from Setomaa with detailed annotations. Includes primary sources (see kitasnik pinafore dress (STM SSM 782 E) and secondary sources (see the original ethnographic photograph Seto men in the village of Võmmorski in Setomaa municipality (ERM Fk 213:146) and the derivative work: Trousers of a Seto man in the village of Võmmorski in Setomaa municipality (detail))

🎬 Films

Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas):

We are enhancing this foundational database with semantic metadata. Examples include:

Feature films

Documentaries

Browse the documentary collection →

Short films

Browse the short film collection →

Newsreels & Archives

Browse the audiovisual archive collection →

🗣️ Language & Lexemes

A playlist curated by Hõimulõimed with songs in the Dream Playlist Dhnb by Hõimulõimed in various Finno-Ugric languages
A playlist curated by Hõimulõimed with songs in the Dream Playlist Dhnb by Hõimulõimed in various Finno-Ugric languages

Can machines understand dreams across languages?

By connecting senses of lexemes with their grammatical and semantic behavior, we model language-independent queries. See the Dream Playlist DHNB2025!

🧪 Try It, Change It

This is a sandbox for metadata innovation.

  • Test metadata enrichment on real items.
  • Experiment with multilingual reconnection.
  • Prototype governance workflows.

We invite GLAM institutions, researchers, and community curators to explore new models for cultural preservation.

> Real innovation comes not from new software—but from new participation.

🧭 Start Here

You can download our poster in high resolution on our event page.