Main Page: Difference between revisions

From FinnoUgric Dataspace
AsmahFederico (talk | contribs)
add a space to fix formatting
Major reedit
Line 1: Line 1:
[[File:Dreams DNBH2025 Poster.jpg|thumb|You can download our poster in a higher resolution on our [https://reprex.nl/event/2025-03-07_dreams/ event] page.]]
<strong>We started experimenting with the legal, organisational, semantic and technical challenges of creating a genuinely trustworthy, AI-supported data-sharing space that can find and connect tangible and intangible elements of the Finno-Ugric cultural universes. We were also seeking a better governance model that lets the custodians of these endangered, shrinking universes exercise oversight in their own languages and with little technical knowledge, partly as an alternative to the established Wikipedia method of open knowledge incubation for small linguistic minorities.</strong> See further details on the [https://reprex.nl/project/finnougricdataspace/ project description page].


This experimental platform reimagines how small and endangered language communities—like Võro, Seto, Livonian, and Latgalian—can engage with the Wikimedia ecosystem. Rather than relying on traditional Wikipedia editing models, we use structured data (Wikibase, Lexemes, SPARQL) to narrate cultural heritage through multilingual exhibitions, oral histories, traditional dress, and music. Our approach supports community-led storytelling and data stewardship, building a sustainable foundation for knowledge sharing rooted in local voices and values.
<div style="text-align:center;">
  <strong><span style="font-size: 1.5em;">Finno-Ugric Data Sharing Space</span></strong> 
</div>


Explore how museums, researchers, and communities are collaboratively curating and publishing open datasets, Commons media, and Wiktionary entries that reflect the rich traditions and languages of Finno-Ugric peoples. This work is part of a broader effort to create inclusive and interoperable pathways for cultural and linguistic representation online.
[[File:Dreams DNBH2025 Poster.jpg|right|thumb|You can download our poster in high resolution on our [https://reprex.nl/event/2025-03-07_dreams/ event page].]]


== Collections ==
<strong>We are building a trustworthy, AI-supported data-sharing space to connect tangible and intangible elements of the Finno-Ugric cultural universes—textiles, music, oral history, photography, and language. This platform explores legal, organizational, semantic, and technical challenges while offering new governance models for community-based digital curation.</strong>
==== Musical works ====
The following collections contain musical works that are almost always sung in the given language; in some cases, they belong to the musical tradition of these communities without lyrics.


[[Item:Q123|Khanty Mansi Musical Works Collection]]; [[Item:Q136|Samoyedic Musical Works Collection]]; [[Item:Q194|Livonian Musical Works Collection]]; [[Item:Q266|Veps Musical Works Collection]]; [[Item:Q324|Saami Musical Works Collection]]; [[Item:Q2498|Komi Musical Works Collection]]; [[Item:Q3721|Hungarian Musical Works Collection]]; [[Item:Q3770|Finnish Musical Works Collection]]; [[Item:Q2836|Mari Musical Works Collection]]; [[Item:Q3090|Udmurt Musical Works Collection]]; [[Item:Q3680|Estonian Musical Works Collection]]; [[Item:Q2632|Erzya Moksha Musical Works Collection]]
Rather than replicating established Wikipedia methods, we work with structured data tools like Wikibase, Lexemes, and SPARQL. Our approach centers multilinguality, shared custodianship, and the ability of communities—Võro, Seto, Livonian, Mari, and others—to describe their own heritage on their terms.


==== Sound recordings ====
== 🌍 What is This Platform? ==
The following playlist collections contain sound recordings of the musical works; you can listen to them on Spotify. We will keep adding other listening options on YouTube, Bandcamp or other licensed players. [[Item:Q117|Khanty Mansi Playlist]]; [[Item:Q118|Samoyedic Playlist]]; [[Item:Q193|Livonian Playlist]]; [[Item:Q265|Veps Playlist]]; [[Item:Q323|Saami Playlist]]; [[Item:Q3720|Hungarian Playlist]]; [[Item:Q3769|Finnish Playlist]]; [[Item:Q2835|Mari Playlist]]; [[Item:Q3089|Udmurt Playlist]]; [[Item:Q3679|Estonian Playlist]]; [[Item:Q2497|Komi Playlist]]; [[Item:Q2631|Erzya Moksha Playlist]].
*We are inviting new curators knowledgeable about the traditional or contemporary music of these peoples and languages.


==== Photographs ====
This experimental knowledge graph helps small and endangered language communities:
[[File:Pedestrian Path to Cape Kolka and the Baltic Sea thumbnail.jpg|left|thumb|180x180px|The mission of the collection is to connect historical and contemporary photographs in various private and public collections.]]
* [[Livonian Photography Collection (21st century)]]: This is a growing collection of contemporary photographs documenting Livonian culture, now available under a Creative Commons license on Wikimedia Commons. Each image is tagged using the emerging ISCC standard for digital media identification, ensuring long-term traceability and interoperability ([[Item:Q896|Livonian Photography Collection (21st century; database link)]]).
* [[Seto Historical Photography Collection]]: Curated by Daniel Antal and Dr. Ieva Pigozne, this virtual collection explores the rich cultural history of Setomaa and the Seto people through historical photographs. The images are enhanced with multilingual metadata and integrated into our Finno-Ugric Dataspace. [[Item:Q3906|Explore the collection in the database →]] We warmly invite new curators with knowledge of Setomaa to contribute. We are also happy to start similar collections with Võro, Mari, Udmurt, Livonian or other Finno-Ugric communities.
* Where possible, we present both the digitized original and carefully enhanced versions of the photographs. Our post-processing includes respectful restoration—removing visible damage, correcting lighting issues, or cropping to highlight meaningful details such as traditional garments. These improvements support research (e.g., textile analysis) while remaining faithful to the original artifact. In cases where sharpening or exposure adjustments are made, these are considered standard archival practices rather than derivative works, ensuring clarity and usability without altering the photograph’s historical integrity.


==== Garments ====
* Document and connect culturally significant materials (songs, clothing, places, people).
* [[Item:Q1099|Kurzeme Region Dress History Collection]]: A growing archive of traditional garments from the Kurzeme region of Latvia, showcasing regional styles and textile craftsmanship across time.
* Create multilingual, multimodal metadata in structured formats.
* [[Seto Garment Collection]] - A curated selection of historical and contemporary textiles and clothing from Setomaa. This collection highlights distinctive Seto designs, often linked to specific customs, ceremonies, and everyday life.
* Model heritage semantically without needing full encyclopedic coverage.
* Try new digital curation workflows in a low-risk, unaffiliated environment.


==== Films ====
The platform is not owned by any national institution and is open for scholars, curators, and communities to test metadata enrichment, federated linking, and feedback models that large systems rarely permit.


[[Finno-Ugric Film Database| Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas)]]: We have recently begun enriching and semantically reprocessing this foundational database, which documents films made by Finno-Ugric creators or in Finno-Ugric languages. Managed by the Finno-Ugric Film Foundation (FUFF), the database offers detailed metadata on production, language, creators, and availability. Examples: [[Item:Q4079|Celestial Wives of the Meadow Mari]]: A Russian drama directed by Aleksei Fedorchenko, filmed in the Mari language and inspired by traditional folklore (2012); [[Item:Q4088|The Land of Love]]: A Nenets-language documentary by Liivo Niglas (2014); [[Item:Q4096|The Afflicted Animal]]: A Northern Sámi short film by Egil Pedersen (2016); [[Item:Q4100|To Speak is to Resist]]: A Komi-language short film by Taawetti Erkinpoika Myrskyvalkea (2024); [[Item:Q4101|Tõnis Day in Setumaal]]: An Estonian-language television documentary by Maie Maasikas (1993).
For more context, see our [https://reprex.nl/project/finnougricdataspace/ project page] or [https://reprex.nl/slides/20250306_dreams/ slide presentation].


== Linguistics (Lexeme) ==
== 🎵 Collections by Type ==
[[File:Dreams playlist for DHNB2025..png|left|frameless|The Dream Playlist DHNB2025, curated by Hõimulõimed, with songs in various Finno-Ugric languages]]
The concept of a [[Item:Q2|dream]] is independent of any particular language: in the sense "imaginary events seen in the mind while sleeping", it can be expressed with the English noun [[Lexeme:L1|dream]], the Hungarian [[Lexeme:L2|álom]], the Estonian [[Lexeme:L3|unenägu]], or the Dutch [[d:Lexeme:L1149861|droom]]. These lexemes behave differently. In Dutch, the noun has masculine grammatical gender. The Hungarian and Estonian lexemes change their forms when you say "in your dream". Identifying a dream as the subject of song lyrics requires understanding how ''dream'' becomes ''dreams'' in English, ''dromen'' in Dutch, and ''álmok'' in Hungarian. If you want to describe the lyrics in statements, you must know that ''álmodban'' is a form of [[Lexeme:L2|álom]] that translates to ''in your dream'', i.e., a single word form expresses both the location (in) and the possessor (your).


By connecting the senses of lexemes with their translations, we can carry out language-independent queries for songs that are about dreams. See our [[Item:Q3714|Dream Playlist DHNB2025]]!
=== Musical Works === 
Explore works sung in Finno-Ugric languages or belonging to their musical traditions: 
[[Item:Q194|Livonian]] • [[Item:Q2836|Mari]] • [[Item:Q3770|Finnish]] • [[Item:Q3721|Hungarian]] • [[Item:Q2498|Komi]] • [[Item:Q3090|Udmurt]] • [[Item:Q3680|Estonian]] • [[Item:Q2632|Erzya Moksha]] • [[Item:Q123|Khanty Mansi]] • [[Item:Q136|Samoyedic]] • [[Item:Q266|Veps]] • [[Item:Q324|Saami]]


=== Sound Recordings (Spotify Playlists) === 
Curated playlists of Finno-Ugric music—traditional and contemporary: 
[[Item:Q193|Livonian]] • [[Item:Q2835|Mari]] • [[Item:Q3769|Finnish]] • [[Item:Q3720|Hungarian]] • [[Item:Q2497|Komi]] • [[Item:Q3089|Udmurt]] • [[Item:Q3679|Estonian]] • [[Item:Q2631|Erzya Moksha]] • [[Item:Q117|Khanty Mansi]] • [[Item:Q118|Samoyedic]] • [[Item:Q265|Veps]] • [[Item:Q323|Saami]]


== Getting started ==
''We are inviting new curators knowledgeable about the traditional or contemporary music of these communities.''
* [[mediawikiwiki:Special:MyLanguage/Manual:FAQ|MediaWiki FAQ]]
 
== 📸 Photographs ==
 
[[File:Pedestrian Path to Cape Kolka and the Baltic Sea thumbnail.jpg|left|thumb|180x180px|Connecting past and present through photographic metadata.]]
 
* '''[[Item:Q896|Livonian Photography Collection (21st century)]]''': 
Contemporary photographs under Creative Commons licenses, each tagged with ISCC codes for digital traceability.
 
* '''[[Item:Q3906|Seto Historical Photography Collection]]''': 
Curated by Daniel Antal and Dr. Ieva Pigozne. Includes multilingual metadata and community annotations.
 
Where possible, we present both original and enhanced versions of images. Post-processing supports textile research and educational use without compromising historical integrity.
 
== 🧵 Garments ==
 
* '''[[Item:Q1099|Kurzeme Region Dress History Collection]]''': 
Traditional garments worn by Livonians from the Northern Kurzeme region (Latvia), showcasing weaving, color, and cut diversity.
 
* '''Seto Garment Collection''': 
Historical and ceremonial garments from Setomaa with detailed annotations.
 
== 🎬 Films ==
'''[[Finno-Ugric Film Database| Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas)]]''':
We are enhancing this foundational database with semantic metadata. 
Examples include: 
==== Feature films ====
* [[Item:Q4079|Celestial Wives of the Meadow Mari]]: A Russian drama directed by Aleksei Fedorchenko, filmed in the Mari language and inspired by traditional folklore (2012).
 
==== Documentaries ====
[[Item:Q4108|Browse the documentary collection →]]
 
* [[Item:Q4088|The Land of Love]]: A Nenets-language documentary by Liivo Niglas (2014).
* [[Item:Q4101|Tõnis Day in Setumaal]]: An Estonian-language television documentary by Maie Maasikas (1993).
 
==== Short films ====
[[Item:Q4109|Browse the short film collection →]]
 
* [[Item:Q4096|The Afflicted Animal]]: A Northern Sámi short film by Egil Pedersen (2016).
* [[Item:Q4100|To Speak is to Resist]]: A Komi-language short film by Taawetti Erkinpoika Myrskyvalkea (2024).
 
==== Newsreels & Archives ====
[[Item:Q4106|Browse the audiovisual archive collection →]]
 
== 🗣️ Language & Lexemes ==
[[File:Dreams playlist for DHNB2025..png|right|frameless|The Dream Playlist DHNB2025, curated by Hõimulõimed, with songs in various Finno-Ugric languages]]
 
Can machines understand dreams across languages?
 
* English: [[Lexeme:L1|dream]] 
* Hungarian: [[Lexeme:L2|álom]] 
* Estonian: [[Lexeme:L3|unenägu]] 
* Dutch: [https://www.wikidata.org/wiki/Lexeme:L1149861 droom]
 
By connecting senses of lexemes with their grammatical and semantic behavior, we model language-independent queries. 
See the [[Item:Q3714|Dream Playlist DHNB2025]]!
 
== 🧪 Try It, Change It ==
 
This is a sandbox for metadata innovation.
 
* Test metadata enrichment on real items.
* Experiment with multilingual reconnection.
* Prototype governance workflows.
 
''We invite GLAM institutions, researchers, and community curators to explore new models for cultural preservation.''
 
> Real innovation comes not from new software—but from new participation.
 
== 🧭 Start Here ==
 
* [https://reprex.nl/project/finnougricdataspace/ Project Overview] 
* [https://reprex.nl/project/finnougricdataspace/ Digital Dreams And Practices-DHNB 2025 presentation] 
* [https://reprex.nl/slides/20250306_dreams/ Slide Presentation]
* [[mediawikiwiki:Special:MyLanguage/Manual:FAQ|MediaWiki FAQ]] 
* [[Special:Search|Search the database]] 
* [[Special:AllPages|Explore all items]]

Revision as of 08:57, 11 June 2025

 Finno-Ugric Data Sharing Space  
You can download our poster in high resolution on our event page.

We are building a trustworthy, AI-supported data-sharing space to connect tangible and intangible elements of the Finno-Ugric cultural universes—textiles, music, oral history, photography, and language. This platform explores legal, organizational, semantic, and technical challenges while offering new governance models for community-based digital curation.

Rather than replicating established Wikipedia methods, we work with structured data tools like Wikibase, Lexemes, and SPARQL. Our approach centers multilinguality, shared custodianship, and the ability of communities—Võro, Seto, Livonian, Mari, and others—to describe their own heritage on their terms.

🌍 What is This Platform?

This experimental knowledge graph helps small and endangered language communities:

  • Document and connect culturally significant materials (songs, clothing, places, people).
  • Create multilingual, multimodal metadata in structured formats.
  • Model heritage semantically without needing full encyclopedic coverage.
  • Try new digital curation workflows in a low-risk, unaffiliated environment.

The platform is not owned by any national institution and is open for scholars, curators, and communities to test metadata enrichment, federated linking, and feedback models that large systems rarely permit.

For more context, see our project page or slide presentation.

🎵 Collections by Type

Musical Works

Explore works sung in Finno-Ugric languages or belonging to their musical traditions: Livonian • Mari • Finnish • Hungarian • Komi • Udmurt • Estonian • Erzya Moksha • Khanty Mansi • Samoyedic • Veps • Saami

Sound Recordings (Spotify Playlists)

Curated playlists of Finno-Ugric music—traditional and contemporary: Livonian • Mari • Finnish • Hungarian • Komi • Udmurt • Estonian • Erzya Moksha • Khanty Mansi • Samoyedic • Veps • Saami

We are inviting new curators knowledgeable about the traditional or contemporary music of these communities.

📸 Photographs

Connecting past and present through photographic metadata.

  • Livonian Photography Collection (21st century):

Contemporary photographs under Creative Commons licenses, each tagged with ISCC codes for digital traceability.

  • Seto Historical Photography Collection:

Curated by Daniel Antal and Dr. Ieva Pigozne. Includes multilingual metadata and community annotations.

Where possible, we present both original and enhanced versions of images. Post-processing supports textile research and educational use without compromising historical integrity.

🧵 Garments

  • Kurzeme Region Dress History Collection:

Traditional garments worn by Livonians from the Northern Kurzeme region (Latvia), showcasing weaving, color, and cut diversity.

  • Seto Garment Collection:

Historical and ceremonial garments from Setomaa with detailed annotations.

🎬 Films

Finno-Ugric Film Database (Soome-Ugri Filmiandmebaas):

We are enhancing this foundational database with semantic metadata. Examples include:

Feature films

Documentaries

Browse the documentary collection →

Short films

Browse the short film collection →

Newsreels & Archives

Browse the audiovisual archive collection →

🗣️ Language & Lexemes

The Dream Playlist DHNB2025, curated by Hõimulõimed, with songs in various Finno-Ugric languages

Can machines understand dreams across languages?

By connecting senses of lexemes with their grammatical and semantic behavior, we model language-independent queries. See the Dream Playlist DHNB2025!
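
The idea of a language-independent query can be sketched in a few lines of Python. This is only an illustration, not the platform's actual data model or query: the lexeme identifiers (L1, L2, L3, Wikidata L1149861), the concept item Q2 and the word forms (dreams, dromen, álmok, álmodban) come from this page, while the class, the function names, the language codes and the lyric snippets are assumptions invented for the example. The form lists are deliberately incomplete.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Lexeme:
    lexeme_id: str   # lexeme identifier as cited on this page
    language: str    # language code (assumed for illustration)
    lemma: str       # dictionary form
    concept: str     # item that the lexeme's sense points to
    forms: tuple     # known word forms, lemma included (incomplete)


# Lexemes for the concept "dream" (Q2) named on this page.
LEXEMES = [
    Lexeme("L1", "en", "dream", "Q2", ("dream", "dreams")),
    Lexeme("L2", "hu", "álom", "Q2", ("álom", "álmok", "álmodban")),
    Lexeme("L3", "et", "unenägu", "Q2", ("unenägu",)),
    Lexeme("L1149861", "nl", "droom", "Q2", ("droom", "dromen")),
]

# Index every known word form by the concept its lexeme expresses.
FORM_TO_CONCEPT = {form: lex.concept for lex in LEXEMES for form in lex.forms}

# Hypothetical lyric snippets, invented for this example only.
LYRICS = {
    "song-en": "All my dreams tonight",
    "song-hu": "Álmodban ott vagyok",
    "song-nl": "Mijn dromen",
    "song-other": "Morning light",
}


def concepts_in(text):
    """Return the set of concept items whose word forms occur in the text."""
    words = re.findall(r"\w+", text.lower())
    return {FORM_TO_CONCEPT[w] for w in words if w in FORM_TO_CONCEPT}


def songs_about(concept):
    """Return the songs whose lyrics mention the concept, in any language."""
    return sorted(s for s, text in LYRICS.items() if concept in concepts_in(text))


print(songs_about("Q2"))
```

The query asks for the concept (Q2), never for a word in a particular language, so a Hungarian lyric containing álmodban and a Dutch lyric containing dromen are found by the same call. In a Wikibase setting the same lookup would presumably be expressed in SPARQL over lexeme forms and senses rather than over an in-memory dictionary.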

🧪 Try It, Change It

This is a sandbox for metadata innovation.

  • Test metadata enrichment on real items.
  • Experiment with multilingual reconnection.
  • Prototype governance workflows.

We invite GLAM institutions, researchers, and community curators to explore new models for cultural preservation.

> Real innovation comes not from new software—but from new participation.

🧭 Start Here