INEL Ressourcen Portal

Cite Shluinsky, Andrey; Khanina, Olesya; Wagner-Nagy, Beáta. 2025. INEL Enets Corpus. Version 1.1. Publication date 2025-12-31. https://hdl.handle.net/11022/0000-0008-005C-1. Archived at Universität Hamburg. In: The INEL corpora of indigenous Northern Eurasian languages. https://hdl.handle.net/11022/0000-0007-F45A-1

Enzisch

Enets

Das INEL Enets Korpus wurde im Rahmen des Langzeitprojekts INEL (Grammatiken, Korpora und Sprachtechnologie für indigene nordeurasische Sprachen, 2016-2033) erstellt. Das Korpus umfasst Texte, die zwischen 1962-2017 in Wald- und Tundra-Enzisch aufgezeichnet wurden. Die Quellen des Korpus (siehe weitere Details in der Benutzerdokumentation, Abschnitt 2.2) sind:

Audioaufnahmen von Olesya Khanina, Maria Ovsjannikova, Andrey Shluinsky, Natalia Stoynova und Sergey Trubetskoy
Legacy Audioaufnahmen von Vera Bettu, Nina N. Bolina, Dar`ya S. Bolina, Zoya N. Bolina, Oksana E. Dobzhanskaya, Valentin Gusev, Eugene Helimski†, Kazimir I. Labanauskas†, Larisa Leisiö, Marina Lyublinskaya, Kaur Mägi, Viktor N. Pal`chin, Marina N. Pal`china, Irina P. Sorokina†, Anna Urmanchieva, Beáta Wagner-Nagy und möglicherweise weitere Personen,
Veröffentlichte Audioaufnahmen
Veröffentlichte Texte von Dar`ya S. Bolina, Yaroslav A. Gluxij† und Vasilij A. Susekov†, Eugene Helimski†, Kazimir I. Labanauskas†, Tibor Mikola†, János Pusztay, Irina P. Sorokina†, Anna Urmanchieva
Manuskripttranskriptionen und Selbsttranskriptionen, die von Dar`ya S. Bolina, Galina S. Bolina, Zoya N. Bolina, Valentin Gusev, Eugene Helimski†, Kazimir I. Labanauskas†, Larisa Leisiö, Marina Lyublinskaya, Vasilij F. Ly`rmin†, Anton N. Pal`chin, Viktor N. Pal`chin, Ivan I. Silkin†, Irina P. Sorokina†, Natal`ya M. Tereščenko†, Anna Urmanchieva und möglicherweise anderen Personen erfasst wurden.

Neu in Version 1.1

Die Annotation syntaktischer Funktionen (Spur-Kategorie „SyF“) ist nun für 55 weitere Texte verfügbar, davon 52 Folklortexte und 3 narrativen Texte.
Für Texte aus veröffentlichten und archivierten Quellen sowie Manuskripten wurden detaillierte Verweise in den Abschnitt „Citation“ der Dokumentation und in das entsprechende Feld in der Metadatendatei des Korpus aufgenommen.

Alle Texte im Korpus sind mit interlinearen Morphem-zu-Morphem-Glossen und Übersetzungen ins Englische und Russische versehen. Alle Texte, für die Audioaufnahmen zugänglich waren, sind mit diesen zeitlich aligniert. Sofern verfügbar, wurden auch Videoaufnahmen in das Korpus aufgenommen.

Zugang über das Zentrum für nachhaltiges Forschungsdatenmanagement der Universität Hamburg:

Download des gesamte Korpus als Zip-Archiv inkl. Audio- und Videoressourcen (28 GB), nur mit Adio in WAV Format (13,6 GB), mit Audio in MP3 Format (4,3 GB) und ohne Medienressourcen (110,5 Mb)
Online Suche mithilfe von Tsakorpus
Dokumentation
Vollständige Übersicht über Korpusinhalte und Metadaten (oder siehe unten)
Remote Suche mithilfe des Suchwerkzeugs EXAKT (EXMARALDA Anaylse- und Konkordanzprogramm).

The INEL Enets corpus has been created within the long-term INEL project (Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages, 2016–2033). The corpus includes texts recorded between 1962–2017 in both Enets lects – Forest Enets and Tundra Enets. The sources of the corpus (see more details in the user documentation, section 2.2) are:

Audio recordings done by Olesya Khanina, Maria Ovsjannikova, Andrey Shluinsky, Natalia Stoynova and Sergey Trubetskoy
Legacy audio recordings done by Vera Bettu, Nina N. Bolina, Dar`ya S. Bolina, Zoya N. Bolina, Oksana E. Dobzhanskaya, Valentin Gusev, Eugene Helimski†, Kazimir I. Labanauskas†, Larisa Leisiö, Marina Lyublinskaya, Kaur Mägi, Viktor N. Pal`chin, Marina N. Pal`china, Irina P. Sorokina†, Anna Urmanchieva, Beáta Wagner-Nagy and possibly other people
Published audio recordings
Texts published by Dar`ya S. Bolina, Yaroslav A. Gluxij† and Vasilij A. Susekov†, Eugene Helimski†, Kazimir I. Labanauskas†, Tibor Mikola†, János Pusztay, Irina P. Sorokina†, Anna Urmanchieva
Legacy manuscript transcriptions and self-transcriptions done and/or edited by Dar`ya S. Bolina, Galina S. Bolina, Zoya N. Bolina, Valentin Gusev, Eugene Helimski†, Kazimir I. Labanauskas†, Larisa Leisiö, Marina Lyublinskaya, Vasilij F. Ly`rmin†, Anton N. Pal`chin, Viktor N. Pal`chin, Ivan I. Silkin†, Irina P. Sorokina†, Natal`ya M. Tereščenko†, Anna Urmanchieva and possibly other people.

New in release 1.1

Annotation of syntactic functions (tier category "SyF") is now available for 55 additional texts, of which 52 are folklore and 3 – narrative;
For texts originating from published and archival sources, as well as manuscripts, detailed references were added to the "Citation" section of the documentation and the respective field in the corpus metadata file.

All texts in the corpus are provided with interlinear morpheme-by-morpheme glosses and translation into English and Russian. All texts for which the audio recordings were accessible are time-aligned with them. Video recordings are also included into the corpus if available.

Access to the corpus via the Center for sustainable research data management:

Download the corpus as a zip archive incl. audio and video materials (28 GB), only with audio in WAV format (13.6 GB), with audio in mp3 format (4.3 GB) and without any media sources (110.5 Mb)

Online search using Tsakorpus

Documentation

Complete Overview of corpus files and metadata (or see below)

Remote search using EXAKT (EXMARALDA Anaylsis- and concordance software).

Download Online (Tsakorpus)