INEL Nganasan corpus

The annotated corpus of Nganasan (< Samoyedic < Uralic) is available for online search or download under a CC-BY-NC-SA license. Corpus size in words: 221746. You will find full documentation here.

About

The INEL Nganasan Corpus has been created within the long-term INEL project headed by Prof. Dr. Beáta Wagner-Nagy, scheduled for 2016–2033.

The corpus enables typologically aware, corpus-based grammatical research on the Nganasan language and expands the documentation of the lesser-described indigenous languages of Northern Eurasia. The corpus is largely based on the Nganasan Spoken Language Corpus, which has been adapted to the INEL standards and supplemented with new texts.

The INEL Nganasan corpus consists of two parts. The glossed (searchable) part of the corpus includes texts provided with source media files (whenever available) and annotated transcripts. The archival part of the corpus contains non-glossed texts, represented either by audio recordings (some with preliminary transcriptions) or by scanned pages of manuscripts or publications.

The corpus includes texts recorded in 1933–2019. The sources of the corpus are:

Corpus size

The glossed (searchable) part of the corpus contains 236 texts, 34,872 sentences and 221,747 tokens. The total duration of the audio recordings is 49 hours 53 minutes.

The archival part of the corpus contains 98 hours of audio material (210 texts) and 30 manuscripts.

Funding

The corpus has been produced in the context of the joint research funding of the German Federal Government and Federal States in the Academies’ Programme, with funding from the Federal Ministry of Education and Research and the Free and Hanseatic City of Hamburg. The Academies’ Programme is coordinated by the Union of the German Academies of Sciences and Humanities.

The Nganasan Spoken Language Corpus, which was integrated into the INEL Nganasan corpus, was created as part of the project Corpus based grammatical studies on Nganasan at the Institute of Finno-Ugric/Uralic Studies of Universität Hamburg. The project was supported by the Deutsche Forschungsgemeinschaft under grant number WA3153/2-1 between 2014 and 2017.

Contributions/Acknowledgements

Many native speakers shared their knowledge of Nganasan and thus made the existence of this corpus possible (see the documentation file below, Appendix A1). We are especially grateful to those who spent days and sometimes months working with us: Svetlana S. Aksyonova, Zinaida S. Chebodaeva, Nikolai S. Chunanchar, Nina D. Chunanchar, Yuliya M. Goricheva, Ekaterina Ch. Kokore, Ekaterina S. Kosterkina, Nadezhda T. Kosterkina, Svetlana M. Kudryakova, Serafima M. Kupchik, Tat`yana T. Kuzenko, Aleksandr Ch. Momde, Dar`ya Ch. Momde, Vera L. Momde, Vasilij F. Porbin, Evdokiya D. Porbina, Mariya M. Porbina, Zoya Ch. Porbina, Galina F. Porotova, Ekaterina N. Sovalova, Lodun N. Turdagina, Nadezhda K. Turdagina, Tat`yana D. Turkina, Mariya D. Yarotskaya, Sy`ku M. Yarotskaya.

The Department of Siberian Indigenous Languages of Tomsk State Pedagogical University and the Institute for Linguistic Studies RAS kindly provided access to their archives.

The Dudinka branch of GTRK “Norilsk” generously provided access to the Nganasan part of its extensive audio archive.

The Taimyr House of National Arts and the City Centre of National Arts in Dudinka helped and supported us during our field trips.

Search

The Tsakorpus search system is used for the online search. You can search by lemma (root), word form, glosses and grammatical tags. To build an advanced search query, you can combine several parameters or specify a distance between search terms. You can also narrow down your search to a subcorpus. For more information, use the ❔ button at the top of the search page.

For offline search, you can download the corpus from the ZFDM Repository. A downloaded corpus can be browsed or searched locally using the EXMARaLDA software or, alternatively, ELAN. Remote search with EXMARaLDA is also possible without downloading all the files (see here).
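
For readers who prefer scripting over a graphical tool, a downloaded transcript can also be inspected with a few lines of Python. The following is only a minimal sketch, not part of the corpus tooling. It assumes the transcripts are EXMARaLDA basic transcriptions (.exb), in which each annotation tier is a `tier` element with a `category` attribute and each annotation is an `event` element with `start` and `end` attributes. The tier categories `ge` and `fe` and the sample content are invented for illustration, and `find_events` is a hypothetical helper name; check the corpus documentation for the actual tier inventory.

```python
import xml.etree.ElementTree as ET


def find_events(root, category, needle):
    """Return (start, end, text) for every event in tiers of the given
    category whose text contains `needle` (plain substring match)."""
    hits = []
    for tier in root.iter("tier"):
        if tier.get("category") != category:
            continue
        for event in tier.iter("event"):
            text = "".join(event.itertext())
            if needle in text:
                hits.append((event.get("start"), event.get("end"), text))
    return hits


# Invented sample data in the assumed .exb layout (not taken from the corpus).
SAMPLE = """<basic-transcription>
  <basic-body>
    <common-timeline><tli id="T0"/><tli id="T1"/><tli id="T2"/></common-timeline>
    <tier id="ge" category="ge" type="a">
      <event start="T0" end="T1">reindeer-PL</event>
      <event start="T1" end="T2">come-PST.3SG</event>
    </tier>
    <tier id="fe" category="fe" type="a">
      <event start="T0" end="T2">The reindeer came.</event>
    </tier>
  </basic-body>
</basic-transcription>"""

sample_root = ET.fromstring(SAMPLE)
print(find_events(sample_root, "ge", "PST"))
```

For a file on disk, `ET.parse(path).getroot()` yields the root element to pass in. A substring match is deliberately simple; it does not replace the query options of Tsakorpus or EXMARaLDA.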


Other links

Akademie der Wissenschaften in Hamburg
Universität Hamburg