INEL Selkup Corpus 1.0
How to cite the corpus
Brykina, Maria; Orlova, Svetlana; Wagner-Nagy, Beáta. 2020. INEL Selkup Corpus. Version 1.0. Publication date 2020-06-16. Archived in Hamburger Zentrum für Sprachkorpora. http://hdl.handle.net/11022/0000-0007-CAE5-3. In: Wagner-Nagy, Beáta; Arkhipov, Alexandre; Ferger, Anne; Jettka, Daniel; Lehmberg, Timm (eds.). 2020. The INEL corpora of indigenous Northern Eurasian languages.
To cite a particular sentence, you can use the reference code which appears below the examples (e.g. "NST_1965_Tyrshaqo1_flk.001"). The reference codes include the text name and the number of the sentence (continuous numbering for each speaker). The following number(s) in parentheses can be disregarded.
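As an illustration of how such a reference code breaks down, here is a minimal sketch that splits a code into the text name and the sentence number and drops any trailing parenthesized number. The function name `parse_reference` and the exact layout of the parenthesized suffix are assumptions made for this example; they are not part of the corpus software.

```python
import re

# Assumed layout: <text name>.<sentence number>, optionally followed by
# a parenthesized suffix that can be disregarded.
REF_PATTERN = re.compile(
    r"^(?P<text>[^\s()]+)\.(?P<num>\d+)(?:\s*\([^)]*\))?$"
)


def parse_reference(code):
    """Return (text_name, sentence_number) for a reference code."""
    match = REF_PATTERN.match(code.strip())
    if match is None:
        raise ValueError(f"Unrecognized reference code: {code!r}")
    return match.group("text"), int(match.group("num"))


if __name__ == "__main__":
    print(parse_reference("NST_1965_Tyrshaqo1_flk.001"))
```

For the example code quoted above, this yields the text name `NST_1965_Tyrshaqo1_flk` and the sentence number 1.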
If you want to share your query with someone, send them the text you see below. The person who loads this query will see the same results in the same order, unless the corpus has been re-indexed.
Here you can load a corpus query someone sent you. Please enter the query below:
The plot below works as follows. The x axis shows frequency ranks, i.e. positions in the full list of all word forms or lemmata of the corpus, ordered by decreasing frequency. If several words or lemmata have the same frequency, they all receive the same rank, equal to the average of their positions. For each frequency rank r, the y axis shows the proportion of words or lemmata that conform to your query (each word counts only once) among all words or lemmata with a frequency rank less than or equal to r. The rightmost point therefore shows the total proportion of such words or lemmata among all types (different words) in the corpus. In the case of lemmata, every lemma that has at least one word form conforming to the query is counted.
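The calculation described above can be sketched as follows. This is only an illustration under stated assumptions, not the code the corpus interface actually runs: the names `average_ranks` and `proportion_curve`, the frequency dictionary, and the `matches` predicate are all hypothetical.

```python
from collections import defaultdict


def average_ranks(freqs):
    """Map each type to its frequency rank; ties share the mean position."""
    ordered = sorted(freqs, key=lambda w: -freqs[w])
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and freqs[ordered[j]] == freqs[ordered[i]]:
            j += 1
        # 1-based positions i+1 .. j have equal frequency: use their average.
        avg = (i + 1 + j) / 2
        for w in ordered[i:j]:
            ranks[w] = avg
        i = j
    return ranks


def proportion_curve(freqs, matches):
    """Return (rank, proportion) points: the share of matching types
    among all types with a frequency rank less than or equal to rank."""
    ranks = average_ranks(freqs)
    by_rank = defaultdict(lambda: [0, 0])  # rank -> [matching, total]
    for w, r in ranks.items():
        by_rank[r][1] += 1
        if matches(w):
            by_rank[r][0] += 1
    curve = []
    hits = total = 0
    for r in sorted(by_rank):
        hits += by_rank[r][0]
        total += by_rank[r][1]
        curve.append((r, hits / total))
    return curve


if __name__ == "__main__":
    # Hypothetical toy frequencies; "b" and "c" are tied.
    toy = {"a": 10, "b": 5, "c": 5, "d": 1}
    for rank, share in proportion_curve(toy, lambda w: w in {"b", "d"}):
        print(rank, share)
```

In the toy data, the tied types "b" and "c" occupy positions 2 and 3 and therefore both get rank 2.5, and the last point of the curve equals the overall share of matching types (2 out of 4).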
The subcorpus constraints and all words in the query except the first one are not taken into account here.