INEL Selkup Corpus 2.0

EN | RU | | Documentation |

Word #1

Word:

Lemma:

Grammar:
Gram. gloss:

Language/tier:

Full-text search: Precise match

Welcome! Here is how you can find something:

Type a word or a lemma (dictionary form) in the text box above. Wildcards (*) and even regular expressions are allowed.
Or choose some tags, such as part of speech, in the Grammar box.
Hit Search sentences to find randomly sorted examples of what you are looking for.
Or hit Search words or / lemmata to get a table with words that conform to your query.

There are lots of other options! Click at the top to find out.

Max seconds left: 61

Glossed example copied to clipboard.

Click on to repeat a query.

You have not made any queries yet.

Word	Lemma	Grammar	Subcorpus	Search type	Time

Select parameters

Krasnoyarsk Krai

Tomsk Oblast

Yamalo-Nenets AO

folklore

narrative

conversation

translation

song

Northern

Central

Southern

Baikha

Chaya

Middle Ob

Ket

Narym

Taz

Turuxan

Tym

Upper Tolka

Vasyugan

Upper Ket

Middle Ket

Lower Ket

Upper Taz

Middle Taz

Select documents

(please wait for the document list to load)

Selected subcorpus composition

Parameter: Language/tier:

(please wait for the plot to load)

Value	Size in words	Number of documents

please wait for the plot to load

Word statistics for

Parameter: Query type:

		Query word 1
	value
frequency (ipm)	90% conf. int.

please wait for the plot to load

Distribution by frequency rank

The plot below works as follows. On the x axis, you see frequency ranks, i.e. positions in the full list of all word forms / lemmata of the corpus, ordered by decreasing frequency. If multiple words/lemmata have the same frequency, they get the same rank equal to the average of their positions. For each frequency rank r, the plot shows the proportion of words/lemmata that conform to your query (each word counts only once) among all words with frequency rank less or equal to r on the y axis. The rightmost point, therefore, shows the total proportion of such words/lemmata among all types (different words) in the corpus. In the case of lemmata, all lemmata that have at least one word form conforming to the query are counted.

The subcorpus constraints and all words in the query, except the first one, are not taken into account here.

Scale of the x axis: Max y:

(please wait for the plot to load)