Statistiken nach Metadatenwerten
Veröffentlichte INEL-Korpora
Hier finden Sie Statistiken nach Metadaten für alle INEL-Korpora. Für jedes Korpus werden die relevanten Metadatenparameter in separaten Tabellen aufgeführt. Für jeden Metadatenwert zeigt die Tabelle an, wie viele Wörter und Sätze das entsprechende Subkorpus enthält.
Tavda Mansi
Wörter: 11879. Sätze: 2042.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Chandyri | 1831 | 15.4% | 305 | 14.9% |
Kuzyayeva | 3446 | 29.0% | 478 | 23.4% |
Shaytanskaya | 2599 | 21.8% | 622 | 30.4% |
Yanychkovo | 4003 | 33.6% | 637 | 31.1% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 68 | 0.5% | 19 | 0.9% |
flk | 11545 | 97.1% | 1977 | 96.8% |
nar | 89 | 0.7% | 9 | 0.4% |
sng | 177 | 1.4% | 37 | 1.8% |
Selkupisch
Wörter: 81498. Sätze: 14509.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Central | 3105 | 3.8% | 608 | 4.1% |
Northern | 26093 | 32.0% | 4372 | 30.1% |
Southern | 52300 | 64.1% | 9529 | 65.6% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Baikha | 1550 | 1.9% | 283 | 1.9% |
Chaya | 17225 | 21.1% | 3614 | 24.9% |
Chaya/Middle Ob | 889 | 1.0% | 178 | 1.2% |
Ket | 16259 | 19.9% | 2959 | 20.3% |
Middle Ob | 17927 | 21.9% | 2778 | 19.1% |
Narym | 358 | 0.4% | 74 | 0.5% |
Narym/Tym | 2404 | 2.9% | 464 | 3.1% |
Taz | 21197 | 26.0% | 3444 | 23.7% |
Tym | 343 | 0.4% | 70 | 0.4% |
Upper Tolka | 3346 | 4.1% | 645 | 4.4% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
... | 48261 | 59.2% | 8789 | 60.5% |
Lower Ket/Middle Ket | 5582 | 6.8% | 966 | 6.6% |
Middle Ket | 6235 | 7.6% | 1098 | 7.5% |
Middle Taz | 12356 | 15.1% | 2014 | 13.8% |
Upper Ket | 4038 | 4.9% | 808 | 5.5% |
Upper Taz/Middle Taz | 5026 | 6.1% | 834 | 5.7% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 849 | 1.0% | 182 | 1.2% |
flk | 54601 | 66.9% | 9314 | 64.1% |
nar | 17168 | 21.0% | 3562 | 24.5% |
song | 391 | 0.4% | 70 | 0.4% |
transl | 8489 | 10.4% | 1381 | 9.5% |
Kamassisch
Wörter: 63810. Sätze: 13786.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
flk | 15174 | 23.7% | 3394 | 24.6% |
misc | 47939 | 75.1% | 10222 | 74.1% |
nar | 649 | 1.0% | 159 | 1.1% |
song | 48 | 0.0% | 11 | 0.0% |
Nganasanisch
Wörter: 221747. Sätze: 34872.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 10306 | 4.6% | 2194 | 6.2% |
flk | 45154 | 20.3% | 6462 | 18.5% |
flkd | 73428 | 33.1% | 11270 | 32.3% |
flks | 55319 | 24.9% | 8969 | 25.7% |
nar | 36065 | 16.2% | 5651 | 16.2% |
song | 985 | 0.4% | 249 | 0.7% |
transl | 490 | 0.2% | 77 | 0.2% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Avam | 217678 | 98.1% | 34248 | 98.2% |
Vadeyev | 4069 | 1.8% | 624 | 1.7% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
... | 10602 | 4.7% | 1930 | 5.5% |
AVM | 187 | 0.0% | 32 | 0.0% |
Ust`-Avam | 127155 | 57.3% | 19303 | 55.3% |
Volochanka | 83803 | 37.7% | 13607 | 39.0% |
Enzisch
Wörter: 218710. Sätze: 54133.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Forest | 164900 | 75.3% | 39173 | 72.3% |
Forest (with marginal Tundra influence) | 4963 | 2.2% | 1308 | 2.4% |
Forest with Tundra influence | 3433 | 1.5% | 900 | 1.6% |
Forest with Tundra influence (maybe TE recording transcribed by a FE speaker) | 83 | 0.0% | 15 | 0.0% |
Tundra | 32619 | 14.9% | 9363 | 17.2% |
Tundra; Forest | 12712 | 5.8% | 3374 | 6.2% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 35399 | 16.1% | 9671 | 17.8% |
conv (pseudo-conversation) | 80 | 0.0% | 23 | 0.0% |
conv; nar | 2648 | 1.2% | 694 | 1.2% |
flk | 66413 | 30.3% | 14283 | 26.3% |
flk/nar | 1065 | 0.4% | 253 | 0.4% |
flk/song | 1384 | 0.6% | 442 | 0.8% |
flk; nar | 2627 | 1.2% | 520 | 0.9% |
nar | 104601 | 47.8% | 26975 | 49.8% |
nar/flk | 454 | 0.2% | 131 | 0.2% |
nar; conv | 1582 | 0.7% | 414 | 0.7% |
song | 2324 | 1.0% | 705 | 1.3% |
transl | 133 | 0.0% | 22 | 0.0% |
Nenzisch
Wörter: 61278. Sätze: 10254.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Forest | 23597 | 38.5% | 3709 | 36.1% |
Tundra | 37681 | 61.4% | 6545 | 63.8% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Central | 3358 | 5.4% | 615 | 5.9% |
Eastern | 32993 | 53.8% | 5681 | 55.4% |
Western | 1330 | 2.1% | 249 | 2.4% |
not relevant | 23597 | 38.5% | 3709 | 36.1% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Agan | 5889 | 9.6% | 891 | 8.6% |
Bolshaya Zemlya | 3358 | 5.4% | 615 | 5.9% |
Kanin | 571 | 0.9% | 96 | 0.9% |
Malaya Zemlya | 759 | 1.2% | 153 | 1.4% |
Nadym | 2001 | 3.2% | 261 | 2.5% |
Numto | 6518 | 10.6% | 940 | 9.1% |
Pur | 11190 | 18.2% | 1878 | 18.3% |
Taimyr | 28257 | 46.1% | 4896 | 47.7% |
Yamal | 2735 | 4.4% | 524 | 5.1% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
el | 24 | 0.0% | 6 | 0.0% |
flk | 49449 | 80.6% | 8331 | 81.2% |
nar | 11759 | 19.1% | 1909 | 18.6% |
sng | 46 | 0.0% | 8 | 0.0% |
Dolganisch
Wörter: 97757. Sätze: 14078.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
... | 1051 | 1.0% | 200 | 1.4% |
..., Lower | 508 | 0.5% | 55 | 0.3% |
Anabar | 1495 | 1.5% | 168 | 1.1% |
Lower | 25580 | 26.1% | 3749 | 26.6% |
Lower (?) | 2468 | 2.5% | 362 | 2.5% |
Lower, Upper | 1108 | 1.1% | 155 | 1.1% |
Upper | 47042 | 48.1% | 6631 | 47.1% |
Upper (?) | 10109 | 10.3% | 1556 | 11.0% |
Upper, Lower | 8396 | 8.5% | 1202 | 8.5% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 21193 | 21.6% | 3221 | 22.8% |
flk | 31787 | 32.5% | 4906 | 34.8% |
misc | 113 | 0.1% | 13 | 0.0% |
nar | 43812 | 44.8% | 5825 | 41.3% |
sng | 99 | 0.1% | 24 | 0.1% |
transl | 753 | 0.7% | 89 | 0.6% |
Ewenkisch
Wörter: 93264. Sätze: 19931.
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Northern | 34931 | 37.4% | 7091 | 35.5% |
Southern | 234 | 0.2% | 44 | 0.2% |
Southern (s) | 2425 | 2.6% | 401 | 2.0% |
Southern (sh) | 55674 | 59.6% | 12395 | 62.1% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
Barhahan | 30061 | 32.2% | 6495 | 32.5% |
Ilimpi | 5153 | 5.5% | 1061 | 5.3% |
Khantayskoye Ozero | 13000 | 13.9% | 2612 | 13.1% |
Nepa (?) | 234 | 0.2% | 44 | 0.2% |
Stony Tunguska | 2425 | 2.6% | 401 | 2.0% |
Sym | 25613 | 27.4% | 5900 | 29.6% |
Taimyr | 12688 | 13.6% | 2664 | 13.3% |
Yerbogachyon | 4090 | 4.3% | 754 | 3.7% |
Wert | Wörter | Wörter (Anteil) | Sätze | Sätze (Anteil) |
---|---|---|---|---|
conv | 215 | 0.2% | 67 | 0.3% |
flk | 43832 | 46.9% | 8890 | 44.6% |
misc | 1953 | 2.0% | 639 | 3.2% |
nar | 46462 | 49.8% | 10055 | 50.4% |
song | 802 | 0.8% | 280 | 1.4% |