Korpus: ekk_wikipedia_2018_100K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ta kasutatakse 1333
Ta kasutatakse 1333
ta nimetatakse 692
Ta nimetatakse 692
le sellele 376
Le sellele 376
al alal 297
Al alal 297
Āl alal 297
le millele 192
Subword Length 2 - Most frequent subwords
Subword Count
ta 198
Ta 198
al 84
Al 84
Āl 84
da 74
Da 74
Se 73
id 72
es 42
Amount of words containing repeated subwords of length 2 - per mille
Per mille
12.7930
Subword Length 3 - most frequent words
Subword Word Frequency
est esimestest 21
Bar Barbara 19
bar Barbara 19
Bär Barbara 19
ust tunnustust 17
Bar Barbarossa 15
bar Barbarossa 15
ust asustustihedus 15
Bär Barbarossa 15
est inimestest 13
Subword Length 3 - Most frequent subwords
Subword Count
ust 30
est 12
Bar 11
bar 11
Bär 11
ist 4
and 3
And 3
Ant 2
Ate 2
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.9932
Subword Length 4 - most frequent words
Subword Word Frequency
vana vanavanaisa 10
Vana vanavanaisa 10
isse Jenisseisse 3
poja pojapoja 3
sõna sõnasõnaline 2
vana vanavanaema 2
poja pojapojapoeg 2
Sõna sõnasõnaline 2
Vana vanavanaema 2
Šona sõnasõnaline 2
Subword Length 4 - Most frequent subwords
Subword Count
vana 2
Vana 2
poja 2
isse 1
sõna 1
Sõna 1
Šona 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.1045
Subword Length 5 - most frequent words
Subword Word Frequency
lapse lapselapsed 3
lapse lapselapselaps 3
Lapse lapselapsed 3
Lapse lapselapselaps 3
lapse lapselapse 2
Lapse lapselapse 2
punkt 0-punktpunkt 1
Subword Length 5 - Most frequent subwords
Subword Count
lapse 3
Lapse 3
punkt 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1235
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
vi Kivi-Vigala 9
ja Põhja-Jäämere 4
Ja Põhja-Jäämere 4
ee see-eest 4
Li Iisraeli-Liibanoni 3
ee See-eest 3
ja Põhja-Jäämeres 2
ka kreeka-katoliku 2
Ka kreeka-katoliku 2
Ja Põhja-Jäämeres 2
Subword Length 2 - Most frequent subwords
Subword Count
Ni 2
ja 2
ni 2
Ja 2
ee 2
na 2
Na 2
vi 1
sh 1
No 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1564
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Sal Sal-Saller 3
õhk õhk-õhk 3
Õhk õhk-õhk 3
Subword Length 3 - Most frequent subwords
Subword Count
Sal 1
õhk 1
Õhk 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0251
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
sõna sõna-sõnalt 9
Sõna sõna-sõnalt 9
Šona sõna-sõnalt 9
vana vana-vanaisa 4
Vana vana-vanaisa 4
Boom Boom-Boomiga 1
Subword Length 4 - Most frequent subwords
Subword Count
sõna 1
Sõna 1
Šona 1
vana 1
Vana 1
Boom 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0523
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Baden Baden-Baden 3
Baden Baden-Badeni 3
Baden Baden-Badenile 1
Baden Baden-Badenit 1
Subword Length 5 - Most frequent subwords
Subword Count
Baden 4
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.1235
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
1061295 msec needed at 2024-02-09 01:21