Korpus: tuk_wikipedia_2016_10K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
sa esasan 95
şa esasan 95
Sa esasan 95
Şa esasan 95
yn aýynyň 76
ÿn aýynyň 76
aýynyň 76
yn halkynyň 63
ÿn halkynyň 63
halkynyň 63
Subword Length 2 - Most frequent subwords
Subword Count
913
yn 913
ÿn 913
655
655
655
655
655
in 655
In 655
Amount of words containing repeated subwords of length 2 - per mille
Per mille
55.0239
Subword Length 3 - most frequent words
Subword Word Frequency
deň edenden 2
den edenden 2
deñ edenden 2
nda zyndanda 2
Den edenden 2
Deň edenden 2
ary barýaryn 1
bär Barbarodskiý 1
daň dañdanlar 1
bär Barbarossa 1
Subword Length 3 - Most frequent subwords
Subword Count
Daň 7
daň 7
deñ 4
Den 4
Deň 4
deň 4
den 4
lar 3
ary 3
ýan 3
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.0320
Subword Length 4 - most frequent words
Subword Word Frequency
lary Balarylary 1
lary balarylary 1
lary balarylaryda 1
lary balarylaryň 1
Subword Length 4 - Most frequent subwords
Subword Count
lary 4
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.1666
Subword Length 5 - most frequent words
Subword Word Frequency
jübüt jübütjübütden 1
ýuwaş ýuwaşýuwaşdan 1
Ýuwaş ýuwaşýuwaşdan 1
Subword Length 5 - Most frequent subwords
Subword Count
Ýuwaş 1
jübüt 1
ýuwaş 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1431
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
öz öz-özünden 4
Öz öz-özünden 4
oz öz-özünden 4
óz öz-özünden 4
őz öz-özünden 4
Oz öz-özünden 4
öž öz-özünden 4
da ýurtda-da 2
da şonda-da 2
de döwletlerde-de 2
Subword Length 2 - Most frequent subwords
Subword Count
de 34
De 34
da 29
Da 29
oz 5
óz 5
őz 5
Oz 5
öž 5
öz 5
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
1.8776
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
kem kem-kemden 14
soň Soň-soňlar 4
soñ Soň-soňlar 4
Soň Soň-soňlar 4
son Soň-soňlar 4
soń Soň-soňlar 4
Son Soň-soňlar 4
Soñ Soň-soňlar 4
Şoň Soň-soňlar 4
bir bir-birine 3
Subword Length 3 - Most frequent subwords
Subword Count
bir 9
Bir 9
Men 2
soñ 2
men 2
Soň 2
Meň 2
son 2
soń 2
Son 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.6071
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
aýry aýry-aýry 14
Aýry aýry-aýry 14
Biri biri-birinden 9
Biri biri-birine 9
biri biri-birinden 9
biri biri-birine 9
Biri biri-birini 7
biri biri-birini 7
Biri biri-biri 5
biri biri-biri 5
Subword Length 4 - Most frequent subwords
Subword Count
biri 8
Biri 8
aýry 4
Aýry 4
Küşt 1
küşt 1
ýeke 1
täze 1
yeke 1
Täze 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.7495
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
dürli dürli-dürli 5
Dürli dürli-dürli 5
durli dürli-dürli 5
dürli dürli-dürli(dokma 1
dürli dürli-dürlidir 1
aňsat aňsat-aňsat 1
Dürli dürli-dürli(dokma 1
Dürli dürli-dürlidir 1
durli dürli-dürli(dokma 1
durli dürli-dürlidir 1
Subword Length 5 - Most frequent subwords
Subword Count
durli 3
ýuwaş 3
Ýuwaş 3
dürli 3
Dürli 3
aňsat 1
ansat 1
Birek 1
hatar 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.6440
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
wagtal wagtal-wagtal 2
topbak topbak-topbak 1
Subword Length 6 - Most frequent subwords
Subword Count
wagtal 1
topbak 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.2936
403596 msec needed at 2018-01-25 06:41