Corpus: tuk-tm_web_2019

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ba babatda 4340
Ba babatda 4340
babatda 4340
babatda 4340
un Munuň 3582
Un Munuň 3582
un Şunuň 3495
Un Şunuň 3495
arkalaşyklarynyň 3339
welaýatynyň 2803
Subword Length 2 - Most frequent subwords
Subword Count
2668
1881
1881
in 1881
In 1881
302
an 302
An 302
ny 209
Li 187
Amount of words containing repeated subwords of length 2 - per mille
Per mille
68.5096
Subword Length 3 - most frequent words
Subword Word Frequency
lyk başlyklyk 378
mer mermerli 296
ary barýarys 113
Ary barýarys 113
ýaň daýanýan 96
ýan daýanýan 96
Ýan daýanýan 96
ýän daýanýan 96
Ýaň daýanýan 96
dan meýdandan 72
Subword Length 3 - Most frequent subwords
Subword Count
Dan 15
Daň 15
dan 15
daň 15
lar 11
mer 9
ýaň 5
ýan 5
Ýan 5
ýän 5
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.7799
Subword Length 4 - most frequent words
Subword Word Frequency
syna synasyna 4
lary balarylaryň 4
Syna synasyna 4
lary balarylaryny 3
lary Balarylary 2
täze täzetäze 2
Täze täzetäze 2
ýaşy assosiýasyýasynyň 2
ýasy assosiýasyýasynyň 2
taze täzetäze 2
Subword Length 4 - Most frequent subwords
Subword Count
lary 4
syna 1
Syna 1
ýaşy 1
ýasy 1
Ýaşy 1
beri 1
täze 1
Täze 1
taze 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.1369
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
öz öz-özüňi 150
Öz öz-özüňi 150
oz öz-özüňi 150
Oz öz-özüňi 150
de geljekde-de 72
De geljekde-de 72
da GDA-da 46
Da GDA-da 46
da barada-da 44
Da barada-da 44
Subword Length 2 - Most frequent subwords
Subword Count
da 96
Da 96
de 71
De 71
öz 11
Öz 11
oz 11
Oz 11
4
4
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
2.0707
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
gün gün-günden 130
Gün gün-günden 130
gun gün-günden 130
Gun gün-günden 130
ýyl ýyl-ýyldan 121
Ýyl ýyl-ýyldan 121
kem kem-kemden 46
uly uly-uly 34
Uly uly-uly 34
bir bir-birine 32
Subword Length 3 - Most frequent subwords
Subword Count
bir 20
Bir 20
gül 6
Gül 6
gul 6
kem 3
gün 2
Gün 2
gun 2
Gun 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.5318
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
aýry aýry-aýry 308
ýygy ýygy-ýygydan 139
biri biri-birine 93
Biri biri-birine 93
täze täze-täze 92
Täze täze-täze 92
taze täze-täze 92
biri biri-birinden 79
Biri biri-birinden 79
biri biri-biri 51
Subword Length 4 - Most frequent subwords
Subword Count
biri 12
Biri 12
aýry 3
ýygy 3
ýeke 2
Ýeke 2
aram 2
Aram 2
täze 2
Täze 2
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.6085
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
ýylyň ýylyň-ýylyna 103
Ýylyň ýylyň-ýylyna 103
ýylyn ýylyň-ýylyna 103
Yylyň ýylyň-ýylyna 103
yylyn ýylyň-ýylyna 103
ýylyñ ýylyň-ýylyna 103
birek birek-birek 40
Birek birek-birek 40
dürli dürli-dürli 38
Dürli dürli-dürli 38
Subword Length 5 - Most frequent subwords
Subword Count
Ýuwaş 3
ýuwaş 3
birek 3
Birek 3
dürli 3
Dürli 3
durli 3
ýylyň 2
Ýylyň 2
ýylyn 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.7668
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
belent belent-belent 8
Belent belent-belent 8
menzil menzil-menzil 4
Üýşmek üýşmek-üýşmek 4
Sarahs Sarahs-Sarahs 2
sarahs Sarahs-Sarahs 2
belent Belent-belent 1
Belent Belent-belent 1
Subword Length 6 - Most frequent subwords
Subword Count
belent 2
Belent 2
menzil 1
Üýşmek 1
Sarahs 1
sarahs 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.2075
1102412 msec needed at 2023-11-29 13:18