Korpus: uzb_newscrawl_2011_100K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ни эканини 492
ин ўзининг 466
ўқ ҳуқуқ 411
Ўқ ҳуқуқ 411
ўқ ҳуқуқлари 382
Ўқ ҳуқуқлари 382
Ла болалар 323
ла болалар 323
Ha shahar 227
ha shahar 227
Subword Length 2 - Most frequent subwords
Subword Count
in 834
ин 699
ни 350
ni 244
la 111
La 111
Ла 100
ла 100
ig 70
ўқ 47
Amount of words containing repeated subwords of length 2 - per mille
Per mille
30.4845
Subword Length 3 - most frequent words
Subword Word Frequency
tan tantanali 50
Tan tantanali 50
ish erishish 38
Ish erishish 38
ish tanishish 37
Ish tanishish 37
ash yashash 26
тан тантанали 18
Тан тантанали 18
tan tantanalari 13
Subword Length 3 - Most frequent subwords
Subword Count
ish 28
Ish 28
tan 10
Tan 10
ash 8
тан 6
Тан 6
Mar 6
bir 6
Bir 6
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.1548
Subword Length 4 - most frequent words
Subword Word Frequency
Osiy geosiyosiy 6
Shar sharshara 5
mo‘l Mo‘lmo‘l 1
bosh boshboshdoqlikni 1
Bosh boshboshdoqlikni 1
Xuey Xueyxuey 1
gina kup,ozginagina 1
Subword Length 4 - Most frequent subwords
Subword Count
bosh 1
Bosh 1
gina 1
Osiy 1
Shar 1
mo‘l 1
Xuey 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0978
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ўз ўз-ўзидан 25
уз ўз-ўзидан 25
Ўз ўз-ўзидан 25
Уз ўз-ўзидан 25
ўз ўз-ўзини 10
уз ўз-ўзини 10
Ўз ўз-ўзини 10
Уз ўз-ўзини 10
ўз уз-узидан 5
уз уз-узидан 5
Subword Length 2 - Most frequent subwords
Subword Count
ar 3
sh 3
Sh 3
ўз 3
уз 3
Ўз 3
Уз 3
bi 2
ёр 1
al 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1965
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
бир бир-бирига 78
Бир бир-бирига 78
тез тез-тез 53
Тез тез-тез 53
бир бир-бирини 35
Бир бир-бирини 35
bir bir-biriga 30
Bir bir-biriga 30
BIr bir-biriga 30
bir bir-birini 24
Subword Length 3 - Most frequent subwords
Subword Count
bir 17
Bir 17
BIr 17
бир 17
Бир 17
миш 7
o‘z 2
O‘z 2
йўл 2
йул 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.7262
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
mish mish-mishlar 6
жуда жуда-жуда 5
янги янги-янги 5
ming ming-minglab 5
Янги янги-янги 5
Жуда жуда-жуда 5
Ming ming-minglab 5
яхши яхши-яхши 3
juda juda-juda 3
Яхши яхши-яхши 3
Subword Length 4 - Most frequent subwords
Subword Count
mish 5
Azal 3
azal 3
biri 2
яхши 2
ming 2
Яхши 2
Ming 2
Ko‘p 1
chin 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.4566
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
катта катта-катта 25
Катта катта-катта 25
секин секин-секин 21
Секин секин-секин 21
қайта қайта-қайта 15
Қайта қайта-қайта 15
вақти вақти-вақти 11
Вақти вақти-вақти 11
yangi yangi-yangi 10
Yangi yangi-yangi 10
Subword Length 5 - Most frequent subwords
Subword Count
qadim 2
yangi 2
Qadim 2
Yangi 2
битта 2
қанча 2
Битта 2
катта 2
Қанча 2
Катта 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.8881
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
qancha qancha-qancha 7
Qancha qancha-qancha 7
текшир текшир-текшир 5
barcha barcha-barcha 4
Barcha barcha-barcha 4
такрор такрор-такрор 3
Такрор такрор-такрор 3
текшир Текшир-текшир 3
Йиғлаб йиғлаб-йиғлаб 2
поғона поғона-поғона 2
Subword Length 6 - Most frequent subwords
Subword Count
Barcha 2
qancha 2
Qancha 2
текшир 2
barcha 2
boshka 1
Boshka 1
такрор 1
mazlum 1
Такрор 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.7432
1227370 msec needed at 2018-03-30 21:04