Corpus: hbs_wikipedia_2010

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Oj kojoj 1974
Oj svojoj 1213
ju zahvaljujući 599
Ju zahvaljujući 599
Ti koristiti 545
ti koristiti 545
ju Zahvaljujući 277
Ju Zahvaljujući 277
ju pojavljuju 273
Ju pojavljuju 273
Subword Length 2 - Most frequent subwords
Subword Count
Ti 97
ti 97
ju 56
Ju 56
Li 34
34
li 34
Ma 28
ma 28
ta 23
Amount of words containing repeated subwords of length 2 - per mille
Per mille
6.6052
Subword Length 3 - most frequent words
Subword Word Frequency
nje objašnjenje 119
nje ujedinjenje 83
nje smanjenje 69
pre prepreka 54
Pre prepreka 54
jen zamijenjen 44
jen namijenjen 42
nje punjenje 39
Čin Cincinnati 34
Ćin Cincinnati 34
Subword Length 3 - Most frequent subwords
Subword Count
jen 47
nje 24
bar 18
Bar 18
čin 7
Čin 7
Ćin 7
pre 4
Pre 4
Ant 3
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.4059
Subword Length 4 - most frequent words
Subword Word Frequency
čija asocijacija 35
čija glacijacija 16
čija inicijacija 6
čija Asocijacija 5
čija diferencijacija 5
čija disocijacija 5
čija Inicijacija 3
Subword Length 4 - Most frequent subwords
Subword Count
čija 7
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.1385
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
sh Shamash-shum-ukin 4
Il Ašur-etil-ilani 3
il Ašur-etil-ilani 3
Subword Length 2 - Most frequent subwords
Subword Count
sh 1
Il 1
il 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0208
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
dan dan-danas 85
Dan dan-danas 85
Subword Length 3 - Most frequent subwords
Subword Count
dan 1
Dan 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0127
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
dipol dipol-dipol 6
dipol Dipol-dipol 3
Subword Length 5 - Most frequent subwords
Subword Count
dipol 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0871
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
2040712 msec needed at 2017-12-21 05:34