Corpus: slv_newscrawl_2016_300K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
različico 1116
različico 1116
po popolnoma 978
Po popolnoma 978
el želel 703
El želel 703
Em spremembe 692
o. d.o.o. 576
el želeli 514
El želeli 514
Subword Length 2 - Most frequent subwords
Subword Count
aj 84
po 76
Po 76
Li 69
ti 59
Ti 59
Ka 34
ka 34
na 28
Na 28
Share of words containing repeated subwords of length 2 - per mille
Per mille
7.9691
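
The report does not say how the repetitions are detected or what the per mille figure is measured against. The listed examples (po|po in popolnoma, el|el in želel, bar|bar in Barbara) suggest that a repeated subword is the same substring of length n occurring twice in immediate succession, matched without regard to case. A minimal Python sketch under those assumptions follows. The function names are hypothetical, and computing the share over distinct word forms rather than over running tokens is also an assumption.

```python
def repeated_subwords(word, n):
    """Return the length-n substrings that occur twice in a row in `word`.

    Assumption: matching is case-insensitive, as the Bar/bar rows suggest.
    """
    lowered = word.lower()
    found = set()
    # Slide a window of 2*n characters and compare its two halves.
    for i in range(len(lowered) - 2 * n + 1):
        if lowered[i:i + n] == lowered[i + n:i + 2 * n]:
            found.add(lowered[i:i + n])
    return found


def per_mille_with_repeats(word_freqs, n):
    """Share (per mille) of distinct word forms with an adjacent repeat of length n.

    Assumption: the share is taken over word types, not over tokens.
    """
    if not word_freqs:
        return 0.0
    hits = sum(1 for w in word_freqs if repeated_subwords(w, n))
    return 1000.0 * hits / len(word_freqs)


# Toy word list: the first three frequencies are taken from the table above,
# the last entry is made up for illustration.
sample = {"popolnoma": 978, "želel": 703, "spremembe": 692, "hiša": 10}
print(repeated_subwords("popolnoma", 2))
print(per_mille_with_repeats(sample, 2))
```

On the toy list, three of the four word forms contain an adjacent repeat of length 2, so the share is far above the corpus-wide value reported above.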
Subword Length 3 - most frequent words
Subword Word Frequency
pre preprečiti 186
pre preprečili 167
pre preprečevanje 129
pre preprečil 119
Bar Barbara 85
bar Barbara 85
ost prostost 80
dan dandanes 72
Dan dandanes 72
pre preprečila 61
Subword Length 3 - Most frequent subwords
Subword Count
pre 34
Bar 12
bar 12
nje 11
nja 8
ost 6
nos 4
noš 4
Nos 4
dan 3
Share of words containing repeated subwords of length 3 - per mille
Per mille
1.0320
Subword Length 4 - most frequent words
Subword Word Frequency
Remo premoremo 8
Subword Length 4 - Most frequent subwords
Subword Count
Remo 1
Share of words containing repeated subwords of length 4 - per mille
Per mille
0.0188
Subword Length 5 - most frequent words
Subword Word Frequency
Rebel rebelrebel 5
Subword Length 5 - Most frequent subwords
Subword Count
Rebel 1
Share of words containing repeated subwords of length 5 - per mille
Per mille
0.0409
Share of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ko diplomatsko-konzularnih 9
Ko diplomatsko-konzularnih 9
No božično-novoletni 5
no božično-novoletni 5
No božično-novoletne 4
No božično-novoletnih 4
No božično-novoletnimi 4
no božično-novoletne 4
no božično-novoletnih 4
no božično-novoletnimi 4
Subword Length 2 - Most frequent subwords with hyphen
Subword Count
No 4
no 4
Ko 1
ga 1
Ga 1
ko 1
Share of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0623
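
In the hyphenated examples above, the repeated subword straddles the hyphen (ko-ko in diplomatsko-konzularnih, no-no in božično-novoletni). The report does not document the exact rule, so the following Python sketch is only one plausible reading: it assumes a hit means the n characters directly before a hyphen equal the n characters directly after it, compared case-insensitively. The function name is hypothetical.

```python
def repeats_across_hyphen(word, n):
    """Return length-n subwords that appear on both sides of a hyphen.

    Assumption: only repeats that straddle the hyphen count, and case is ignored.
    """
    lowered = word.lower()
    found = set()
    for pos, ch in enumerate(lowered):
        if ch != "-":
            continue
        left = lowered[max(0, pos - n):pos]       # n characters before the hyphen
        right = lowered[pos + 1:pos + 1 + n]      # n characters after the hyphen
        # Require a full-length match that does not itself contain a hyphen.
        if len(left) == n and left == right and "-" not in left:
            found.add(left)
    return found


print(repeats_across_hyphen("diplomatsko-konzularnih", 2))
print(repeats_across_hyphen("božično-novoletni", 3))
```

Under this reading the second call finds nothing, which is consistent with the zero shares reported for hyphenated words at subword lengths 3 and above.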
Share of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
Computation took 1007611 msec (2018-03-24 02:55)