Corpus: slk_newscrawl_2016_100K

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
to toto 519
To toto 519
to Toto 406
To Toto 406
en ocenenie 68
po popoludní 65
Po popoludní 65
Li schválili 65
Li zvolili 62
ťa dieťaťa 54
Subword Length 2 - Most frequent subwords
Subword Count
ne 84
84
Li 59
en 45
ov 39
Na 23
23
La 23
la 23
po 23
Amount of words containing repeated subwords of length 2 - per mille
Per mille
5.9401
Subword Length 3 - most frequent words
Subword Word Frequency
%Na znamená 221
%Na zaznamenali 104
%Za zadržali 78
%Na poznamenal 74
%Na Národná 72
%Na zaznamenal 71
%Za zadržala 65
%Na zaznamenala 59
%Na neznamená 57
%Na International 54
Subword Length 3 - Most frequent subwords
Subword Count
%Na 275
%Za 76
bar 9
Bar 9
dno 3
tom 2
Tom 2
nos 2
Nos 2
Bon 1
Amount of words containing repeated subwords of length 3 - per mille
Per mille
4.7413
Subword Length 4 - most frequent words
Subword Word Frequency
pred predpredaji 5
Pred predpredaji 5
foto fotoFoto 3
Foto fotoFoto 3
pred predpredaj 2
Pred predpredaj 2
Subword Length 4 - Most frequent subwords
Subword Count
pred 2
Pred 2
foto 1
Foto 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0595
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Subword Length 6 - most frequent words
Subword Word Frequency
Faeton FAETONFAETON 1
Faeton FaetonFaetona 1
Subword Length 6 - Most frequent subwords
Subword Count
Faeton 2
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.2346
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
no OĽaNO-Nova 15
No OĽaNO-Nova 15
no OĽaNO-NOVA 11
No OĽaNO-NOVA 11
Ch Cechach-chytili 1
ha Ha-Ha-Ha 1
Ha Ha-Ha-Ha 1
Subword Length 2 - Most frequent subwords
Subword Count
no 2
No 2
Ch 1
ha 1
Ha 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0418
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
925654 msec needed at 2018-03-23 10:49