Corpus: ron_news_1998-2007_100K

2.2.11 Repetitions

Typical repetitions within words
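
The report does not state how repetitions are detected. Judging from the listed entries (atat, Uniunii, povesteste, Yangyang), a subword counts as repeated when it occurs twice in direct succession, ignoring case. A minimal sketch under that assumption follows; the function name `first_adjacent_repeat` is hypothetical and not part of the corpus tooling.

```python
from typing import Optional


def first_adjacent_repeat(word: str, n: int) -> Optional[str]:
    """Return the first length-n subword that is immediately repeated, or None.

    Assumption: matching is case-insensitive and only letters count,
    which is inferred from the tables, not documented by the source.
    """
    lower = word.lower()
    # Slide over every start position that leaves room for two chunks.
    for i in range(len(lower) - 2 * n + 1):
        chunk = lower[i:i + n]
        if chunk.isalpha() and lower[i + n:i + 2 * n] == chunk:
            return chunk
    return None


print(first_adjacent_repeat("atat", 2))
print(first_adjacent_repeat("Uniunii", 3))
```

Returning only the first match mirrors the tables, which list one subword per word; a word such as povesteste also contains a second adjacent repeat ("ste") that this sketch does not report.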

Subword Length 2 - most frequent words
Subword Word Frequency
at atat 1144
at aratat 639
at jumatate 436
tu tuturor 353
Tu tuturor 353
ti Justitiei 298
to Totodata 280
To Totodata 280
Se fusese 251
se fusese 251
Subword Length 2 - Most frequent subwords
Subword Count
ta 311
Ta 311
lu 219
at 211
le 195
Le 195
ti 138
Ze 87
ar 79
Ar 79
Proportion of words containing repeated subwords of length 2 (per mille)
Per mille
19.7049
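
A per-mille figure like the one above is the number of matching words divided by the total number of words, times 1000. The sketch below illustrates the arithmetic on a tiny invented word list. Whether the report counts distinct word forms or running tokens is not stated, so the helper names and the sample list are assumptions for illustration only.

```python
from typing import Iterable


def has_adjacent_repeat(word: str, n: int) -> bool:
    """True if some length-n subword occurs twice in a row (case-insensitive)."""
    lower = word.lower()
    return any(
        lower[i:i + n].isalpha() and lower[i:i + n] == lower[i + n:i + 2 * n]
        for i in range(len(lower) - 2 * n + 1)
    )


def per_mille(words: Iterable[str], n: int) -> float:
    """Share of words with a repeated length-n subword, in per mille."""
    words = list(words)
    if not words:
        return 0.0
    matching = sum(has_adjacent_repeat(w, n) for w in words)
    return round(1000.0 * matching / len(words), 4)


# Hypothetical sample, not taken from the corpus counts.
sample = ["atat", "tuturor", "corpus", "news"]
print(per_mille(sample, 2))
```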
Subword Length 3 - most frequent words
Subword Word Frequency
uni Uniunii 396
rea crearea 141
Rea crearea 141
uni reuniunii 38
Est povesteste 35
est povesteste 35
uni reuniuni 32
car descarcari 20
Tan instantaneu 11
bar Barbara 8
Subword Length 3 - Most frequent subwords
Subword Count
bar 11
Bar 11
uni 9
car 9
Est 7
est 7
Tan 4
rea 3
Rea 3
mar 3
Proportion of words containing repeated subwords of length 3 (per mille)
Per mille
0.8625
Subword Length 4 - most frequent words
Subword Word Frequency
Yang Yangyang 1
mata Matamata-Piako 1
Mata Matamata-Piako 1
Tori indatoritori 1
Subword Length 4 - Most frequent subwords
Subword Count
mata 1
Mata 1
Yang 1
Tori 1
Proportion of words containing repeated subwords of length 4 (per mille)
Per mille
0.0539
Proportion of words containing repeated subwords of length 5 (per mille)
Per mille
0.0000
Proportion of words containing repeated subwords of length 6 (per mille)
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
si si-si 10
Si si-si 10
sI si-si 10
te trezeste-te 4
Te trezeste-te 4
Ro maghiaro-romana 1
Na na-na 1
na na-na 1
cu Dumitrescu-Cunctator 1
al Al-Ali 1
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
te 5
Te 5
Ro 3
Na 2
na 2
ar 2
Ar 2
si 1
cu 1
Si 1
Proportion of words with hyphen containing repeated subwords of length 2 (per mille)
Per mille
0.1873
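
In the hyphenated entries above (si-si, trezeste-te, Al-Ali), the subword directly before the hyphen reappears directly after it. The following sketch checks for exactly that pattern. It is an interpretation of the listed examples, not the documented method, and `repeat_across_hyphen` is a hypothetical name.

```python
from typing import Optional


def repeat_across_hyphen(word: str, n: int) -> Optional[str]:
    """Return the length-n subword repeated on both sides of a hyphen, or None.

    Assumption: comparison is case-insensitive, as suggested by entries
    such as Al-Ali and Gazprom-Romgaz.
    """
    lower = word.lower()
    for h, ch in enumerate(lower):
        # Skip non-hyphens and hyphens with fewer than n characters before them.
        if ch != "-" or h < n:
            continue
        left = lower[h - n:h]
        right = lower[h + 1:h + 1 + n]
        if left.isalpha() and left == right:
            return left
    return None


print(repeat_across_hyphen("trezeste-te", 2))
print(repeat_across_hyphen("Gazprom-Romgaz", 3))
```

A word like Matamata-Piako yields None here, because its repetition lies within the first part rather than across the hyphen; this is consistent with its appearance in the non-hyphen table.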
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
alb alb-albastru 4
Alb alb-albastru 4
nou nou-nouta 3
Nou nou-nouta 3
alb alb-albastre 2
Alb alb-albastre 2
Rom Gazprom-Romgaz 1
rom Gazprom-Romgaz 1
aer aer-aer 1
alb alb-albastrii 1
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
alb 5
Alb 5
nou 1
Nou 1
Rom 1
rom 1
aer 1
Aer 1
Proportion of words with hyphen containing repeated subwords of length 3 (per mille)
Per mille
0.0986
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
doar doar-doar 3
Doar doar-doar 3
colo colo-colo 2
ceva ceva-ceva 1
usor Usor-usor 1
Ceva ceva-ceva 1
Usor Usor-usor 1
gata gata-gata 1
fete fete-fete 1
Gata gata-gata 1
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
doar 1
Doar 1
colo 1
usor 1
Usor 1
ceva 1
Ceva 1
fete 1
Fete 1
gata 1
Proportion of words with hyphen containing repeated subwords of length 4 (per mille)
Per mille
0.1079
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
incet incet-incet 2
Incet incet-incet 2
incet Incet-incet 1
Incet Incet-incet 1
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
incet 2
Incet 2
Proportion of words with hyphen containing repeated subwords of length 5 (per mille)
Per mille
0.0689
Proportion of words with hyphen containing repeated subwords of length 6 (per mille)
Per mille
0.0000
Computed in 922109 msec at 2019-11-27 08:20