Corpus: vls_wikipedia_2014_30K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
gegeevn 57
Ge gegeevn 57
ge gegeevn 57
Pepeyn 51
én dienen 35
ėn dienen 35
En dienen 35
en dienen 35
ên dienen 35
èn dienen 35
Subword Length 2 - Most frequent subwords
Subword Count
én 64
Èn 64
ėn 64
en 64
En 64
èn 64
ên 64
Ge 31
ge 31
31
Share of words containing repeated subwords of length 2 - per mille
Per mille
4.8914
Subword Length 3 - most frequent words
Subword Word Frequency
Els sterrenstelsels 7
Els stelsels 6
bar Barbara 6
bar Barbarossa 6
Els vertelsels 4
oek Koekoeksnest 3
Oek Koekoeksnest 3
koe Koekoeksnest 3
Koe Koekoeksnest 3
Els sterrestelsels 2
Subword Length 3 - Most frequent subwords
Subword Count
Ièn 5
Els 5
ièn 5
bar 4
tis 2
Tis 2
oek 2
Oek 2
koe 2
end 2
Share of words containing repeated subwords of length 3 - per mille
Per mille
0.6319
Subword Length 4 - most frequent words
Subword Word Frequency
stel assenstelstel 1
Subword Length 4 - Most frequent subwords
Subword Count
stel 1
Share of words containing repeated subwords of length 4 - per mille
Per mille
0.0261
Subword Length 5 - most frequent words
Subword Word Frequency
koter koterKoters 1
Subword Length 5 - Most frequent subwords
Subword Count
koter 1
Share of words containing repeated subwords of length 5 - per mille
Per mille
0.0429
Share of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Ge Brugge-Gent 3
ge Brugge-Gent 3
Brugge-Gent 3
Co AGCO-Cooperation 1
ne Tiene-neegne 1
Ne Tiene-neegne 1
ch Islamitisch-Chinese 1
Tiene-neegne 1
Ge d'Hillige-Gêeststroate 1
ge d'Hillige-Gêeststroate 1
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
Ge 2
ge 2
2
Co 2
no 1
1
No 1
ch 1
ne 1
Ne 1
Share of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1073
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Bio Bío-Bío 1
cel cel-cel 1
Chi chi-chi 1
chi chi-chi 1
rap rap-rap 1
Rap rap-rap 1
Yde hyde-yde 1
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
Bio 1
cel 1
Chi 1
chi 1
Yde 1
rap 1
Rap 1
Share of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0929
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
tuut tuut-tuut 1
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
tuut 1
Share of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0261
Share of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
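
Sketch: detecting repeated subwords

The report does not state how a repeated subword is defined. Judging from the listed examples (Barbara, Koekoeksnest, Brugge-Gent, tuut-tuut), it appears to mean a subword of length n that occurs twice in direct succession, compared case-insensitively, with the two copies separated by the hyphen in the hyphenated tables. The Python sketch below implements that inferred definition only. The function names repeated_subwords and per_mille are hypothetical, and it is an assumption that the per-mille figure is the matching words divided by all words in the list. The sketch does not fold diacritics, although the tables suggest the original tool may treat variants such as én and en alike.

```python
def repeated_subwords(word, n, hyphen=False):
    """Return the length-n subwords that occur twice in a row in `word`.

    Comparison is case-insensitive. With hyphen=True the two copies must
    be separated by exactly one hyphen (as in "tuut-tuut"); otherwise
    they must be directly adjacent (as in "Barbara").
    This definition is inferred from the report's examples.
    """
    w = word.lower()
    sep = "-" if hyphen else ""
    found = set()
    for i in range(len(w) - 2 * n - len(sep) + 1):
        first = w[i:i + n]
        middle = w[i + n:i + n + len(sep)]
        second = w[i + n + len(sep):i + 2 * n + len(sep)]
        # The subword itself must not contain the hyphen.
        if middle == sep and "-" not in first and first == second:
            found.add(first)
    return found


def per_mille(words, n, hyphen=False):
    """Share of words (per mille) with at least one repeated subword.

    Assumes the denominator is the full word list passed in.
    """
    if not words:
        return 0.0
    hits = sum(1 for w in words if repeated_subwords(w, n, hyphen))
    return 1000.0 * hits / len(words)
```

For example, repeated_subwords("Koekoeksnest", 3) yields both koe and oek, which matches the two subwords the length-3 table lists for that word, and repeated_subwords("Brugge-Gent", 2, hyphen=True) yields ge.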
1333553 msec needed at 2018-01-28 10:56