Korpus: afr_newscrawl_2013_100K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ge gegee 476
po Limpopo 289
si posisie 140
Ba baba 82
Is krisis 81
is krisis 81
ís krisis 81
Se oorsese 59
oorsese 59
oorsese 59
Subword Length 2 - Most frequent subwords
Subword Count
Ba 63
si 48
in 46
In 46
ín 46
Ed 38
is 35
Is 35
ís 35
ge 32
Amount of words containing repeated subwords of length 2 - per mille
Per mille
6.0902
Subword Length 3 - most frequent words
Subword Word Frequency
Els stelsels 60
Eis groeiseisoen 38
eis groeiseisoen 38
vêr verversings 14
Ver verversings 14
vér verversings 14
ver verversings 14
bar Barbara 7
Son sonsondergang 7
Eis broeiseisoen 7
Subword Length 3 - Most frequent subwords
Subword Count
Els 58
eis 8
Eis 8
en- 6
En- 6
bar 5
Bar 5
kom 4
Kom 4
son 3
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.5557
Subword Length 4 - most frequent words
Subword Word Frequency
Koës koeskoes 8
koes koeskoes 8
klop Klopkloppie 4
Klop Klopkloppie 4
belê Mabelebele 1
klop Landeryklopkloppie 1
Klop Landeryklopkloppie 1
bele Mabelebele 1
Bele Mabelebele 1
klop Woestynklopkloppie 1
Subword Length 4 - Most frequent subwords
Subword Count
klop 3
Klop 3
ring 1
Ring 1
Koës 1
koes 1
belê 1
bele 1
Bele 1
haha 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.1123
Subword Length 5 - most frequent words
Subword Word Frequency
Bela- Bela-Bela-landdroshof 1
Bela- Bela-Bela-skougronde 1
skaap landskaapskaap 1
Skaap landskaapskaap 1
Subword Length 5 - Most frequent subwords
Subword Count
Bela- 2
skaap 1
Skaap 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0695
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Ai Ai-Ais 3
ai Ai-Ais 3
ei groei-eienskappe 3
ou ou-ou 2
ha Ha-ha 2
Ou ou-ou 2
Ha Ha-ha 2
en Veteraan-trekker-en-Enjinklubs 1
Én Een-en-twintig 1
én een-en’n-halwe 1
Subword Length 2 - Most frequent subwords
Subword Count
en 9
En 9
én 9
Én 9
te 3
Te 3
3
3
la 2
Ra 2
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.4410
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
net net-net 24
Net net-net 24
nét net-net 24
wil wil-wil 5
Wil wil-wil 5
wíl wil-wil 5
Wíl wil-wil 5
gou gou-gou 4
Gou gou-gou 4
Toi toi-toi 4
Subword Length 3 - Most frequent subwords
Subword Count
ont 2
lek 1
Een 1
Lug 1
Gou 1
Lek 1
één 1
Org 1
Toi 1
Jan 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.2963
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Bela Bela-Bela 46
bela Bela-Bela 46
plek plek-plek 14
Plek plek-plek 14
kort kort-kort 5
Kort kort-kort 5
plek Plek-plek 3
stil stil-stil 3
Plek Plek-plek 3
Stil stil-stil 3
Subword Length 4 - Most frequent subwords
Subword Count
Bela 4
bela 4
plek 2
Plek 2
hoes 1
Koës 1
loer 1
koes 1
Loer 1
mens 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.2567
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
speel speel-speel 3
Speel speel-speel 3
vroeg vroeg-vroeg 2
Vroeg vroeg-vroeg 2
Felix Felix-Felix 1
speel Speel-speel 1
amper amper-amper 1
Noord Noord-Noord 1
noord Noord-Noord 1
Amper amper-amper 1
Subword Length 5 - Most frequent subwords
Subword Count
speel 2
Speel 2
Amper 1
groot 1
Groot 1
gróót 1
gróot 1
vroeg 1
klein 1
Vroeg 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.1854
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
sukkel sukkel-sukkel 2
Sukkel sukkel-sukkel 2
meisie Meisie-Meisie 1
Meisie Meisie-Meisie 1
maklik maklik-maklik 1
Junior junior-junior 1
junior junior-junior 1
Maklik maklik-maklik 1
Subword Length 6 - Most frequent subwords
Subword Count
sukkel 1
Sukkel 1
meisie 1
Meisie 1
Junior 1
junior 1
maklik 1
Maklik 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.1432
1381257 msec needed at 2018-01-30 19:46