Corpus: sna-zw_web_2016

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ya nyaya 2601
Ba baba 906
ba baba 906
ya Nyaya 645
ya nenyaya 398
ya munyaya 393
mu mumutambo 354
Ba Baba 351
ba Baba 351
at vatatu 332
Subword Length 2 - Most frequent subwords
Subword Count
ir 1028
an 490
ra 399
Ra 399
ku 355
Ku 355
or 329
is 300
na 255
ka 233
Amount of words containing repeated subwords of length 2 - per mille
Per mille
47.5547
Subword Length 3 - most frequent words
Subword Word Frequency
Nya kunyanya 407
nya kunyanya 407
dzi vadzidzi 361
Nya zvakanyanya 321
nya zvakanyanya 321
any zvakanyanya 321
Sha shasha 200
sha shasha 200
dzi vadzidzisi 181
Sha Shasha 126
Subword Length 3 - Most frequent subwords
Subword Count
dzi 256
nya 84
Nya 84
any 81
che 53
gwa 32
Sha 13
kwe 13
sha 13
bha 11
Amount of words containing repeated subwords of length 3 - per mille
Per mille
6.5952
Subword Length 4 - most frequent words
Subword Word Frequency
kuna makunakuna 28
Kuna makunakuna 28
mini maminimini 12
Mini maminimini 12
pata chipatapata 6
Tepa pamutepatepa 5
Kuna Makunakuna 4
kuna Makunakuna 4
meso mesomeso 3
Kuna semakunakuna 3
Subword Length 4 - Most frequent subwords
Subword Count
kuna 8
Kuna 8
kare 5
Kare 5
diki 4
bata 3
Bata 3
Tepa 2
kkkk 2
mira 2
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.6914
Subword Length 5 - most frequent words
Subword Word Frequency
nyoro zvinyoronyoro 7
Nyoro zvinyoronyoro 7
mhere mheremhere 3
Mhere mheremhere 3
shoma zvishomashoma 3
Shoma zvishomashoma 3
Shava murushavashava 2
shava murushavashava 2
mviro mviromviro 2
shena akashenashena 1
Subword Length 5 - Most frequent subwords
Subword Count
mhere 2
Mhere 2
Shoma 1
zvino 1
Shava 1
Zvino 1
shava 1
manga 1
mviro 1
batwa 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.3706
Subword Length 6 - most frequent words
Subword Word Frequency
siyana dzakasiyanasiyana 1
siyana hwakasiyanasiyana 1
Hwande chihwandehwande 1
mhanya achimhanyamhanya 1
Siyana dzakasiyanasiyana 1
Siyana hwakasiyanasiyana 1
Kikiki kikikikikikikiki 1
hwande chihwandehwande 1
Subword Length 6 - Most frequent subwords
Subword Count
siyana 2
Siyana 2
Kikiki 1
mhanya 1
Hwande 1
hwande 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.2408
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
we Sarawe-we 1
We Sarawe-we 1
ra inongorara-rara 1
Ra inongorara-rara 1
Subword Length 2 - Most frequent subwords
Subword Count
we 1
We 1
ra 1
Ra 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0205
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
oga oga-oga 8
ega ega-ega 6
ose ose-ose 3
Ose ose-ose 3
boi boi-boi 2
iyo chaiyo-iyo 1
Iyo chaiyo-iyo 1
Subword Length 3 - Most frequent subwords
Subword Count
iyo 1
Iyo 1
oga 1
ega 1
ose 1
Ose 1
boi 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0564
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Rega rega-rega 76
rega rega-rega 76
Pose pose-pose 42
roga roga-roga 42
pose pose-pose 42
yoga yoga-yoga 39
woga woga-woga 37
wega wega-wega 30
Wega wega-wega 30
Pese pese-pese 26
Subword Length 4 - Most frequent subwords
Subword Count
bata 20
Bata 20
soro 13
diki 7
kare 6
Kare 6
mira 4
Mira 4
Baya 3
kiya 3
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
1.5125
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
zvino rwechizvino-zvino 37
Zvino rwechizvino-zvino 37
chema kuchema-chema 25
zvino yechizvino-zvino 17
Zvino yechizvino-zvino 17
mhere mhere-mhere 11
Mhere mhere-mhere 11
shoma mashoma-shoma 9
zvino dzechizvino-zvino 9
Zvino dzechizvino-zvino 9
Subword Length 5 - Most frequent subwords
Subword Count
chema 21
tsika 13
Tsika 13
Famba 7
famba 7
shoma 6
Shoma 6
zvino 6
Zvino 6
tamba 5
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
3.2198
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
Siyana dzakasiyana-siyana 49
siyana dzakasiyana-siyana 49
Siyana akasiyana-siyana 45
Siyana zvakasiyana-siyana 45
siyana akasiyana-siyana 45
siyana zvakasiyana-siyana 45
Siyana yakasiyana-siyana 22
siyana yakasiyana-siyana 22
mhanya kumhanya-mhanya 17
mukuru mukuru-mukuru 7
Subword Length 6 - Most frequent subwords
Subword Count
siyana 14
Siyana 14
mhanya 7
Hwande 2
hwande 2
Chekwa 2
cheuka 2
Shinga 2
Chairo 1
gwinha 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
2.1188
1684061 msec needed at 2017-10-27 05:23