Korpus: bik_wikipedia_2016

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
in ining 1265
In ining 1265
ba Nababanga 535
Ab Nababanga 535
At natatadang 363
at natatadang 363
ta natatadang 363
Ta natatadang 363
ta tataramon 309
Ta tataramon 309
Subword Length 2 - Most frequent subwords
Subword Count
ka 307
Ka 307
pa 210
Pa 210
an 193
An 193
ba 188
ta 149
Ta 149
La 111
Amount of words containing repeated subwords of length 2 - per mille
Per mille
60.2532
Subword Length 3 - most frequent words
Subword Word Frequency
Bay baybayon 30
bay baybayon 30
Nga nagngangaran 21
Gen nangengenot 21
nga nagngangaran 21
Ang pangangaipo 19
Nga pangangaipo 19
nga pangangaipo 19
ang pangangaipo 19
Ang Tangang 16
Subword Length 3 - Most frequent subwords
Subword Count
nga 52
Nga 52
ang 48
Ang 48
gan 13
Pro 11
pro 11
Bay 8
bay 8
Gen 5
Amount of words containing repeated subwords of length 3 - per mille
Per mille
4.5622
Subword Length 4 - most frequent words
Subword Word Frequency
Laen manlaenlaen 39
laen manlaenlaen 39
labi labilabing 9
Labi labilabi 9
Labi labilabing 9
Labí labilabi 9
Labí labilabing 9
labi labilabi 9
taón taontaon 7
taon taontaon 7
Subword Length 4 - Most frequent subwords
Subword Count
Labí 3
alag 3
pang 3
labi 3
Labi 3
logo 2
Logo 2
Laen 2
laen 2
dali 2
Amount of words containing repeated subwords of length 4 - per mille
Per mille
1.8514
Subword Length 5 - most frequent words
Subword Word Frequency
sunod sunodsunod 6
Sunod sunodsunod 6
tolos tolostolos 3
Tolos tolostolos 3
sadít saditsadit 2
Dakul kadakuldakul 2
tolos Tolostolos 2
kurit kuritkurit 2
pulot mapulotpulot 2
liwat liwatliwat 2
Subword Length 5 - Most frequent subwords
Subword Count
balik 3
Balik 3
tapos 2
Tapos 2
surat 2
lakaw 2
Surat 2
hanap 2
dalan 2
buhay 2
Amount of words containing repeated subwords of length 5 - per mille
Per mille
4.7220
Subword Length 6 - most frequent words
Subword Word Frequency
barong Barongbarong 3
kulang pagkulangkulang 1
barong barongbarong 1
Kulang pagkulangkulang 1
Subword Length 6 - Most frequent subwords
Subword Count
barong 2
kulang 1
Kulang 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.6806
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ag mag-agom 13
ag nag-agi 3
ag nag-agom 3
ag pag-agom 3
on inon-on 2
On inon-on 2
ag mag-agad 2
ag Nag-agi 1
ag Pag-agi 1
ag Pag-agom 1
Subword Length 2 - Most frequent subwords
Subword Count
ag 13
ka 4
Ka 4
pa 3
Pa 3
se 2
Sa 1
si 1
de 1
Si 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.7170
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Iba iba-ibang 86
ibá iba-ibang 86
ibà iba-ibang 86
iba iba-ibang 86
agi Agi-agi 41
agi agi-agi 21
Iba iba-iba 15
ibá iba-iba 15
ibà iba-iba 15
iba iba-iba 15
Subword Length 3 - Most frequent subwords
Subword Count
iba 12
Iba 12
ibá 12
ibà 12
ano 3
Ano 3
año 3
Año 3
agi 2
iyo 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.7804
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Laen manlaen-laen 19
laen manlaen-laen 19
taón taon-taon 17
Taon taon-taon 17
taon taon-taon 17
Enot kaenot-enoteng 9
enot kaenot-enoteng 9
Lapu Lapu-Lapu 8
gibo gibo-gibo 8
Gibo gibo-gibo 8
Subword Length 4 - Most frequent subwords
Subword Count
Sarò 5
sarò 5
enot 5
Enot 5
solo 5
saro 5
Saro 5
sarô 5
Sarô 5
olay 4
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
4.6961
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Buhay buhay-buhay 7
buhay buhay-buhay 7
sunod sunod-sunod 5
Sunod sunod-sunod 5
pulot mapulot-pulot 4
ibong ibong-ibong 4
ibóng ibong-ibong 4
giyaw giyaw-giyaw 4
payag payag-payag 4
Giyaw giyaw-giyaw 4
Subword Length 5 - Most frequent subwords
Subword Count
giyaw 4
Giyaw 4
balik 3
Balik 3
buhay 3
Buhay 3
Gabos 2
gabós 2
tapos 2
grupo 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
8.9986
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
harang maharang-harang 4
pantay pantay-pantay 3
Kulang nagkukulang-kulang 2
minsán paminsan-minsan 2
minsan paminsan-minsan 2
kulang nagkukulang-kulang 2
Minsan paminsan-minsan 2
harang magharang-harang 1
dangog nagdangog-dangog 1
Bulong pagbulong-bulong 1
Subword Length 6 - Most frequent subwords
Subword Count
tangga 3
kulang 3
Kulang 3
minsan 2
Minsan 2
minsán 2
harang 2
bulóng 1
hapros 1
dangog 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
3.4029
374541 msec needed at 2017-11-29 00:06