Korpus: ind_newscrawl_2011_10K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
si posisi 73
Si posisi 73
an makanan 45
di pendidikan 45
An makanan 45
Di pendidikan 45
Ba babak 44
si sisi 39
Si sisi 39
an pelayanan 38
Subword Length 2 - Most frequent subwords
Subword Count
An 113
an 113
di 26
Di 26
si 22
Si 22
Pa 17
ga 17
du 15
Ka 14
Amount of words containing repeated subwords of length 2 - per mille
Per mille
17.5695
Subword Length 3 - most frequent words
Subword Word Frequency
Tan tantangan 22
Din dinding 7
ter tertera 6
kan menekankan 4
Jun menjunjung 4
Kan menekankan 4
ton dipertontonkan 2
ton ditonton 2
bar Barbarian 2
Bar Barbarian 2
Subword Length 3 - Most frequent subwords
Subword Count
ton 5
Tan 4
Din 3
Cin 2
ter 2
Jun 2
Ani 1
dan 1
Dan 1
nya 1
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.1751
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
di pendi-dikan 2
Di pendi-dikan 2
si posi-sinya 1
Si posi-sinya 1
tu ditu-tupinya 1
tu tertu-tup 1
An angan-angan 1
an angan-angan 1
du berpendu-duk 1
Subword Length 2 - Most frequent subwords
Subword Count
tu 2
di 1
Di 1
An 1
an 1
du 1
si 1
Si 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.2307
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
hal hal-hal 10
apa apa-apa 10
Hal hal-hal 10
Apa apa-apa 10
ibu ibu-ibu 6
Ibu ibu-ibu 6
jam jam-jam 3
Jam jam-jam 3
hak hak-hak 2
sah sah-sah 2
Subword Length 3 - Most frequent subwords
Subword Count
apa 3
Apa 3
hal 2
Hal 2
sel 2
abu 1
mal 1
hak 1
tim 1
Hak 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
1.0219
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
anak anak-anak 31
Anak anak-anak 31
tiba tiba-tiba 23
rata rata-rata 17
laki laki-laki 10
sama sama-sama 9
baru baru-baru 9
Baru baru-baru 9
Sama sama-sama 9
Hati hati-hati 8
Subword Length 4 - Most frequent subwords
Subword Count
Pura 5
Hari 4
kali 4
Kali 4
cita 4
hari 4
bagi 3
hati 3
Bagi 3
Hati 3
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
11.7877
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
benar benar-benar 35
Benar benar-benar 35
orang orang-orang 19
Orang orang-orang 19
teman teman-temannya 10
Teman teman-temannya 10
Jalan jalan-jalan 9
kasus kasus-kasus 9
jalan jalan-jalan 9
Kasus kasus-kasus 9
Subword Length 5 - Most frequent subwords
Subword Count
habis 3
Habis 3
orang 3
Orang 3
janji 2
retak 2
benar 2
mudah 2
Benar 2
akhir 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
24.2742
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
negara negara-negara 21
Negara negara-negara 21
Undang Undang-undang 10
undang Undang-undang 10
Undang Undang-Undang 8
undang Undang-Undang 8
Barang barang-barang 8
barang barang-barang 8
Undang undang-undang 6
undang undang-undang 6
Subword Length 6 - Most frequent subwords
Subword Count
undang 5
Undang 5
Daerah 2
negara 2
proyek 2
Negara 2
gereja 2
Proyek 2
partai 2
Gereja 2
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
34.0909
93102 msec needed at 2018-03-09 21:53