Korpus: ind_newscrawl-tufs5_2011_30K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Be beberapa 726
Si posisi 380
si posisi 380
Ba babak 190
di pendidikan 163
Di pendidikan 163
an makanan 155
Me memenangkan 154
ga gagal 148
Si sisi 126
Subword Length 2 - Most frequent subwords
Subword Count
an 147
Me 64
si 49
Si 49
di 39
Di 39
ja 27
ga 24
Is 23
is 23
Amount of words containing repeated subwords of length 2 - per mille
Per mille
19.9690
Subword Length 3 - most frequent words
Subword Word Frequency
Tan tantangan 57
tan tantangan 57
kan menekankan 19
ton tontonan 19
Kan menekankan 19
gan ketegangan 12
gan tunggangan 11
ton ditonton 8
gan perdagangan 8
Tan Tantangan 7
Subword Length 3 - Most frequent subwords
Subword Count
gan 14
Tan 6
tan 6
ton 6
nga 4
kan 4
Kan 4
jin 2
End 2
Ang 2
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.7844
Subword Length 4 - most frequent words
Subword Word Frequency
Lang melanglang 4
Haha HAHAHAHAHA 1
Haha HAHAHAHAHHA 1
Lang Melanglang 1
akan melakanakan 1
Akan melakanakan 1
Jung menjungjung 1
hehe hehehehehehe 1
Subword Length 4 - Most frequent subwords
Subword Count
Lang 2
Haha 2
hehe 1
akan 1
Akan 1
Jung 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.3552
Subword Length 5 - most frequent words
Subword Word Frequency
orang Orangorang 1
Orang Orangorang 1
kagum terkagumkagum 1
Kagum terkagumkagum 1
Subword Length 5 - Most frequent subwords
Subword Count
orang 1
Orang 1
kagum 1
Kagum 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1883
Subword Length 6 - most frequent words
Subword Word Frequency
ladang ladangladang 1
Subword Length 6 - Most frequent subwords
Subword Count
ladang 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.2030
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Ta talenta-talenta 4
ta talenta-talenta 4
ga Gangga-Gangga 1
ra Nusantara-raya 1
Ta Talenta-talenta 1
ta Talenta-talenta 1
di di-diksha 1
Di di-diksha 1
an angan-angan 1
an berangan-angan 1
Subword Length 2 - Most frequent subwords
Subword Count
Ta 2
ta 2
an 2
ga 1
ra 1
di 1
Di 1
Ha 1
ha 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.2136
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
hal hal-hal 62
Hal hal-hal 62
apa apa-apa 34
Apa apa-apa 34
tim tim-tim 28
Tim tim-tim 28
Sel sel-sel 22
sel sel-sel 22
Ide ide-ide 10
ide ide-ide 10
Subword Length 3 - Most frequent subwords
Subword Count
apa 4
Apa 4
hak 3
Hak 3
dua 3
jam 3
Dua 3
Jam 3
gol 3
Gol 3
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
1.6834
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Anak anak-anak 225
anak anak-anak 225
hari sehari-hari 67
Hari sehari-hari 67
sama sama-sama 49
Sama sama-sama 49
baru baru-baru 46
Baru baru-baru 46
rata rata-rata 45
Kata kata-kata 45
Subword Length 4 - Most frequent subwords
Subword Count
anak 6
Anak 6
hari 6
hati 6
Hari 6
Hati 6
kata 6
cita 6
Kata 6
Cita 6
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
14.1059
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Benar benar-benar 145
benar benar-benar 145
Orang orang-orang 117
orang orang-orang 117
nilai nilai-nilai 46
Nilai nilai-nilai 46
Karya karya-karya 35
karya karya-karya 35
Turut berturut-turut 24
turut berturut-turut 24
Subword Length 5 - Most frequent subwords
Subword Count
lebih 6
Lebih 6
Bunga 5
teman 5
Teman 5
bunga 5
macam 4
harap 4
besar 4
Besar 4
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
34.0802
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
pemain pemain-pemain 59
Pemain pemain-pemain 59
Tengah tengah-tengah 17
tengah tengah-tengah 17
lontar lontar-lontar 16
Lontar lontar-lontar 16
Tempat tempat-tempat 15
tempat tempat-tempat 15
Kadang kadang-kadang 15
kadang kadang-kadang 15
Subword Length 6 - Most frequent subwords
Subword Count
Undang 5
undang 5
gejala 4
Gejala 4
sering 3
pemain 3
Sering 3
Pemain 3
Minggu 3
minggu 3
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
40.0000
200915 msec needed at 2024-12-27 14:05