Korpus: ind-in_web_2014_300K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
be beberapa 5946
Be beberapa 5946
Se seseorang 2096
se seseorang 2096
Ul Rasulullah 1798
ul Rasulullah 1798
An makanan 1536
an makanan 1536
du duduk 1250
Du duduk 1250
Subword Length 2 - Most frequent subwords
Subword Count
An 268
an 268
Me 128
me 128
si 74
Si 74
ka 57
Ka 57
en 53
En 53
Amount of words containing repeated subwords of length 2 - per mille
Per mille
24.6050
Subword Length 3 - most frequent words
Subword Word Frequency
din dinding 360
Din dinding 360
All shallallahu 336
all shallallahu 336
gin menginginkan 314
Gin menginginkan 314
All Shallallahu 311
all Shallallahu 311
Tan tantangan 195
tan tantangan 195
Subword Length 3 - Most frequent subwords
Subword Count
gan 25
Gan 25
All 18
all 18
tan 11
Tan 11
Tun 10
ton 10
Ton 10
Din 10
Amount of words containing repeated subwords of length 3 - per mille
Per mille
2.8844
Subword Length 4 - most frequent words
Subword Word Frequency
hehe hehehehe 24
Hehe hehehehe 24
kang terkangkang 13
Kang terkangkang 13
gkan mengangkangkan 13
haha hahahaha 9
Haha hahahaha 9
Gong menggonggong 9
gong menggonggong 9
nggo menggonggong 9
Subword Length 4 - Most frequent subwords
Subword Count
kang 8
Kang 8
hehe 5
Hehe 5
gkan 5
haha 4
Haha 4
wkwk 3
Wkwk 3
Gong 2
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.9803
Subword Length 5 - most frequent words
Subword Word Frequency
ayat- ayat-ayat-Nya 5
benar benarbenar 2
Benar benarbenar 2
anak- anak-anak- 2
Anak- anak-anak- 2
alat- Alat-alat-kekuasaan 1
Subword Length 5 - Most frequent subwords
Subword Count
ayat- 1
anak- 1
Anak- 1
benar 1
Benar 1
alat- 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1958
Subword Length 6 - most frequent words
Subword Word Frequency
hamba- hamba-hamba-Nya 24
hamba- hamba-hamba-Ku 8
Rasul- Rasul-rasul-Nya 6
hamba- hamba-hamba-Mu 3
Rasul- Rasul-Rasul-Nya 3
Rasul- rasul-rasul-Nya 3
hehehe hehehehehehe 2
Hehehe hehehehehehe 2
kitab- kitab-kitab-Nya 2
Subword Length 6 - Most frequent subwords
Subword Count
hamba- 3
Rasul- 3
hehehe 1
Hehehe 1
kitab- 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.9877
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
sy Asy-Syaikh 28
Sy Asy-Syaikh 28
Al Al-Albani 19
al Al-Albani 19
an angan-angan 18
An angan-angan 18
ng ngomong-ngomong 18
Ng ngomong-ngomong 18
sy Asy-Syafi’i 17
Sy Asy-Syafi’i 17
Subword Length 2 - Most frequent subwords
Subword Count
sy 34
Sy 34
sh 20
Al 9
al 9
th 8
Th 8
an 8
An 8
Ng 4
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
1.1932
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
hal hal-hal 564
Hal hal-hal 564
apa apa-apa 544
Apa apa-apa 544
APa apa-apa 544
sia sia-sia 130
Sel sel-sel 125
sel sel-sel 125
ibu ibu-ibu 44
Ibu ibu-ibu 44
Subword Length 3 - Most frequent subwords
Subword Count
sia 11
apa 10
Apa 10
APa 10
Dua 9
dua 9
Ada 6
ada 6
Abu 5
abu 5
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
1.8706
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Tiba tiba-tiba 813
tiba tiba-tiba 813
Anak anak-anak 759
anak anak-anak 759
laki laki-laki 751
Laki laki-laki 751
Kata kata-kata 543
kata kata-kata 543
satu satu-satunya 316
Satu satu-satunya 316
Subword Length 4 - Most frequent subwords
Subword Count
anak 13
Anak 13
hari 13
Hari 13
hati 11
Hati 11
baik 10
Baik 10
Kata 9
cita 9
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
16.5251
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
orang orang-orang 2503
Orang orang-orang 2503
Benar benar-benar 1282
benar benar-benar 1282
teman teman-teman 510
Teman teman-teman 510
orang Orang-orang 270
Orang Orang-orang 270
teman teman-temannya 198
Teman teman-temannya 198
Subword Length 5 - Most frequent subwords
Subword Count
gesek 10
teman 10
Teman 10
orang 9
Orang 9
gosok 7
Gosok 7
lebih 7
remas 7
Lebih 7
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
44.9851
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
Masing masing-masing 749
masing masing-masing 749
Negara negara-negara 186
negara negara-negara 186
kadang kadang-kadang 148
Kadang kadang-kadang 148
Barang barang-barang 123
barang barang-barang 123
tengah tengah-tengah 116
Tengah tengah-tengah 116
Subword Length 6 - Most frequent subwords
Subword Count
bayang 8
Bayang 8
masing 6
Masing 6
banyak 6
Banyak 6
goyang 6
Goyang 6
hitung 5
Hitung 5
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
64.1975
1268462 msec needed at 2018-04-29 19:19