Korpus: tgl_wikipedia_2018_100K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ma pamamagitan 2498
Ma pamamagitan 2498
am pamamagitan 2498
Am pamamagitan 2498
pamamagitan 2498
Ti titik 1783
ti titik 1783
in sining 1276
In sining 1276
La kilalang 1236
Subword Length 2 - Most frequent subwords
Subword Count
pa 685
Pa 685
ka 649
Ka 649
649
ḷa 534
ḻa 534
La 534
la 534
an 479
Amount of words containing repeated subwords of length 2 - per mille
Per mille
70.9998
Subword Length 3 - most frequent words
Subword Word Frequency
Say kasaysayan 641
say kasaysayan 641
ang nangangahulugang 345
Nga nangangahulugang 345
ANg nangangahulugang 345
Ang nangangahulugang 345
nga nangangahulugang 345
ang Silangang 221
ANg Silangang 221
Ang Silangang 221
Subword Length 3 - Most frequent subwords
Subword Count
ang 174
Ang 174
ANg 174
nga 103
Nga 103
Bay 30
bay 30
Say 28
say 28
Ngu 21
Amount of words containing repeated subwords of length 3 - per mille
Per mille
6.5053
Subword Length 4 - most frequent words
Subword Word Frequency
Kala kalakalan 115
Kala pangkalakalan 63
Sing singsing 43
sing singsing 43
ding dingding 32
Ding dingding 32
Bini Binibining 27
Bini Binibining Pilipinas 23
taas Kataastaasang 22
Taas Kataastaasang 22
Subword Length 4 - Most frequent subwords
Subword Count
Kala 14
Akal 5
Ning 5
sama 5
Sama 5
Sing 4
sing 4
Bini 4
ding 3
Ding 3
Amount of words containing repeated subwords of length 4 - per mille
Per mille
1.6911
Subword Length 5 - most frequent words
Subword Word Frequency
Banal Kabanalbanalang 19
banal Kabanalbanalang 19
tangi katangitanging 2
Tangi katangitanging 2
crush Crushcrushcrush 1
Crush Crushcrushcrush 1
Subword Length 5 - Most frequent subwords
Subword Count
Banal 1
banal 1
tangi 1
Tangi 1
crush 1
Crush 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1108
Subword Length 6 - most frequent words
Subword Word Frequency
galang galanggalangan 2
Galang galanggalangan 2
pansin Kapansinpansin 1
Subword Length 6 - Most frequent subwords
Subword Count
galang 1
Galang 1
pansin 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.1579
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ka pinaka-karaniwang 29
Ka pinaka-karaniwang 29
pinaka-karaniwang 29
ka pinaka-karaniwan 11
Ka pinaka-karaniwan 11
pinaka-karaniwan 11
Yo yo-yo 9
yo yo-yo 9
ka pinaka-kamakailang 4
Ka pinaka-kamakailang 4
Subword Length 2 - Most frequent subwords
Subword Count
ka 7
Ka 7
7
Sa 3
ṣa 3
ng 3
Ng 3
sa 3
ag 2
Ag 2
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.3265
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
una kauna-unahang 240
Una kauna-unahang 240
ari ari-arian 71
Ari ari-arian 71
isa kaisa-isang 36
iba iba-ibang 36
Isa kaisa-isang 36
Iba iba-ibang 36
Isá kaisa-isang 36
iba iba-iba 31
Subword Length 3 - Most frequent subwords
Subword Count
iba 15
Iba 15
una 7
Una 7
isa 6
Isa 6
Isá 6
ari 3
Ari 3
Jun 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.6545
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
sari sari-saring 85
Sari sari-saring 85
Sári sari-saring 85
araw araw-araw 79
Araw araw-araw 79
ulit paulit-ulit 62
Unti unti-unting 60
unti unti-unting 60
araw pang-araw-araw 56
Araw pang-araw-araw 56
Subword Length 4 - Most frequent subwords
Subword Count
sama 17
Sama 17
araw 6
Araw 6
sari 5
Sari 5
hati 5
Sári 5
Hati 5
Hatî 5
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
3.5215
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
bagay bagay-bagay 52
Bagay bagay-bagay 52
Sunod sunod-sunod 34
sunod sunod-sunod 34
Sabay sabay-sabay 24
sabay sabay-sabay 24
tangi katangi-tanging 23
balik pabalik-balik 23
Tangi katangi-tanging 23
tuloy tuloy-tuloy 21
Subword Length 5 - Most frequent subwords
Subword Count
alang 10
Laban 5
lipat 5
laban 5
dahan 4
nilay 4
sunod 3
salit 3
Sunod 3
watak 3
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
3.3227
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
minsan paminsan-minsang 33
Minsan paminsan-minsang 33
MInsan paminsan-minsang 33
Kabayo kabayo-kabayohan 24
kabayo kabayo-kabayohan 24
minsan paminsan-minsan 23
Minsan paminsan-minsan 23
MInsan paminsan-minsan 23
pantay pagkakapantay-pantay 21
Pantay pagkakapantay-pantay 21
Subword Length 6 - Most frequent subwords
Subword Count
minsan 4
Minsan 4
MInsan 4
kabayo 4
Kabayo 4
pantay 4
Pantay 4
pansin 4
galang 3
Galang 3
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
2.6046
1925450 msec needed at 2019-06-04 12:42