Korpus: tgl_wikipedia_2016_100K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
ma pamamagitan 2447
Ma pamamagitan 2447
am pamamagitan 2447
Am pamamagitan 2447
pamamagitan 2447
pamamagitan 2447
in sining 1337
In sining 1337
La kilalang 1267
la kilalang 1267
Subword Length 2 - Most frequent subwords
Subword Count
pa 725
Pa 725
ka 711
Ka 711
711
La 553
la 553
an 490
An 490
ta 403
Amount of words containing repeated subwords of length 2 - per mille
Per mille
72.0256
Subword Length 3 - most frequent words
Subword Word Frequency
say kasaysayan 636
Say kasaysayan 636
say Kasaysayan 399
Say Kasaysayan 399
ANg nangangahulugang 380
Ang nangangahulugang 380
ang nangangahulugang 380
Nga nangangahulugang 380
nga nangangahulugang 380
ANg Silangang 247
Subword Length 3 - Most frequent subwords
Subword Count
ang 185
Ang 185
ANg 185
nga 119
Nga 119
bay 30
Bay 30
Say 29
say 29
Bar 20
Amount of words containing repeated subwords of length 3 - per mille
Per mille
6.4819
Subword Length 4 - most frequent words
Subword Word Frequency
Sing singsing 54
sing singsing 54
ding dingding 30
Ding dingding 30
Bini Binibining 30
taas Kataastaasang 25
Taas Kataastaasang 25
Bini Binibining Pilipinas 20
paki kapakipakinabang 15
mang kamangmangan 11
Subword Length 4 - Most frequent subwords
Subword Count
sama 5
Sama 5
Bini 4
Sing 3
sing 3
Ning 3
ding 3
Ding 3
Māyā 2
mang 2
Amount of words containing repeated subwords of length 4 - per mille
Per mille
1.2534
Subword Length 5 - most frequent words
Subword Word Frequency
Banal Kabanalbanalang 19
banal Kabanalbanalang 19
Banál Kabanalbanalang 19
hanga kahangahangang 3
crush Crushcrushcrush 2
Crush Crushcrushcrush 2
hanga Kahangahangang 1
tangi Katangitanging 1
Subword Length 5 - Most frequent subwords
Subword Count
hanga 2
Banal 1
banal 1
Banál 1
crush 1
Crush 1
tangi 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1785
Subword Length 6 - most frequent words
Subword Word Frequency
minsan paminsanminsang 2
Minsan paminsanminsang 2
galang galanggalangan 2
Galang galanggalangan 2
pansin Kapansinpansin 1
galang Kagalanggalangang 1
Galang Kagalanggalangang 1
Subword Length 6 - Most frequent subwords
Subword Count
galang 2
Galang 2
minsan 1
Minsan 1
pansin 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.3033
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ka pinaka-karaniwang 32
Ka pinaka-karaniwang 32
pinaka-karaniwang 32
ka pinaka-karaniwan 10
Ka pinaka-karaniwan 10
pinaka-karaniwan 10
ka maka-kaliwang 4
ka pinaka-kamakailang 4
Ka maka-kaliwang 4
Ka pinaka-kamakailang 4
Subword Length 2 - Most frequent subwords
Subword Count
ka 7
Ka 7
7
ag 4
Ag 4
ng 4
Ng 4
ga 3
Ga 3
Re 2
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.4521
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
una kauna-unahang 225
Una kauna-unahang 225
ari ari-arian 63
Ari ari-arian 63
iba iba-ibang 41
Iba iba-ibang 41
ISa kaisa-isang 34
isa kaisa-isang 34
Isa kaisa-isang 34
iba iba-iba 32
Subword Length 3 - Most frequent subwords
Subword Count
iba 13
Iba 13
una 9
Una 9
ISa 8
isa 8
Isa 8
Aya 3
aya 3
ala 2
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.7434
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
sari sari-saring 83
Sari sari-saring 83
ulit paulit-ulit 81
araw araw-araw 74
Araw araw-araw 74
Unti unti-unting 69
unti unti-unting 69
araw pang-araw-araw 53
Araw pang-araw-araw 53
Taas Kataas-taasang 46
Subword Length 4 - Most frequent subwords
Subword Count
sama 20
Sama 20
hati 7
Hati 7
Hatî 7
araw 6
Araw 6
Taas 5
sari 5
Sari 5
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
3.6035
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Bagay bagay-bagay 45
bagay bagay-bagay 45
Sunod sunod-sunod 41
sunod sunod-sunod 41
Sunod pagkakasunod-sunod 30
sunod pagkakasunod-sunod 30
Tuloy tuloy-tuloy 25
tuloy tuloy-tuloy 25
tangi katangi-tanging 23
sabay sabay-sabay 22
Subword Length 5 - Most frequent subwords
Subword Count
alang 12
Alang 12
sunod 5
Sunod 5
Laban 4
lipat 4
Lipat 4
nilay 4
dahan 4
Dahan 4
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
3.3563
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
Minsan paminsan-minsang 35
minsan paminsan-minsang 35
pansin kapansin-pansin 23
Minsan paminsan-minsan 21
minsan paminsan-minsan 21
Kabayo kabayo-kabayohan 18
kabayo kabayo-kabayohan 18
Pantay pagkakapantay-pantay 17
pantay pagkakapantay-pantay 17
Minsan Paminsan-minsan 16
Subword Length 6 - Most frequent subwords
Subword Count
pansin 5
pantay 5
Pantay 5
Galang 4
minsan 4
Minsan 4
kabayo 4
Kabayo 4
galang 4
milyon 2
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
2.9568
2122725 msec needed at 2018-01-24 20:44