Corpus: eus_wikipedia_2018_300K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
En lehenengo 1725
en lehenengo 1725
ko Mexikoko 952
Ko Mexikoko 952
En Lehenengo 706
en Lehenengo 706
Et betetzen 440
et betetzen 440
se frantsesez 355
Se frantsesez 355
Subword Length 2 - Most frequent subwords
Subword Count
en 160
En 160
ra 106
Ra 106
ko 97
Ko 97
go 84
Go 84
et 60
Et 60
Amount of words containing repeated subwords of length 2 - per mille
Per mille
12.6548
Subword Length 3 - most frequent words
Subword Word Frequency
ber berbera 110
Ber berbera 110
ber berberak 67
Ber berberak 67
are sarearen 58
Are sarearen 58
Bar Barbara 53
bar Barbara 53
Bär Barbara 53
are Barearen 39
Subword Length 3 - Most frequent subwords
Subword Count
are 18
Are 18
ber 16
Ber 16
Ari 14
ari 14
Bar 10
bar 10
Bär 10
Dar 4
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.0005
Subword Length 4 - most frequent words
Subword Word Frequency
aren eguzkiarenaren 11
Aren eguzkiarenaren 11
ETAk metaketak 5
Cali Calicalicus 4
bere berebere 3
Bere berebere 3
Make Makemake 3
Subword Length 4 - Most frequent subwords
Subword Count
aren 1
Aren 1
ETAk 1
Cali 1
Make 1
bere 1
Bere 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0814
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
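
The rows above suggest that a "repeated subword" is a letter sequence of length n that is immediately followed by itself, compared case-insensitively (for example "en" in "lehenengo", "ko" in "Mexikoko", "Make" in "Makemake"). The following is a minimal sketch under that assumption, not the tool that produced these tables. The names `find_repeat` and `per_mille` are hypothetical, and whether the published per-mille figures are computed over word types or word tokens is not stated in the source; the sketch simply uses whatever list it is given.

```python
import re
from typing import Iterable, Optional


def find_repeat(word: str, n: int) -> Optional[str]:
    """Return the first length-n letter sequence that is immediately
    repeated in `word` (case-insensitive), or None if there is none.

    Assumption: "repetition" means direct adjacency, as in lehen-en-en-go.
    """
    # [^\W\d_] matches any Unicode letter; \1 must follow the group directly.
    pattern = re.compile(rf"([^\W\d_]{{{n}}})\1", re.IGNORECASE)
    match = pattern.search(word)
    return match.group(1) if match else None


def per_mille(words: Iterable[str], n: int) -> float:
    """Share of the given words that contain a repeated subword of
    length n, expressed per mille (parts per thousand)."""
    words = list(words)
    if not words:
        return 0.0
    hits = sum(1 for w in words if find_repeat(w, n) is not None)
    return 1000.0 * hits / len(words)


# Illustrative sample only; four words are taken from the tables above,
# the other four are ordinary Basque words without such a repetition.
sample = ["lehenengo", "Mexikoko", "berbera", "Makemake",
          "etxea", "mendia", "ibaia", "hiria"]

for n in (2, 3, 4):
    print(n, per_mille(sample, n))
```

On this toy sample two of eight words qualify for length 2 and one each for lengths 3 and 4; the figures in the tables refer to the full corpus and cannot be reproduced from the sample.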
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ia ia-ia 44
Ia ia-ia 44
ia Ia-ia 5
Le Maule-Lextarre 5
Ia Ia-ia 5
ar Lizoainibar-Arriasgoitiko 5
le Maule-Lextarre 5
Ar Lizoainibar-Arriasgoitiko 5
La Castilla-La 4
la Castilla-La 4
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
ar 4
Ar 4
ia 2
Ia 2
Le 2
le 2
Po 2
La 1
la 1
Zi 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1244
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
bat bat-batean 69
Bat bat-batean 69
bat bat-bateko 35
Bat bat-bateko 35
oso oso-osorik 27
Oso oso-osorik 27
bat Bat-bateko 19
Bat Bat-bateko 19
bat Bat-batean 13
Bat Bat-batean 13
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
bat 6
Bat 6
oso 2
Oso 2
era 2
Era 2
den 2
Den 2
Nor 1
doi 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.1787
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
bete bete-betean 46
Erdi erdi-erdian 20
erdi erdi-erdian 20
soil soil-soilik 11
ondo ondo-ondoan 8
Ondo ondo-ondoan 8
bere bere-berea 4
bizi bizi-bizi 4
bizi bizi-bizirik 4
Bere bere-berea 4
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
bizi 3
Bizi 3
bete 2
bera 1
Erdi 1
Bera 1
erdi 1
albo 1
soil 1
Albo 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.2280
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
behin behin-behineko 627
Behin behin-behineko 627
behin Behin-behineko 20
Behin Behin-behineko 20
barra barra-barra 16
banan banan-banan 16
Barra barra-barra 16
Banan banan-banan 16
bakar bakar-bakarrik 13
behin behin-behinekoa 10
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
behin 7
Behin 7
txiki 3
Txiki 3
gazte 3
Gazte 3
behar 2
Behar 2
epeka 2
zuzen 2
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.8128
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
poliki poliki-poliki 24
apurka apurka-apurka 15
Apurka apurka-apurka 15
gehien gehien-gehienak 14
Gehien gehien-gehienak 14
bihotz bihotz-bihotzean 9
Bihotz bihotz-bihotzean 9
urtero urtero-urtero 7
Urtero urtero-urtero 7
poliki Poliki-poliki 7
Subword Length 6 - Most frequent subwords in words with hyphen
Subword Count
berdin 3
Berdin 3
poliki 2
gehien 2
Gehien 2
jainko 2
Jainko 2
apurka 1
Apurka 1
aldiro 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.8731
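
For the hyphenated case, the rows above (for example "bat-batean", "Maule-Lextarre", "behin-behineko") are consistent with a pattern in which n letters directly before a hyphen are repeated directly after it, ignoring case. The sketch below assumes exactly that, and further assumes that the "Count" column tallies distinct word forms per subword; neither assumption is confirmed by the source. `find_hyphen_repeat` and `subword_counts` are hypothetical names, and subwords are lowercased so that case variants fall into one bucket.

```python
import re
from collections import Counter
from typing import Iterable, Optional


def find_hyphen_repeat(word: str, n: int) -> Optional[str]:
    """Return the lowercased length-n letter sequence that appears on
    both sides of a hyphen in `word`, or None if there is none."""
    # n letters, a literal hyphen, then the same n letters (any case).
    pattern = re.compile(rf"([^\W\d_]{{{n}}})-\1", re.IGNORECASE)
    match = pattern.search(word)
    return match.group(1).lower() if match else None


def subword_counts(words: Iterable[str], n: int) -> Counter:
    """Count, per subword, how many distinct word forms repeat it
    across a hyphen (assumed meaning of the "Count" column)."""
    counts: Counter = Counter()
    for word in set(words):  # each distinct form is counted once
        sub = find_hyphen_repeat(word, n)
        if sub is not None:
            counts[sub] += 1
    return counts


# Illustrative word list built from forms that occur in the tables above.
words = ["bat-batean", "bat-bateko", "Bat-batean",
         "oso-osorik", "bat-batean", "Maule-Lextarre"]

print(subword_counts(words, 3).most_common())
print(subword_counts(words, 2).most_common())
```

Here "bat-batean" and "Bat-batean" count as two distinct forms, which mirrors the way the tables list capitalised and lowercase variants as separate words.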
1063407 msec needed at 2024-02-14 01:24