Korpus: glg_wikipedia_2016_300K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
En teñen 3793
en teñen 3793
es meses 1251
és meses 1251
Es meses 1251
os numerosos 768
ós numerosos 768
Ós numerosos 768
Os numerosos 768
Ås casas 518
Subword Length 2 - Most frequent subwords
Subword Count
ós 114
Ós 114
os 114
Os 114
103
da 103
103
Da 103
es 100
Es 100
Amount of words containing repeated subwords of length 2 - per mille
Per mille
11.6955
Subword Length 3 - most frequent words
Subword Word Frequency
end dependendo 350
End dependendo 350
Ant cantante 312
Ase baséase 188
Tar tartarugas 130
Tin Tintín 78
Ant cantantes 74
Tar tartaruga 63
end Dependendo 62
End Dependendo 62
Subword Length 3 - Most frequent subwords
Subword Count
End 27
end 27
Bar 9
Bär 9
Tar 9
bar 9
tan 5
Tan 5
Chi 4
chi 4
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.1830
Subword Length 4 - most frequent words
Subword Word Frequency
Make Makemake 12
Chin Cochinchina 4
Ador adoradores 4
amos amosamos 3
Amos amosamos 3
Amós amosamos 3
Subword Length 4 - Most frequent subwords
Subword Count
Make 1
Chin 1
Ador 1
amos 1
Amos 1
Amós 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0764
Subword Length 5 - most frequent words
Subword Word Frequency
mente vehementemente 6
Mente vehementemente 6
Subword Length 5 - Most frequent subwords
Subword Count
mente 1
Mente 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0367
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
es cidades-estado 7
Ma Ma-Ma 7
Es cidades-estado 7
ma Ma-Ma 7
Ma-Ma 7
és cidades-estado 7
co co-consagradores 5
Co co-consagradores 5
co-consagradores 5
co-consagradores 5
Subword Length 2 - Most frequent subwords
Subword Count
La 2
la 2
2
es 2
Es 2
és 2
Co 1
1
1
Ma 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0635
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Bio Bío-Bío 3
Bío Bío-Bío 3
Subword Length 3 - Most frequent subwords
Subword Count
Bio 1
Bío 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0129
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
aire aire-aire 7
Aire aire-aire 7
ágar ágar-ágar 3
Ágar ágar-ágar 3
Ares Ares-Ares 2
AREs Ares-Ares 2
Subword Length 4 - Most frequent subwords
Subword Count
aire 1
Aire 1
ágar 1
Ágar 1
Ares 1
AREs 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0573
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
dobre dobre-dobre 6
Dobre dobre-dobre 6
Subword Length 5 - Most frequent subwords
Subword Count
dobre 1
Dobre 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0367
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
célula célula-célula 10
Célula célula-célula 10
dobres dobres-dobres 6
Subword Length 6 - Most frequent subwords
Subword Count
célula 1
Célula 1
dobres 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.1757
2365437 msec needed at 2017-12-18 08:56