Corpus: spa-co_web_2017

2.2.11 Repetitions

Typical repetitions within words
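
The rows in the tables below are consistent with one reading of "repeated subword": a substring of length n that is immediately followed by a copy of itself, compared without regard to case or diacritics (which would explain variants such as en/En or Bogotá/Bogota/bogota listed for the same word). In the "with hyphen" tables, a single hyphen may sit between the two copies. This reading is inferred from the examples and is not a documented procedure of the corpus tooling. A minimal Python sketch under that assumption follows; the names `fold` and `repeated_subwords` are hypothetical.

```python
import unicodedata


def fold(ch):
    """Lowercase one character and drop its diacritics (assumed normalization)."""
    decomposed = unicodedata.normalize("NFD", ch)
    base = "".join(c for c in decomposed if not unicodedata.combining(c))
    return base.lower()


def repeated_subwords(word, n, allow_hyphen=False):
    """Return folded subwords of length n that are immediately repeated in word.

    With allow_hyphen=True, one hyphen may separate the two copies
    (assumption based on entries such as "pre-registro").
    """
    word = unicodedata.normalize("NFC", word)
    folded = [fold(ch) for ch in word]
    hits = []
    for i in range(len(folded) - n):
        first = folded[i:i + n]
        # Only consider subwords made of letters.
        if not all(c.isalpha() for c in first):
            continue
        j = i + n
        if allow_hyphen and j < len(word) and word[j] == "-":
            j += 1  # skip the hyphen between the two copies
        if folded[j:j + n] == first:
            hits.append("".join(first))
    return hits


if __name__ == "__main__":
    print(repeated_subwords("tienen", 2))
    print(repeated_subwords("Bárbara", 3))
    print(repeated_subwords("pre-registro", 2, allow_hyphen=True))
```

Folding is done per character so that indices in the folded sequence line up with the original word; a real pipeline might normalize differently (for example, treating ñ as distinct from n).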

Subword Length 2 - most frequent words
Subword Word Frequency
en tienen 14951
En tienen 14951
es meses 10423
Es meses 10423
Is análisis 9177
is análisis 9177
ci ejercicio 6842
Ci ejercicio 6842
Vi vivienda 4801
vivienda 4801
Subword Length 2 - Most frequent subwords
Subword Count
ra 186
os 127
Os 127
da 106
Da 106
106
vi 79
Vi 79
79
ar 76
Proportion of words containing repeated subwords of length 2 - per mille
Per mille
12.5964
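
A per mille value expresses a share per 1000. The page does not state which counts were used, so the sketch below assumes the figure is the number of matching words divided by the total number of words, times 1000. The function name `per_mille` and both of its arguments are hypothetical.

```python
def per_mille(matching_words, total_words):
    """Share of matching words per 1000 words (assumed definition)."""
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    return 1000.0 * matching_words / total_words


if __name__ == "__main__":
    # Illustrative numbers only, not taken from the corpus.
    print(round(per_mille(5, 1000), 4))
```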
Subword Length 3 - most frequent words
Subword Word Frequency
Ant cantante 1174
ant cantante 1174
Ant El cantante 173
ant El cantante 173
bar Bárbara 170
Bar Bárbara 170
bar Santa Bárbara 156
Bar Santa Bárbara 156
tan instantánea 146
Tan instantánea 146
Subword Length 3 - Most frequent subwords
Subword Count
bar 11
Bar 11
Ant 5
ant 5
tan 5
Tan 5
Ten 4
ten 4
and 3
And 3
Proportion of words containing repeated subwords of length 3 - per mille
Per mille
0.5369
Subword Length 4 - most frequent words
Subword Word Frequency
Chin Chinchiná 70
Chin Chinchina 25
Cali CaliCali 13
cali CaliCali 13
Calí CaliCali 13
Subword Length 4 - Most frequent subwords
Subword Count
Chin 2
Cali 1
cali 1
Calí 1
Proportion of words containing repeated subwords of length 4 - per mille
Per mille
0.0520
Proportion of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Subword Length 6 - most frequent words
Subword Word Frequency
Bogotá BogotáBogotá 24
Bogota BogotáBogotá 24
bogota BogotáBogotá 24
bogotá BogotáBogotá 24
Bogotà BogotáBogotá 24
Subword Length 6 - Most frequent subwords
Subword Count
Bogotá 1
Bogota 1
bogota 1
bogotá 1
Bogotà 1
Proportion of words containing repeated subwords of length 6 - per mille
Per mille
0.0664
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Sa es/sala-de-prensa/Themes/ISA-SalaPrensa/jquery-1.4.1 30
sa es/sala-de-prensa/Themes/ISA-SalaPrensa/jquery-1.4.1 30
re pre-registro 24
Re pre-registro 24
Co co-conspirador 8
co co-conspirador 8
lo Mulaló-Loboguerrero 7
Lo Mulaló-Loboguerrero 7
re pre-requisito 7
Re pre-requisito 7
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
re 2
Re 2
Sa 1
sa 1
Co 1
co 1
lo 1
Lo 1
Proportion of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0530
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
sur Sur-Sur 41
Sur Sur-Sur 41
gas gas-gasolina 20
Gas gas-gasolina 20
sur sur-sur 10
Sur sur-sur 10
Col Col-Col 7
Bio Bío-Bío 7
col Col-Col 7
bio Bío-Bío 7
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
sur 2
Sur 2
gas 1
Gas 1
Bio 1
bio 1
Col 1
col 1
des 1
Des 1
Proportion of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0749
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
gana gana-gana 16
Gana gana-gana 16
verá primavera-verano 8
Vera primavera-verano 8
vera primavera-verano 8
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
gana 1
Gana 1
verá 1
Vera 1
vera 1
Proportion of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0347
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
doble doble-doble 9
Doble doble-doble 9
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
doble 1
Doble 1
Proportion of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0303
Proportion of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
1023383 msec needed at 2020-06-28 00:17