Corpus: spa_newscrawl_2015_300K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
en tienen 3748
En tienen 3748
Es meses 2523
es meses 2523
an mañana 2122
mañana 2122
na mañana 2122
Na mañana 2122
Is crisis 1401
is crisis 1401
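
The rows above pair each word with a subword that occurs in it at least twice. A minimal sketch of such a check in Python follows; it is not the tool that produced this report. The names `fold` and `repeated_subwords` are hypothetical. The case- and accent-insensitive comparison and the requirement that the two occurrences do not overlap are assumptions inferred from rows such as mañana, which is listed with the subwords an and na.

```python
import unicodedata


def fold(word):
    """Lowercase and strip combining marks (assumed comparison rule)."""
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def repeated_subwords(word, n):
    """Return the folded substrings of length n that occur at least twice
    in `word` without overlapping (non-overlap is an assumption)."""
    folded = fold(word)
    first_seen = {}   # substring -> index of its first occurrence
    repeated = set()
    for i in range(len(folded) - n + 1):
        gram = folded[i:i + n]
        if gram not in first_seen:
            first_seen[gram] = i
        elif i - first_seen[gram] >= n:
            # A later occurrence that starts after the first one ends.
            repeated.add(gram)
    return repeated


print(repeated_subwords("tienen", 2))          # {'en'}
print(sorted(repeated_subwords("mañana", 2)))  # ['an', 'na']
print(repeated_subwords("Chihuahua", 3))       # {'hua'}
```

The sketch reports every repeated substring of the requested length. The report may filter further: doble-doble, for example, appears only in the length 5 hyphen table, which suggests that shorter repeats contained in a longer one are not listed.
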
Subword Length 2 - Most frequent subwords
Subword Count
Os 110
os 110
Da 93
da 93
vi 81
Vi 81
81
Ar 76
es 66
Es 66
Proportion of words containing repeated subwords of length 2 - per mille
Per mille
10.3867
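
A per-mille value is a share scaled to one thousand, so 10.3867 per mille corresponds to roughly 1.04 percent. The short Python sketch below shows the arithmetic. The function names are hypothetical and the counts in the first call are made-up illustration values; the report does not state whether the base is word types or word tokens.

```python
def per_mille(matching, total):
    """Share of `matching` items among `total`, expressed per thousand."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 1000.0 * matching / total


def per_mille_to_percent(value):
    """Convert a per-mille value to percent."""
    return value / 10.0


# Made-up counts, for illustration only: 5 matching words out of 2000.
print(per_mille(5, 2000))                       # 2.5

# The value reported above for subwords of length 2.
print(round(per_mille_to_percent(10.3867), 5))  # 1.03867
```
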
Subword Length 3 - most frequent words
Subword Word Frequency
Hua Chihuahua 86
bar barbarie 45
Bar barbarie 45
bar Bárbara 38
Bar Bárbara 38
tan instantánea 28
Tan instantánea 28
bar bárbaro 23
Bar bárbaro 23
bar Santa Bárbara 22
Subword Length 3 - Most frequent subwords
Subword Count
bar 12
Bar 12
Chi 4
tan 4
Tan 4
and 4
And 4
Hua 3
ten 3
Ten 3
Proportion of words containing repeated subwords of length 3 - per mille
Per mille
0.6698
Subword Length 4 - most frequent words
Subword Word Frequency
jaja jajajaja 5
jaja jajajajaja 3
jaja Jajajaja 2
Subword Length 4 - Most frequent subwords
Subword Count
jaja 3
Proportion of words containing repeated subwords of length 4 - per mille
Per mille
0.0551
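
The Count column in the subword tables appears to give the number of distinct word forms that contain the repeated subword, not a token frequency: the three words listed for jaja match its count of 3. That reading is an inference from the tables, not something the report states. A small Python sketch of this counting, with a hypothetical function name and the three length 4 rows copied from above:

```python
from collections import defaultdict

# Rows copied from the "Subword Length 4 - most frequent words" table:
# (subword, word, frequency)
rows = [
    ("jaja", "jajajaja", 5),
    ("jaja", "jajajajaja", 3),
    ("jaja", "Jajajaja", 2),
]


def subword_type_counts(rows):
    """Count distinct word forms per subword, ignoring token frequency."""
    words_by_subword = defaultdict(set)
    for subword, word, _frequency in rows:
        words_by_subword[subword].add(word)
    return {sub: len(words) for sub, words in words_by_subword.items()}


print(subword_type_counts(rows))  # {'jaja': 3}
```

Word forms are compared case-sensitively here, so jajajaja and Jajajaja count as two forms, which is what the count of 3 requires.
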
Subword Length 5 - most frequent words
Subword Word Frequency
Rojas RojasRojas 2
rojas RojasRojas 2
Subword Length 5 - Most frequent subwords
Subword Count
Rojas 1
rojas 1
Proportion of words containing repeated subwords of length 5 - per mille
Per mille
0.0337
Proportion of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
la Castilla-La 57
La Castilla-La 57
Re re-reelección 8
re re-reelección 8
re-reelección 8
la Castilla-la 2
La Castilla-la 2
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
la 2
La 2
Re 1
re 1
1
Proportion of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0316
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Sur Sur-Sureste 2
sur Sur-Sureste 2
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
Sur 1
sur 1
Proportion of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0126
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
verá Primavera-Verano 3
verá primavera-verano 3
Vera Primavera-Verano 3
Vera primavera-verano 3
vera Primavera-Verano 3
vera primavera-verano 3
Verá Primavera-Verano 3
Verá primavera-verano 3
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
verá 2
Vera 2
vera 2
Verá 2
Proportion of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0368
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
doble doble-doble 13
Doble doble-doble 13
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
doble 1
Doble 1
Proportion of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0337
Proportion of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
Computation took 1060609 ms (2021-08-01 02:27)