Corpus: por-pt_web_2016

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
çã aplicação 330231
ca aplicação 330231
aplicação 330231
Ca aplicação 330231
aplicação 330231
Es meses 262377
es meses 262377
És meses 262377
és meses 262377
és portugueses 198421
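
The rows above list `çã`, `ca` and `Ca` against the same word, which suggests that case and diacritics are folded before repetitions are searched for. The following is a minimal sketch under that assumption, not the tool's actual implementation; the function names `fold` and `repeated_subwords` and the rule that the two occurrences must not overlap are hypothetical choices made for illustration.

```python
import unicodedata


def fold(word):
    """Lowercase a word and strip combining marks (assumed normalisation)."""
    decomposed = unicodedata.normalize("NFD", word.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def repeated_subwords(word, n):
    """Return the folded substrings of length n that occur at least twice
    in the word without overlapping (the non-overlap rule is an assumption)."""
    text = fold(word)
    first_seen = {}   # substring -> index of its first occurrence
    repeated = set()
    for i in range(len(text) - n + 1):
        gram = text[i:i + n]
        if gram in first_seen:
            # Count it only if it starts after the first occurrence ends.
            if i - first_seen[gram] >= n:
                repeated.add(gram)
        else:
            first_seen[gram] = i
    return repeated


print(repeated_subwords("aplicação", 2))
print(repeated_subwords("dependendo", 3))
```

Under this folding, "aplicação" becomes "aplicacao", in which "ca" starts at positions 4 and 6, so the word would be reported for subword length 2.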
Subword Length 2 - Most frequent subwords
Subword Count
213
213
ca 213
Ca 213
çã 213
os 121
Os 121
da 118
118
Da 118
Amount of words containing repeated subwords of length 2 - per mille
Per mille
13.0585
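
The per mille figure can be read as the share of words that contain at least one repeated subword of the given length, multiplied by 1000. The sketch below illustrates that arithmetic on a toy word list. Whether the report counts distinct word forms or running tokens is not stated, so counting the items of a plain list is an assumption, as are the helper names `has_repeated_subword` and `per_mille`. For brevity this version lowercases only and does not fold diacritics.

```python
def has_repeated_subword(word, n):
    """True if some substring of length n occurs again later in the word
    without overlapping its first occurrence (assumed definition)."""
    text = word.lower()
    for i in range(len(text) - n + 1):
        gram = text[i:i + n]
        # Search only after the end of the current occurrence.
        if text.find(gram, i + n) != -1:
            return True
    return False


def per_mille(words, n):
    """Share of words with a repeated subword of length n, in per mille."""
    words = list(words)
    if not words:
        return 0.0
    hits = sum(1 for w in words if has_repeated_subword(w, n))
    return 1000.0 * hits / len(words)


# Toy list, not corpus data: two of the four words repeat a 2-letter subword.
sample = ["meses", "casa", "zerozero", "dia"]
print(per_mille(sample, 2))
```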
Subword Length 3 - most frequent words
Subword Word Frequency
End dependendo 21960
end dependendo 21960
End atendendo 8469
end atendendo 8469
bar Bárbara 7998
Bar Bárbara 7998
End Dependendo 6603
end Dependendo 6603
End defendendo 6277
end defendendo 6277
Subword Length 3 - Most frequent subwords
Subword Count
End 25
end 25
Ass 20
ass 20
bar 13
Bar 13
Tan 6
tar 6
And 5
and 5
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.1526
Subword Length 4 - most frequent words
Subword Word Frequency
zero zerozero 1030
Zero zerozero 1030
coro CoroCoro 418
Coro CoroCoro 418
Subword Length 4 - Most frequent subwords
Subword Count
zero 1
Zero 1
coro 1
Coro 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0365
Subword Length 5 - most frequent words
Subword Word Frequency
mente veementemente 835
Mente veementemente 835
jogos nJogosJogos 788
Jogos nJogosJogos 788
Subword Length 5 - Most frequent subwords
Subword Count
mente 1
Mente 1
jogos 1
Jogos 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0663
Subword Length 6 - most frequent words
Subword Word Frequency
Lisboa LisboaLisboa 481
lisboa LisboaLisboa 481
testes TESTESTESTESTEIndique 278
Testes TESTESTESTESTEIndique 278
Subword Length 6 - Most frequent subwords
Subword Count
Lisboa 1
lisboa 1
testes 1
Testes 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.1499
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Re pré-requisitos 1345
pré-requisitos 1345
pré-requisitos 1345
re pré-requisitos 1345
te Diverte-te 1079
Te Diverte-te 1079
Diverte-te 1079
te diverte-te 779
Te diverte-te 779
diverte-te 779
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
Re 6
6
6
re 6
te 3
Te 3
3
Pt 1
po 1
se 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.1820
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
cai cai-cai 670
caí cai-cai 670
Cai cai-cai 670
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
cai 1
caí 1
Cai 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0128
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Vera primavera-verão 1060
verá primavera-verão 1060
vera primavera-verão 1060
Verá primavera-verão 1060
Vera Primavera-Verão 602
verá Primavera-Verão 602
vera Primavera-Verão 602
Verá Primavera-Verão 602
ação investigação-ação 327
Ação investigação-ação 327
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
Vera 2
verá 2
vera 2
Verá 2
ação 1
Ação 1
Acão 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0547
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
assim assim-assim 296
Assim assim-assim 296
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
assim 1
Assim 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0331
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
1070783 msec needed at 2022-02-18 04:25