Korpus: fra-pf_web_2016_300K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Me même 6505
me même 6505
même 6505
%d candidat 2877
%d demande 2590
%d vendredi 1845
te tête 948
tête 948
Te tête 948
ti compétition 869
Subword Length 2 - Most frequent subwords
Subword Count
%d 1299
er 104
me 68
Me 68
68
ti 62
Re 56
re 56
56
56
Amount of words containing repeated subwords of length 2 - per mille
Per mille
22.8790
Subword Length 3 - most frequent words
Subword Word Frequency
%de demande 2590
%de décidé 720
%de demander 554
%de demandes 526
%de demandé 526
Ent représentent 388
ent représentent 388
%de compteDemander 264
Ent présentent 241
ent présentent 241
Subword Length 3 - Most frequent subwords
Subword Count
%de 169
ent 31
Ent 31
Ass 10
Tei 6
Are 5
rea 5
are 5
Rea 5
Nou 4
Amount of words containing repeated subwords of length 3 - per mille
Per mille
3.9615
Subword Length 4 - most frequent words
Subword Word Frequency
cher chercher 271
Cher chercher 271
cher rechercher 152
Cher rechercher 152
tapu Taputapuatea 132
Tapu Taputapuatea 132
Rent rentrent 34
tapu Taputapuātea 24
Tapu Taputapuātea 24
cher Rechercher 22
Subword Length 4 - Most frequent subwords
Subword Count
cher 9
Cher 9
tapu 5
Tapu 5
chou 4
Reva 2
Tara 2
reva 2
re’a 2
pehe 2
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.8938
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Subword Length 6 - most frequent words
Subword Word Frequency
Mahina MahinaMahina 2
mahina MahinaMahina 2
Odette d’OdetteOdette 2
Mähina MahinaMahina 2
Subword Length 6 - Most frequent subwords
Subword Count
Mahina 1
mahina 1
Mähina 1
Odette 1
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.1542
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Co éco-collège 18
co éco-collège 18
Te compte-tenu 11
te compte-tenu 11
Re pré-requis 11
re pré-requis 11
pré-requis 11
compte-tenu 11
pré-requis 11
pré-rentrée 8
Subword Length 2 - Most frequent subwords
Subword Count
%d 9
Co 9
co 9
Re 4
re 4
4
4
2
mi 2
Mi 2
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.2981
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
éco éco-école 9
Eco éco-école 9
éco Éco-École 7
Eco Éco-École 7
Tei Tei-Tei 7
éco Eco-Ecole 5
Eco Eco-Ecole 5
Upa upa-upa 3
upa upa-upa 3
17h 17h-17h15 1
Subword Length 3 - Most frequent subwords
Subword Count
éco 3
Eco 3
upa 1
12h 1
17h 1
Tei 1
Upa 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0921
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Bora Bora-Bora 49
bora Bora-Bora 49
boum boum-boum 5
Pago Pago-Pago 4
Puka Puka-Puka 3
mahi mahi-mahi 3
Mahi mahi-mahi 3
Nord Nord-Nord-Ouest 2
nord Nord-Nord-Ouest 2
Maru maru-maru 2
Subword Length 4 - Most frequent subwords
Subword Count
Puka 1
mahi 1
Mahi 1
Nord 1
nord 1
Maru 1
maru 1
Bora 1
bora 1
boum 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.1331
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Coupe coupe-coupe 4
coupe coupe-coupe 4
coupé coupe-coupe 4
Coupé coupe-coupe 4
ylang ylang-ylang 2
Ylang ylang-ylang 2
Subword Length 5 - Most frequent subwords
Subword Count
ylang 1
Ylang 1
Coupe 1
coupe 1
coupé 1
Coupé 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0687
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
1076973 msec needed at 2018-04-25 06:08