Korpus: fra-ch_web_2012_300K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
me même 5632
Me même 5632
%o formation 3814
%o toujours 3324
%o informations 2307
%o disposition 2066
%o conditions 1693
%o propose 1603
%o pouvoir 1581
%o fonction 1505
Subword Length 2 - Most frequent subwords
Subword Count
%o 6922
er 75
me 60
Me 60
31
31
Re 31
re 31
es 27
ès 27
Amount of words containing repeated subwords of length 2 - per mille
Per mille
78.4542
Subword Length 3 - most frequent words
Subword Word Frequency
� jusqu�� 106
Hua chihuahua 95
� cr�� 76
� d��tre 60
� th��tre 45
tes testés 45
Tes testés 45
bar Barbara 44
Bar Barbara 44
Bär Barbara 44
Subword Length 3 - Most frequent subwords
Subword Count
� 150
Ass 10
bar 9
Bar 9
Bär 9
Tan 7
tes 6
Tes 6
ant 5
cou 3
Amount of words containing repeated subwords of length 3 - per mille
Per mille
2.8986
Subword Length 4 - most frequent words
Subword Word Frequency
cher chercher 334
Cher chercher 334
cher rechercher 136
Cher rechercher 136
cher Rechercher 14
Cher Rechercher 14
B� b�b� 14
cher cherchera 12
Cher cherchera 12
cher Chercher 10
Subword Length 4 - Most frequent subwords
Subword Count
cher 10
Cher 10
B� 3
chou 3
Chou 3
Rent 1
Call 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.3176
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
%o micro-organismes 27
Pa Europa-Park 19
pa Europa-Park 19
%o co-fondateur 15
%o micro-ondes 15
%o médico-social 14
%o socio-économique 12
Re pré-requis 11
re pré-requis 11
pré-requis 11
Subword Length 2 - Most frequent subwords
Subword Count
%o 96
Re 5
re 5
5
5
Pa 3
pa 3
ch 2
Ch 2
er 2
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
1.2275
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Che sèche-cheveux 22
Ché sèche-cheveux 22
Che s�che-cheveux 5
Ché s�che-cheveux 5
eau eau-eau 4
Eau eau-eau 4
pré Pré-Presse 3
Pré Pré-Presse 3
cha cha-cha-cha 3
Cha cha-cha-cha 3
Subword Length 3 - Most frequent subwords
Subword Count
Che 2
Ché 2
cha 2
Cha 2
chà 2
eau 1
Eau 1
pré 1
Pré 1
air 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0875
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
plan Plan-Plan 2
Plan Plan-Plan 2
zoom Zoom-Zoom 2
Zoom Zoom-Zoom 2
Subword Length 4 - Most frequent subwords
Subword Count
plan 1
Plan 1
zoom 1
Zoom 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0353
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
cache cache-cache 5
caché cache-cache 5
Cache cache-cache 5
Caché cache-cache 5
train train-train 4
Train train-train 4
Baden Baden-Baden 2
Subword Length 5 - Most frequent subwords
Subword Count
cache 1
caché 1
Cache 1
Caché 1
train 1
Train 1
Baden 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0933
Subword Length 6 - most frequent words with hyphen
Subword Word Frequency
moitié moitié-moitié 5
Subword Length 6 - Most frequent subwords
Subword Count
moitié 1
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0675
1086790 msec needed at 2023-11-10 01:25