Corpus: vol_wikipedia_2011_100K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
da bidädas 7023
Pöpinumamabür 5376
ma Pöpinumamabür 5376
Ma Pöpinumamabür 5376
da bidäda 4696
Lo lölöfik 30
an pänan 22
An pänan 22
ta Kretata 21
Lo lölöfiko 12
Subword Length 2 - Most frequent subwords
Subword Count
an 55
An 55
ta 33
Ön 22
On 22
ön 22
on 22
en 17
in 15
In 15
Amount of words containing repeated subwords of length 2 - per mille
Per mille
5.2247
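
The word lists above are consistent with a simple reading of "repeated subword": a character sequence of length n that is immediately followed by itself, compared without regard to case or diacritics (bidädas contains dä + da, Pöpinumamabür contains ma + ma). The following Python sketch illustrates that reading. It is an assumption inferred from the listed examples, not the documented procedure behind these tables. The names `fold` and `repeated_subwords` are hypothetical, and dropping hyphens before comparing is a further assumption made so that entries such as Baden-Baden match.

```python
import unicodedata


def fold(word):
    """Lowercase a word and strip diacritics and hyphens (assumed normalization)."""
    decomposed = unicodedata.normalize("NFD", word)
    # Drop combining marks so that "ä" compares equal to "a".
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return base.casefold().replace("-", "")


def repeated_subwords(word, n):
    """Return the distinct length-n subwords that are immediately repeated in word."""
    s = fold(word)
    found = []
    # Compare each window of length n with the window that directly follows it.
    for i in range(len(s) - 2 * n + 1):
        chunk = s[i:i + n]
        if chunk == s[i + n:i + 2 * n] and chunk not in found:
            found.append(chunk)
    return found


if __name__ == "__main__":
    for w, n in [("bidädas", 2), ("Pöpinumamabür", 2), ("Oberderdingen", 3)]:
        print(w, n, repeated_subwords(w, n))
```

The function returns the folded form of the subword, so case and accent variants such as "ma" and "Ma" collapse into a single entry.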
Subword Length 3 - most frequent words
Subword Word Frequency
Bar Bárbara 3
cin Cincinnati 2
Ins Coinsins 1
gen Gengenbach 1
que Couquèques 1
San Sansan 1
fel Staffelfelden 1
Érd Oberderdingen 1
Bar Barbara 1
Bar Barbarano 1
Subword Length 3 - Most frequent subwords
Subword Count
Bar 5
San 1
fel 1
cin 1
Chi 1
Les 1
Ins 1
que 1
gen 1
Côa 1
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.3692
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
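
The per mille values can be read as 1000 times the number of words containing a repeated subword of the given length, divided by the number of words considered. The Python sketch below shows that arithmetic on a small hand-made list. The denominator (all distinct words in the list) is an assumption, since the source does not state it, and the helper names `has_repeat` and `per_mille` are hypothetical.

```python
import unicodedata


def has_repeat(word, n):
    """True if some length-n subword is immediately repeated (case and accents ignored)."""
    decomposed = unicodedata.normalize("NFD", word)
    s = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    s = s.casefold().replace("-", "")  # assumed normalization, not from the source
    return any(s[i:i + n] == s[i + n:i + 2 * n] for i in range(len(s) - 2 * n + 1))


def per_mille(words, n):
    """Share of words with a repeated length-n subword, in per mille."""
    words = list(words)
    if not words:
        return 0.0
    hits = sum(1 for w in words if has_repeat(w, n))
    return 1000.0 * hits / len(words)


if __name__ == "__main__":
    sample = ["Cincinnati", "Gengenbach", "Volapük", "Berlin"]  # illustrative only
    print(f"{per_mille(sample, 3):.4f}")
```

Formatting the result with four decimal places matches the layout of the values reported in this section.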
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ön Castleton-on-Hudson 2
on Castleton-on-Hudson 2
Ön Castleton-on-Hudson 2
On Castleton-on-Hudson 2
Le Neuvelle-lès-Lure 1
Le Capelle-lès-Hesdin 1
Le Neuville-lès-Wasigny 1
Le Celle-les-Bordes 1
Le Poule-les-Écharmeaux 1
Le Cornillé-les-Caves 1
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
Le 28
De 4
de 4
ön 3
on 3
Ön 3
La 3
On 3
3
la 3
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.7558
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Sur Vaux-sur-Sûre 1
sur Vaux-sur-Sûre 1
Les Campigneulles-les-Petites 1
Les Corcelles-les-Monts 1
Les Courcelles-lès-Lens 1
Les Courcelles-lès-Montbard 1
Les Culles-les-Roches 1
Les Marles-les-Mines 1
Les Marolles-les-Buis 1
Sai Cisai-Saint-Aubin 1
Subword Length 3 - Most frequent subwords in words with hyphen
Subword Count
Les 11
Sai 1
Sur 1
sur 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.2999
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Mont Beaumont-Monteux 1
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
Mont 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0340
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Baden Baden-Baden 1
Rohan Frontenay-Rohan-Rohan 1
Subword Length 5 - Most frequent subwords in words with hyphen
Subword Count
Baden 1
Rohan 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.1155
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
751525 msec needed at 2018-01-28 15:50