Korpus: deu_wikipedia_2016_30K

Weitere Korpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Ge gegen 428
En denen 187
en denen 187
Es dieses 172
Re mehrere 172
es dieses 172
mehrere 172
re mehrere 172
En verschiedenen 124
en verschiedenen 124
Subword Length 2 - Most frequent subwords
Subword Count
en 342
En 342
te 227
Te 227
Ge 188
er 143
Er 143
st 53
St 53
In 39
Amount of words containing repeated subwords of length 2 - per mille
Per mille
12.7169
Subword Length 3 - most frequent words
Subword Word Frequency
bar Barbara 15
Bar Barbara 15
Bär Barbara 15
Den stattfindenden 11
den stattfindenden 11
Den Bodendenkmale 7
den Bodendenkmale 7
Den entscheidenden 6
den entscheidenden 6
ein eineinhalb 5
Subword Length 3 - Most frequent subwords
Subword Count
den 23
Den 23
bar 19
Bar 19
Bär 19
Ten 7
ten 7
End 6
Süd 3
ein 2
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.8606
Subword Length 4 - most frequent words
Subword Word Frequency
Nord nordnordöstlich 1
Subword Length 4 - Most frequent subwords
Subword Count
Nord 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0143
Subword Length 5 - most frequent words
Subword Word Frequency
Luft- Luft-Luft-Lenkwaffen 1
Subword Length 5 - Most frequent subwords
Subword Count
Luft- 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0191
Subword Length 6 - most frequent words
Subword Word Frequency
Triebe Industriebetriebe 2
Triebe Industriebetrieben 1
Subword Length 6 - Most frequent subwords
Subword Count
Triebe 2
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0569
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
en Bergen-Enkheim 2
En Bergen-Enkheim 2
En Flöten-Ensemble 1
Ma Enigma-Maschinen 1
er Homburg-Ober-Erlenbach 1
es Hyères-Est 1
Er Homburg-Ober-Erlenbach 1
Es Hyères-Est 1
di Hindi-Dialekten 1
Ju Ju-Jutsus 1
Subword Length 2 - Most frequent subwords
Subword Count
er 7
Er 7
St 2
en 2
st 2
En 2
Te 1
Ju 1
Re 1
1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.2173
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Bag Bag-Bag 1
Ost Ost-Österreich 1
Süd süd-südöstlicher 1
Subword Length 3 - Most frequent subwords
Subword Count
Bag 1
Ost 1
Süd 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0349
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Luft Luft-Luft-Lenkwaffen 1
lose Lose-Lose 1
Lose Lose-Lose 1
Nord nord-nordwestlich 1
Subword Length 4 - Most frequent subwords
Subword Count
lose 1
Lose 1
Luft 1
Nord 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0429
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Baden Baden-Baden 1
Subword Length 5 - Most frequent subwords
Subword Count
Baden 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0191
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
2536751 msec needed at 2017-12-08 14:47