Corpus: pol_newscrawl_2018_300K

2.2.11 Repetitions

Typical repetitions within words
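
Judging from the listed examples ("ci" in "właściciel", "tu" in "Instytutu", "bar" in "Barbara"), the tables below appear to count words in which a subword of a given length occurs twice in a row, compared case-insensitively. The following is a minimal Python sketch of such a check, not the tool that produced this report. The function names `repeated_subwords` and `per_mille_with_repeats` are hypothetical, and computing the per-mille share over distinct word forms (rather than running tokens) is an assumption.

```python
def repeated_subwords(word, n):
    """Return the lowercase subwords of length n that occur twice in a row."""
    lowered = word.lower()
    found = set()
    # Slide a window of length n and compare it with the next n characters.
    for i in range(len(lowered) - 2 * n + 1):
        chunk = lowered[i:i + n]
        if chunk.isalpha() and lowered[i + n:i + 2 * n] == chunk:
            found.add(chunk)
    return found


def per_mille_with_repeats(words, n):
    """Share (per mille) of the given word forms containing such a repeat.

    Assumption: the share is taken over distinct word forms.
    """
    words = list(words)
    if not words:
        return 0.0
    hits = sum(1 for w in words if repeated_subwords(w, n))
    return 1000.0 * hits / len(words)
```

For instance, `repeated_subwords("zatrudnienie", 3)` finds the subword "nie", matching the corresponding row in the length 3 table.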

Subword Length 2 - most frequent words
Subword Word Frequency
ci właściciel 203
Ci właściciel 203
tu Instytutu 187
Tu Instytutu 187
na znana 178
Na znana 178
ci właścicieli 163
Ci właścicieli 163
ci właściciela 155
Ci właściciela 155
Subword Length 2 - most frequent subwords
Subword Count
Li 36
li 36
35
ja 35
Ja 35
35
35
ci 33
Ci 33
na 25
Proportion of words containing repeated subwords of length 2 - per mille
Per mille
3.6552
Subword Length 3 - most frequent words
Subword Word Frequency
bar Barbara 143
Bar Barbara 143
nie zatrudnienie 89
Nie zatrudnienie 89
NIe zatrudnienie 89
nie zapewnienie 78
Nie zapewnienie 78
NIe zapewnienie 78
nie zwolnienie 73
Nie zwolnienie 73
Subword Length 3 - most frequent subwords
Subword Count
nie 106
Nie 106
NIe 106
nią 22
nia 22
nià 22
bar 12
Bar 12
cię 6
Cię 6
Proportion of words containing repeated subwords of length 3 - per mille
Per mille
1.9711
Proportion of words containing repeated subwords of length 4 - per mille
Per mille
0.0000
Proportion of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Proportion of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
ka Kosiniaka-Kamysza 5
ów JOW-ów 4
Ów JOW-ów 4
Subword Length 2 - most frequent subwords in words with hyphen
Subword Count
ka 1
ów 1
Ów 1
Proportion of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0208
Proportion of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Proportion of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Proportion of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Proportion of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
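
In the hyphenated examples above, the repeated subword seems to straddle the hyphen ("ka-Ka" in "Kosiniaka-Kamysza"). A rough Python sketch of that variant follows, under the assumption that the hyphen sits exactly between the two copies and that comparison is case-insensitive. The function name `hyphen_repeat` is hypothetical. This sketch does not fold diacritics, so it would not reproduce the "JOW-ów" row, which presumably relies on a collation that treats "o" and "ó" as equal.

```python
import re


def hyphen_repeat(word, n):
    """Return the lowercase subword of length n repeated across a hyphen, or None."""
    # [^\W\d_] matches a single letter; the backreference \1 is compared
    # case-insensitively because of re.IGNORECASE.
    pattern = rf"([^\W\d_]{{{n}}})-\1"
    match = re.search(pattern, word, flags=re.IGNORECASE)
    return match.group(1).lower() if match else None
```

For example, `hyphen_repeat("Kosiniaka-Kamysza", 2)` yields "ka", while a word such as "biało-czerwony" yields no match.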
1833516 msec needed at 2019-06-24 08:42