Corpus: pol-com_web_2018_100K

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
eM systemem 158
Ni niniejszej 152
ni niniejszej 152
ci właścicieli 67
Ci właścicieli 67
Ni Niniejsza 54
ni Niniejsza 54
kakao 53
eM problemem 53
na znana 48
Subword Length 2 - Most frequent subwords
Subword Count
ja 34
34
Ja 34
34
34
na 34
Na 34
An 31
an 31
że 30
Share of words containing repeated subwords of length 2 - per mille
Per mille
4.1625
Subword Length 3 - most frequent words
Subword Word Frequency
nią udostępniania 51
Tom TomTom 51
nia udostępniania 51
tom TomTom 51
Nią udostępniania 51
nie zapewnienie 47
Nie zapewnienie 47
NIe zapewnienie 47
bar rabarbar 27
Bar rabarbar 27
Subword Length 3 - Most frequent subwords
Subword Count
nie 101
Nie 101
NIe 101
nią 30
nia 30
Nią 30
cię 12
Cie 12
cie 12
bar 12
Share of words containing repeated subwords of length 3 - per mille
Per mille
2.2888
Subword Length 4 - most frequent words
Subword Word Frequency
Ikea IKEAIKEA 3
Call CallCallback 1
call CallCallback 1
hehe Hehehehehe 1
Subword Length 4 - Most frequent subwords
Subword Count
Ikea 1
Call 1
call 1
hehe 1
Share of words containing repeated subwords of length 4 - per mille
Per mille
0.0541
Share of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Subword Length 6 - most frequent words
Subword Word Frequency
zmiany ZmianyZmiany 4
Zmiany ZmianyZmiany 4
Dublin DublinDublin 1
Subword Length 6 - Most frequent subwords
Subword Count
zmiany 1
Zmiany 1
Dublin 1
Share of words containing repeated subwords of length 6 - per mille
Per mille
0.1577
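The report does not state how a repeated subword is detected. Judging from the listed examples (systemem, rabarbar, TomTom), a word appears to qualify when a substring of length n occurs twice in a row, ignoring case. The Python sketch below implements that reading only. The names `repeated_subwords` and `per_mille` are hypothetical, and the report does not say whether the per-mille figure is taken over word types or tokens, so the helper simply works on whatever list of words it receives.

```python
def repeated_subwords(word, n):
    """Return all length-n substrings that occur twice in a row in `word`.

    Assumption: matching is case-insensitive and the two copies must be
    directly adjacent (e.g. "em" + "em" in "systemem").
    """
    w = word.lower()
    return {
        w[i:i + n]
        for i in range(len(w) - 2 * n + 1)
        if w[i:i + n] == w[i + n:i + 2 * n]
    }


def per_mille(words, n):
    """Share (in per mille) of the given words that contain a repeated subword."""
    words = list(words)
    if not words:
        return 0.0
    hits = sum(1 for w in words if repeated_subwords(w, n))
    return 1000.0 * hits / len(words)


if __name__ == "__main__":
    print(repeated_subwords("systemem", 2))   # {'em'}
    print(repeated_subwords("rabarbar", 3))   # {'bar'}
    print(per_mille(["kakao", "dom", "kot", "las"], 2))  # 250.0
```

This sketch lowercases every subword, whereas the tables list several case variants (for example ni and Ni) with identical counts, so it will not reproduce the published rows exactly.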
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
at AT-AT 2
At AT-AT 2
Pl pl-pl 2
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
at 1
At 1
Pl 1
Share of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0210
Share of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
Gadu Gadu-Gadu 2
Subword Length 4 - Most frequent subwords in words with hyphen
Subword Count
Gadu 1
Share of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0180
Share of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
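For the hyphenated entries (AT-AT, pl-pl, Gadu-Gadu), the repeated subword appears on both sides of the hyphen. A minimal sketch of that check, assuming case-insensitive matching and a single hyphen directly between the two copies, could look as follows. The function name `hyphen_repeats` is hypothetical and the report's actual procedure is not documented here.

```python
import re


def hyphen_repeats(word, n):
    """Return length-n subwords that appear as `sub-sub` inside `word`.

    Assumption: the word is lowercased first, and the two copies are
    separated by exactly one hyphen.
    """
    # The lookahead lets overlapping candidates be tested at every position.
    pattern = re.compile(r"(?=(\w{%d})-\1)" % n)
    return {m.group(1) for m in pattern.finditer(word.lower())}


if __name__ == "__main__":
    print(hyphen_repeats("Gadu-Gadu", 4))       # {'gadu'}
    print(hyphen_repeats("AT-AT", 2))           # {'at'}
    print(hyphen_repeats("biało-czerwony", 2))  # set()
```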
1870934 msec needed at 2019-09-22 00:34