Corpus: fin_newscrawl_2017_300K

2.2.11 Repetitions

Typical repetitions within words
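
The tables below suggest that a "repetition" is a subword that occurs twice in direct succession inside a word (for example "ko" in "koko" or "ssa" in "Forssassa"), and that matching ignores case and diacritics (the tables list "ta" for "tätä"). A minimal sketch under those assumptions follows; the function names `fold` and `repeated_subwords` are hypothetical and are not taken from the tool that produced this report.

```python
import unicodedata


def fold(text):
    """Lowercase and strip diacritics (assumed normalisation, e.g. 'ä' -> 'a')."""
    decomposed = unicodedata.normalize("NFD", text.lower())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def repeated_subwords(word, length):
    """Return the folded subwords of `length` letters that occur twice in a row."""
    folded = fold(word)
    found = []
    # Slide a window over the word and compare it with the window right after it.
    for i in range(len(folded) - 2 * length + 1):
        chunk = folded[i:i + length]
        following = folded[i + length:i + 2 * length]
        if chunk.isalpha() and chunk == following and chunk not in found:
            found.append(chunk)
    return found


for word, length in [("koko", 2), ("kymmenen", 2), ("Forssassa", 3), ("talo", 2)]:
    print(word, length, repeated_subwords(word, length))
```

The sketch reports each subword once per word. Whether the original statistics also count overlapping or non-adjacent repeats cannot be determined from this report.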

Subword Length 2 - most frequent words
Subword Word Frequency
Ko koko 3650
ko koko 3650
an maanantaina 1063
en kymmenen 912
En kymmenen 912
ta tätä 887
Ko kokonaan 663
ko kokonaan 663
Ko Koko 443
ko Koko 443
Subword Length 2 - Most frequent subwords
Subword Count
ko 298
Ko 298
is 275
in 81
In 81
an 67
en 67
En 67
et 67
Et 67
Share of words containing repeated subwords of length 2 - per mille
Per mille
13.3719
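
A per-mille figure like the one above is 1000 times the number of matching words divided by the total number of words. The sketch below illustrates that arithmetic with a regular-expression backreference. It is an assumption that the share is taken over a plain word list; the report does not say whether distinct word forms or running tokens are counted. The name `per_mille_with_repeats` and the sample list are hypothetical.

```python
import re


def per_mille_with_repeats(words, length):
    """Per-mille share of `words` containing `length` letters twice in a row."""
    if not words:
        return 0.0
    # ([^\W\d_]{n}) captures n letters; \1 demands the same letters again at once.
    pattern = re.compile(r"([^\W\d_]{%d})\1" % length)
    hits = sum(1 for word in words if pattern.search(word.lower()))
    return 1000.0 * hits / len(words)


# Invented sample: two of the four words contain a doubled two-letter subword.
sample = ["koko", "talo", "kymmenen", "auto"]
print(per_mille_with_repeats(sample, 2))
```

Unlike the tables in this report, this sketch only lowercases the words and does not fold diacritics, so a pair such as "lä" and "la" would not be treated as equal.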
Subword Length 3 - most frequent words
Subword Word Frequency
lla illalla 260
Koo KooKoon 141
Koo KooKoo 127
sta keskustasta 79
ssä Forssassa 49
ssa Forssassa 49
lla Illalla 40
Ava avaavat 36
maa huomaamaan 24
Maa huomaamaan 24
Subword Length 3 - Most frequent subwords
Subword Count
lla 8
Koo 8
sta 6
ssä 6
ssa 6
Ava 5
vaa 4
all 3
Isä 3
Isa 3
Share of words containing repeated subwords of length 3 - per mille
Per mille
0.7083
Subword Length 4 - most frequent words
Subword Word Frequency
Perä peräperää 5
perä peräperää 5
höpö höpöhöpö 4
Höpö höpöhöpö 4
höpö Höpöhöpö 3
höpö höpöhöpöä 3
Ansa kansaansa 3
Höpö Höpöhöpö 3
Höpö höpöhöpöä 3
ansa kansaansa 3
Subword Length 4 - Most frequent subwords
Subword Count
höpö 3
Höpö 3
Perä 1
perä 1
Ansa 1
ansa 1
Share of words containing repeated subwords of length 4 - per mille
Per mille
0.0703
Subword Length 5 - most frequent words
Subword Word Frequency
mistä valmistamista 12
Mistä valmistamista 12
mistä omistamista 11
Mistä omistamista 11
Subword Length 5 - Most frequent subwords
Subword Count
mistä 2
Mistä 2
Share of words containing repeated subwords of length 5 - per mille
Per mille
0.0426
Subword Length 6 - most frequent words
Subword Word Frequency
lasten lastenlasten 9
Lasten lastenlasten 9
lapsen lapsenlapsen 5
Lapsen lapsenlapsen 5
lapsen lapsenlapsensa 3
Lapsen lapsenlapsensa 3
Subword Length 6 - Most frequent subwords
Subword Count
lapsen 2
Lapsen 2
lasten 1
Lasten 1
Share of words containing repeated subwords of length 6 - per mille
Per mille
0.1160
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
La Etelä-Lapissa 3
la Etelä-Lapissa 3
ti Ti-Ti 3
Ti Ti-Ti 3
Subword Length 2 - Most frequent subwords in words with hyphen
Subword Count
La 1
la 1
ti 1
Ti 1
Share of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0205
Share of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Share of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
1838918 msec needed at 2018-05-29 03:01