Corpus: nno_wikipedia_2012

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
är innbyggjarar 4920
år innbyggjarar 4920
ar innbyggjarar 4920
Ar innbyggjarar 4920
År innbyggjarar 4920
er varierer 947
Er varierer 947
Et Universitetet 639
Te Universitetet 639
et Universitetet 639
Subword Length 2 - Most frequent subwords
Subword Count
år 602
År 602
ar 602
är 602
Ar 602
Er 210
er 210
in 122
In 122
Ne 113
Amount of words containing repeated subwords of length 2 - per mille
Per mille
18.2760
Subword Length 3 - most frequent words
Subword Word Frequency
tar startar 619
den nordenden 109
Den nordenden 109
nan einannan 82
Nan einannan 82
NaN einannan 82
Tan representantane 60
Esk menneskeskapte 52
Ing svingingar 50
bar Barbara 47
Subword Length 3 - Most frequent subwords
Subword Count
Ing 12
bar 10
Bar 10
tar 6
vêr 6
ver 6
Ver 6
Vêr 6
vér 6
Tan 5
Amount of words containing repeated subwords of length 3 - per mille
Per mille
1.1608
Subword Length 4 - most frequent words
Subword Word Frequency
sone sonesonen 18
Sone sonesonen 18
para Paraparaumu 8
Para Paraparaumu 8
Pará Paraparaumu 8
sone Sonesonen 4
make Makemake 4
Make Makemake 4
Sone Sonesonen 4
Subword Length 4 - Most frequent subwords
Subword Count
sone 2
Sone 2
Make 1
para 1
Para 1
Pará 1
make 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0735
Subword Length 5 - most frequent words
Subword Word Frequency
barne barnebarnet 29
Barne barnebarnet 29
Stene gudstenestene 16
barne Barnebarnet 5
Barne Barnebarnet 5
Stene tryggleikstenestene 4
Stene Gudstenestene 3
Subword Length 5 - Most frequent subwords
Subword Count
Stene 3
barne 2
Barne 2
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.1506
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
la Castilla-La 10
La Castilla-La 10
Castilla-La 10
se Troitse-Sergijeva 6
Se Troitse-Sergijeva 6
Go Go-Go 5
go Go-Go 5
ya Ya-Ya’s 4
Ya Ya-Ya’s 4
me Dame-messa 3
Subword Length 2 - Most frequent subwords
Subword Count
me 1
Me 1
la 1
1
La 1
1
1
se 1
Se 1
Go 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0526
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
sør sør-sørvest 4
Sør sør-sørvest 4
Wah wah-wah 4
sur Esch-sur-Sûre 3
Sur Esch-sur-Sûre 3
Subword Length 3 - Most frequent subwords
Subword Count
sør 1
Sør 1
Wah 1
sur 1
Sur 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0387
Subword Length 4 - most frequent words with hyphen
Subword Word Frequency
nord nord-nordvest 5
Nord nord-nordvest 5
Subword Length 4 - Most frequent subwords
Subword Count
nord 1
Nord 1
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0184
Subword Length 5 - most frequent words with hyphen
Subword Word Frequency
Baden Baden-Baden 12
Subword Length 5 - Most frequent subwords
Subword Count
Baden 1
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0301
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
3371099 msec needed at 2018-01-09 00:56