Corpus: mlt_wikipedia_2012_10K

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Po popolari 44
il-Papa 34
Papa 33
Il-Papa 31
Po popolazzjoni 23
et mietet 23
tal-Papa 21
Po il-popolazzjoni 16
l-Papa 13
papali 11
Subword Length 2 - Most frequent subwords
Subword Count
Po 46
Ta 43
43
ta 43
25
Si 7
6
re 5
Re 5
da 5
Amount of words containing repeated subwords of length 2 - per mille
Per mille
4.6789
Subword Length 3 - most frequent words
Subword Word Frequency
Bar Barbara 2
Bar Barbari 2
tir tirtira 1
car ċċarċar 1
far titfarfar 1
mad l-madmad 1
Bar Agatha Barbara 1
mar Marmara 1
Mar Marmara 1
iċ- Ic-Ic-Chair-person 1
Subword Length 3 - Most frequent subwords
Subword Count
Bar 5
Mar 1
mad 1
tir 1
far 1
ċar 1
car 1
iċ- 1
Iċ- 1
mar 1
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.3097
Subword Length 4 - most frequent words
Subword Word Frequency
għar għargħar 1
Għar għargħar 1
Wiki WikiWikiWeb 1
wiki WikiWikiWeb 1
Subword Length 4 - Most frequent subwords
Subword Count
Wiki 1
wiki 1
għar 1
Għar 1
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0795
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
Al tal-album 3
al tal-album 3
Al tal-Alleati 2
Al tal-alkoħol 2
Al tal-alleli 2
al tal-Alleati 2
al tal-alkoħol 2
al tal-alleli 2
Al tal-Albanija 1
Al tal-Albertina 1
Subword Length 2 - Most frequent subwords
Subword Count
Al 14
al 14
An 1
an 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.3655
Subword Length 3 - most frequent words with hyphen
Subword Word Frequency
Fil fil-film 7
fil fil-film 7
fil fil-Filosofija 1
fil fil-films 1
tat tat-tattiċi 1
mal mal-Maltin 1
Fil Fil-Fillandiż 1
Fil fil-Filippini 1
Fil fil-Filosofija 1
Fil fil-films 1
Subword Length 3 - Most frequent subwords
Subword Count
Fil 5
fil 5
mal 1
tat 1
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.1971
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
525051 msec needed at 2018-01-04 10:44