Corpus: sin_wikipedia_2016_100K

Other corpora

2.2.11 Repetitions

Typical repetitions within words

Subword Length 2 - most frequent words
Subword Word Frequency
Ti Partitions 5
em Remember 3
Na Senanayake 2
is WinISIS 2
Ar Arars 2
Is WinISIS 2
Ti Partition 2
Ti partition 2
ar Arars 2
Ti otitis 1
Subword Length 2 - Most frequent subwords
Subword Count
Ti 9
is 4
Is 4
Ch 4
al 3
An 3
an 3
Al 3
em 2
co 2
Amount of words containing repeated subwords of length 2 - per mille
Per mille
0.5233
Subword Length 3 - most frequent words
Subword Word Frequency
Has Arthashastra(බුද්ධ 1
has Arthashastra(බුද්ධ 1
Subword Length 3 - Most frequent subwords
Subword Count
has 1
Has 1
Amount of words containing repeated subwords of length 3 - per mille
Per mille
0.0139
Amount of words containing repeated subwords of length 4 - per mille
Per mille
0.0000
Subword Length 5 - most frequent words
Subword Word Frequency
200px 200px200px200px 1
Subword Length 5 - Most frequent subwords
Subword Count
200px 1
Amount of words containing repeated subwords of length 5 - per mille
Per mille
0.0484
Amount of words containing repeated subwords of length 6 - per mille
Per mille
0.0000
Subword Length 2 - most frequent words with hyphen
Subword Word Frequency
en open-ended 1
Subword Length 2 - Most frequent subwords
Subword Count
en 1
Amount of words with hyphen containing repeated subwords of length 2 - per mille
Per mille
0.0107
Amount of words with hyphen containing repeated subwords of length 3 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 4 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 5 - per mille
Per mille
0.0000
Amount of words with hyphen containing repeated subwords of length 6 - per mille
Per mille
0.0000
510240 msec needed at 2018-01-18 18:16