Values for some general parameters
parameter |
value |
number of sentences |
799605 |
average sentence length in characters |
284.2499 |
average sentence length in words |
11.6178 |
number of distinct word forms |
1011581 |
percentage of lower case word forms |
94.1653 |
percentage of multi word units |
0.8654 |
number of running word forms |
10896635 |
percentage of lower case running words |
98.3391 |
average word form length |
32.0677 |
average running word length |
23.50906982 |
percentage of word forms with frequency=1 |
63.9058 |
number of sentence based co-occurrences |
1622618 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
108845.88 |
number of neighbour based co-occurrences |
253634 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
160407.27 |
average number of sentence based co-occurrences per sentence |
23.7469 |
average number of neighbour co-occurrences per sentence |
3.0403 |
most frequent word |
ஒரு |
frequent word's frequency |
84513 |
14427 msec needed at 2018-01-23 18:00