Values for some general parameters
Parameter |
Value |
Number of sentences |
300000 |
Average sentence length in characters |
232.1189 |
Average sentence length in words |
19.7495 |
Number of distinct word forms |
291744 |
Number of distinct word forms (without multiwords) |
289631 |
Percentage of lower case word forms |
59.4429 |
Number of multi word units |
2113 |
Percentage of multi word units |
0.7243 |
Number of running word forms |
5923825 |
Number of running word forms (without multiwords) |
5919703 |
Percentage of lower case running words |
84.7559 |
Average word form length |
16.6174 |
Average running word length |
10.64780277 |
Percentage of word forms with frequency=1 |
53.5123 |
Number of sentence based co-occurrences |
1266820 |
- minimal likelihood ratio |
-61236.31 |
- maximal likelihood ratio |
19440.74 |
Number of neighbour based co-occurrences |
210558 |
- minimal likelihood ratio |
-31418.71 |
- maximal likelihood ratio |
47905.60 |
Average number of sentence based co-occurrences per sentence |
106.4574 |
Average number of neighbour co-occurrences per sentence |
9.8370 |
Most frequent word |
και |
Frequent word's frequency |
198365 |
2741 msec needed at 2018-02-23 23:03