Values for some general parameters
parameter |
value |
number of sentences |
290396 |
average sentence length in characters |
126.1416 |
average sentence length in words |
19.3880 |
number of distinct word forms |
277912 |
percentage of lower case word forms |
68.0276 |
percentage of multi word units |
0.0000 |
number of running word forms |
6266869 |
percentage of lower case running words |
84.2761 |
average word form length |
9.0395 |
average running word length |
5.44799760 |
percentage of word forms with frequency=1 |
58.4149 |
number of sentence based co-occurrences |
947114 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
41295.95 |
number of neighbour based co-occurrences |
125575 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
67228.66 |
average number of sentence based co-occurrences per sentence |
108.2566 |
average number of neighbour co-occurrences per sentence |
9.6494 |
most frequent word |
ya |
frequent word's frequency |
288917 |
5290 msec needed at 2017-10-29 01:18