Values for some general parameters
parameter | value |
number of sentences | 1189133 |
average sentence length in characters | 117.1815 |
average sentence length in words | 18.1995 |
number of distinct word forms | 963830 |
percentage of lower case word forms | 53.7349 |
percentage of multi word units | 3.4828 |
number of running word forms | 24621139 |
percentage of lower case running words | 83.6231 |
average word form length | 10.8330 |
average running word length | 5.38108454 |
percentage of word forms with frequency=1 | 59.1997 |
number of sentence based co-occurrences | 2964718 |
minimal likelihood ratio (sentence based) | 6.63 |
maximal likelihood ratio (sentence based) | 51041.24 |
number of neighbour based co-occurrences | 445909 |
minimal likelihood ratio (neighbour based) | 3.84 |
maximal likelihood ratio (neighbour based) | 164075.00 |
average number of sentence based co-occurrences per sentence | 108.3027 |
average number of neighbour co-occurrences per sentence | 9.0661 |
most frequent word | og |
most frequent word's frequency | 703690 |
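
As a rough illustration of how several of the parameters above could be derived from a list of sentences, here is a minimal Python sketch. It is not the tool that produced this table: `corpus_statistics` is a hypothetical helper, and it assumes that word forms are case-sensitive, whitespace-separated tokens and that a "lower case" word form is one for which `str.islower()` is true. The original tokenizer and its handling of multi word units are unknown, so real values would differ.

```python
from collections import Counter


def corpus_statistics(sentences):
    """Compute a few corpus parameters for a non-empty list of sentence strings.

    Assumption: tokens are obtained with str.split(), which is only an
    approximation of whatever tokenizer produced the table.
    """
    if not sentences:
        raise ValueError("need at least one sentence")

    # Count every running token; the Counter keys are the distinct word forms.
    counts = Counter(tok for s in sentences for tok in s.split())
    if not counts:
        raise ValueError("sentences contain no tokens")

    running = sum(counts.values())   # number of running word forms (tokens)
    distinct = len(counts)           # number of distinct word forms (types)
    top_word, top_freq = counts.most_common(1)[0]

    return {
        "number of sentences": len(sentences),
        "average sentence length in characters":
            sum(len(s) for s in sentences) / len(sentences),
        "average sentence length in words": running / len(sentences),
        "number of distinct word forms": distinct,
        "number of running word forms": running,
        "percentage of lower case word forms":
            100 * sum(1 for w in counts if w.islower()) / distinct,
        "percentage of lower case running words":
            100 * sum(c for w, c in counts.items() if w.islower()) / running,
        "average word form length":
            sum(len(w) for w in counts) / distinct,
        "average running word length":
            sum(len(w) * c for w, c in counts.items()) / running,
        "percentage of word forms with frequency=1":
            100 * sum(1 for c in counts.values() if c == 1) / distinct,
        "most frequent word": top_word,
        "most frequent word's frequency": top_freq,
    }


if __name__ == "__main__":
    # Tiny made-up example, not taken from the corpus described above.
    for name, value in corpus_statistics(["og det og", "Det er bra og"]).items():
        print(f"{name} | {value} |")
```

The distinction between word forms (types) and running words (tokens) is what separates the paired rows in the table: the type-based averages weight every distinct form once, while the token-based averages weight each form by its frequency.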
Computation took 16603 msec (2017-12-06 11:30)