Values for some general parameters
parameter |
value |
number of sentences |
3571556 |
average sentence length in characters |
103.3309 |
average sentence length in words |
15.2329 |
number of distinct word forms |
2174616 |
percentage of lower case word forms |
53.5695 |
percentage of multi word units |
5.0054 |
number of running word forms |
61406928 |
percentage of lower case running words |
80.7935 |
average word form length |
11.0062 |
average running word length |
5.75020239 |
percentage of word forms with frequency=1 |
58.4053 |
number of sentence based co-occurrences |
9748754 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
1856007.25 |
number of neighbour based co-occurrences |
1072247 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
5618911.00 |
average number of sentence based co-occurrences per sentence |
106.1041 |
average number of neighbour co-occurrences per sentence |
8.5631 |
most frequent word |
och |
frequent word's frequency |
1660257 |
40500 msec needed at 2018-01-23 02:30