Values for some general parameters
parameter |
value |
number of sentences |
10000 |
average sentence length in characters |
105.1449 |
average sentence length in words |
15.5693 |
number of distinct word forms |
38793 |
percentage of lower case word forms |
63.4547 |
percentage of multi word units |
0.0026 |
number of running word forms |
197323 |
percentage of lower case running words |
81.5523 |
average word form length |
8.6948 |
average running word length |
5.68122798 |
percentage of word forms with frequency=1 |
71.0412 |
number of sentence based co-occurrences |
10988 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
1063.58 |
number of neighbour based co-occurrences |
3485 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
2478.95 |
average number of sentence based co-occurrences per sentence |
18.1540 |
average number of neighbour co-occurrences per sentence |
2.7880 |
most frequent word |
och |
frequent word's frequency |
5163 |
410 msec needed at 2018-01-22 19:00