Values for some general parameters
parameter |
value |
number of sentences |
16306317 |
average sentence length in characters |
104.3408 |
average sentence length in words |
17.3390 |
number of distinct word forms |
3672020 |
percentage of lower case word forms |
60.6719 |
percentage of multi word units |
0.0000 |
number of running word forms |
314390379 |
percentage of lower case running words |
85.2512 |
average word form length |
11.9878 |
average running word length |
4.97493392 |
percentage of word forms with frequency=1 |
57.0018 |
number of sentence based co-occurrences |
32310558 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
1128361.12 |
number of neighbour based co-occurrences |
3426042 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
3043863.00 |
average number of sentence based co-occurrences per sentence |
154.3193 |
average number of neighbour co-occurrences per sentence |
11.2796 |
most frequent word |
i |
frequent word's frequency |
7834023 |
145493 msec needed at 2017-10-27 06:20