Values for some general parameters
parameter |
value |
number of sentences |
297713 |
average sentence length in characters |
118.7805 |
average sentence length in words |
16.4193 |
number of distinct word forms |
219234 |
percentage of lower case word forms |
48.6895 |
percentage of multi word units |
0.0000 |
number of running word forms |
5553151 |
percentage of lower case running words |
76.9010 |
average word form length |
7.9173 |
average running word length |
6.18524493 |
percentage of word forms with frequency=1 |
55.7103 |
number of sentence based co-occurrences |
930628 |
minimal likelihood ratio |
6.63 |
maximal likelihood ratio |
32200.59 |
number of neighbour based co-occurrences |
143504 |
minimal likelihood ratio |
3.84 |
maximal likelihood ratio |
49694.78 |
average number of sentence based co-occurrences per sentence |
67.4766 |
average number of neighbour co-occurrences per sentence |
7.0548 |
most frequent word |
yang |
frequent word's frequency |
142574 |
5086 msec needed at 2018-01-05 11:30