Korpus: lin_wikipedia_2014

Weitere Korpora

1.1 Summary

Values for some general parameters

parameter value
number of sentences 764
average sentence length in characters 112.3115
average sentence length in words 18.6047
number of distinct word forms 3871
percentage of lower case word forms 75.2260
percentage of multi word units 0.3617
number of running word forms 16040
percentage of lower case running words 86.6549
average word form length 6.9271
average running word length 4.97578197
percentage of word forms with frequency=1 64.5311
number of sentence based co-occurrences 1840
minimal likelihood ratio 6.65
maximal likelihood ratio 79.63
number of neighbour based co-occurrences 369
minimal likelihood ratio 3.85
maximal likelihood ratio 177.40
average number of sentence based co-occurrences per sentence 13.4058
average number of neighbour co-occurrences per sentence 2.8037
most frequent word na
frequent word's frequency 1168
681 msec needed at 2018-01-01 05:30