Korpus: ell_web_2012

Weitere Korpora

1.1 Summary

Values for some general parameters

Parameter Value
Number of sentences 1457833
Average sentence length in characters 208.7530
Average sentence length in words 17.4216
Number of distinct word forms 832053
Number of distinct word forms (without multiwords) 826613
Percentage of lower case word forms 57.2811
Number of multi word units 5440
Percentage of multi word units 0.6538
Number of running word forms 25375400
Number of running word forms (without multiwords) 25322604
Percentage of lower case running words 85.2524
Average word form length 16.7509
Average running word length 10.94433092
Percentage of word forms with frequency=1 55.2902
Number of sentence based co-occurrences 5855934
- minimal likelihood ratio 6.63
- maximal likelihood ratio 94444.97
Number of neighbour based co-occurrences 736924
- minimal likelihood ratio 3.84
- maximal likelihood ratio 217959.12
Average number of sentence based co-occurrences per sentence 115.5412
Average number of neighbour co-occurrences per sentence 9.8922
Most frequent word και
Most frequent word's frequency 983725
14786 msec needed at 2019-12-21 08:00