Rank in Wordlist | Frequency | Word |
---|---|---|
5082 | 29 | 5,000 |
5199 | 28 | 1,000 |
6315 | 21 | 10,000 |
8313 | 14 | 15,000 |
8315 | 14 | 2,000 |
9172 | 12 | 20,000 |
9174 | 12 | 25,000 |
9179 | 12 | 4,000mAh |
9691 | 11 | 3,000 |
9696 | 11 | 40,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
10265 | 10 | 10% |
10975 | 9 | 5% |
10976 | 9 | 50% |
11780 | 8 | 90% |
12658 | 7 | 100% |
13883 | 6 | 30% |
17419 | 4 | 3% |
17458 | 4 | 70% |
17474 | 4 | 95% |
20297 | 3 | 20% |
Rank in Wordlist | Frequency | Word |
---|---|---|
15408 | 5 | IL&FS |
25322 | 2 | J&K |
25449 | 2 | S&P |
36965 | 1 | M&M |
37262 | 1 | R&D |
Rank in Wordlist | Frequency | Word |
---|---|---|
124 | 1593 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
185 | 1077 | .' |
33181 | 2 | है'। |
33715 | 1 | 112'अब |
37135 | 1 | O'Dwyer |
37595 | 1 | Video'कबीर |
42331 | 1 | कप'WC |
42721 | 1 | कहा,''मानसिक |
42722 | 1 | कहा,''मुख्यमंत्री |
42723 | 1 | कहा,'आजादी |
42724 | 1 | कहा,'किसी |
Rank in Wordlist | Frequency | Word |
---|---|---|
20352 | 3 | 3GB+32GB |
20373 | 3 | 4GB+64GB |
34258 | 1 | 2+2 |
59372 | 1 | रैम+128जीबी |
59373 | 1 | रैम+64जीबी |
Rank in Wordlist | Frequency | Word |
---|---|---|
10274 | 10 | 26/11 |
11779 | 8 | 9/11 |
13952 | 6 | https://t |
17472 | 4 | 8GB/128GB |
17943 | 4 | एससी/एसटी |
20503 | 3 | I/O |
20611 | 3 | f/1.8 |
20612 | 3 | f/2.4 |
21449 | 3 | क्रेडिट/डेबिट |
24705 | 2 | 1/2 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots