Words containing ,:
Rank in Wordlist | Frequency | Word |
---|---|---|
549 | 125 | 1,5 |
797 | 83 | 0,5 |
1312 | 52 | 2,5 |
2198 | 30 | 0,2 |
2275 | 29 | 0,3 |
2276 | 29 | 0,8 |
3015 | 21 | 0,7 |
3143 | 20 | 0,4 |
3598 | 17 | 0,6 |
4533 | 13 | 1,2 |

Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
6635 | 8 | 10% |
6668 | 8 | 70% |
7340 | 7 | 60% |
8211 | 6 | 25% |
8223 | 6 | 5% |
10492 | 5 | l'1% |
11074 | 4 | 100% |
11216 | 4 | 40% |
11234 | 4 | 50% |
13625 | 3 | 12% |

Words containing &:
Rank in Wordlist | Frequency | Word |
---|---|---|
14994 | 3 | Pianelli&Traversa |
30665 | 1 | A&S |
39180 | 1 | H&M |

Words containing $:
Rank in Wordlist | Frequency | Word |
---|---|---|
50910 | 1 | Vi$$er |

Words containing ":
Rank in Wordlist | Frequency | Word |
---|---|---|
2508 | 26 | ." |

Words containing ':
Rank in Wordlist | Frequency | Word |
---|---|---|
3 | 20737 | l'é |
17 | 5374 | ch'a |
18 | 5343 | l'ha |
43 | 2114 | l'era |
44 | 2057 | n'aira |
45 | 2056 | s'ëstend |
66 | 1207 | l'han |
97 | 727 | l'avìa |
145 | 448 | ch'as |
154 | 429 | d'un |

Words containing +:
Rank in Wordlist | Frequency | Word |
---|---|---|
12055 | 4 | a+ib |
27449 | 2 | x+1 |
27583 | 1 | 0+ib |
29290 | 1 | 3+3 |
50456 | 1 | U+3021 |
50457 | 1 | U+3029 |
56566 | 1 | d'a+ib |

Words containing /:
Rank in Wordlist | Frequency | Word |
---|---|---|
47 | 1955 | ab/km |
73 | 1101 | 26/05/2014 |
76 | 1056 | ab/km² |
92 | 830 | 8/06/2009 |
106 | 686 | 14/06/2004 |
174 | 383 | l'08/06/2009 |
222 | 301 | 9/05/2005 |
309 | 215 | 06/06/2016 |
361 | 185 | 30/05/2006 |
602 | 112 | 28/05/2007 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have very low frequency, there might be a problem in preprocessing.
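The check described above can be sketched in a few lines of Python. This is only an illustration, not part of the report's tooling: the function name `flag_suspicious`, the `allowed` parameter, and the threshold `min_freq=5` are assumptions, since the report does not say what counts as "very low frequency".

```python
# Special characters listed in this subsection of the report.
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def flag_suspicious(words, allowed="", min_freq=5):
    """Return (word, freq, offending_chars) for words that contain a
    special character not in `allowed` and occur at least `min_freq` times.

    `allowed` and `min_freq` are hypothetical knobs: which characters are
    legitimate inside a word depends on the language, and the cut-off for
    "very low frequency" is a judgment call.
    """
    forbidden = SPECIAL_CHARS - set(allowed)
    hits = []
    for word, freq in words:
        bad = sorted(forbidden.intersection(word))
        if bad and freq >= min_freq:
            hits.append((word, freq, bad))
    return hits


# A few (word, frequency) pairs taken from the tables in this section.
sample = [("l'é", 20737), ("ab/km", 1955), ("10%", 8), ("Vi$$er", 1), ("H&M", 1)]

# Treat the apostrophe as a legitimate word-internal character.
for word, freq, bad in flag_suspicious(sample, allowed="'"):
    print(word, freq, bad)
```

With the apostrophe allowed, only `ab/km` and `10%` are reported from this sample; whether such entries indicate a preprocessing problem still has to be judged by hand.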
Example query for words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
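The same query can be repeated for each special character. The following is a minimal sketch using Python's `sqlite3` module with an in-memory stand-in for the `words` table; the helper names, the demo rows, and the added `ORDER BY w_id` are assumptions made for illustration. Note that SQLite needs an explicit `ESCAPE` clause, whereas the query above relies on the backslash being the default `LIKE` escape character.

```python
import sqlite3

# Special characters listed in this subsection of the report.
SPECIAL_CHARS = ",()%&$\"'+*=/_"


def like_pattern(ch):
    """Build a LIKE pattern matching words that contain `ch` literally."""
    # % and _ are LIKE wildcards; escape them (and the escape character).
    if ch in "%_\\":
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Mirror the report's query for one character.

    ORDER BY w_id is an addition (assumption: w_id follows frequency rank)
    so that the result order is deterministic.
    """
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def demo_connection():
    """In-memory table with a few rows copied from the tables above
    (w_id = rank + 100); a hypothetical stand-in for the real database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER, word TEXT, freq INTEGER)")
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [
            (103, "l'é", 20737),
            (147, "ab/km", 1955),
            (6735, "10%", 8),
            (6768, "70%", 8),
            (51010, "Vi$$er", 1),
        ],
    )
    return conn


if __name__ == "__main__":
    conn = demo_connection()
    for ch in SPECIAL_CHARS:
        rows = words_containing(conn, ch)
        if rows:
            print(f"Words containing {ch}:", rows)
```

Passing the pattern as a bound parameter avoids quoting problems for `"` and `'`, which would otherwise need escaping inside the SQL string itself.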
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots