I have been using the Routledge Frequency dictionaries, which you can see in the picture above (the Spanish one proudly bearing the sticky note which marks the end of the numerical list and the beginning of the alphabetical list).
Some people were asking how these frequency lists were arrived at. The preface to the Spanish version (the one I have used the most) gives as sources for the Routledge lists:
- the corpus, of 20 million words from fiction and non-fiction texts, the internet and transcripts of spoken language
- words from all 21 Spanish-speaking countries
- recent language (the words for this edition were collected in 2014-15)
Most telling, I think, are the subtitles of the dictionaries. The Spanish one is A Frequency Dictionary of Spanish: Core Vocabulary for Learners", thus acknowledging that learners and users will want to use other words outside this list of 5000 to say what they want to say, depending on their interests.
As well as the main numerical and alphabetical lists of the highest frequency 5000 words in the language, the dictionaries also contain some useful lists of different groups of words, such as this list of French animals:
Animals are of great interest to primary children, yet many of the animal words that we teach are relatively low frequency and indeed outside the top 5000. They are usually accompanied in speech and writing, however, by much higher frequency words, such as determiners and key verb forms.
I have also been seeing some interesting links between word frequency and culture.
No comments:
Post a Comment