Cliff Jones Jr.

Core Vocabulary Across Languages

What are the most basic concepts that virtually all languages express? Many linguists have put a great deal of effort into answering this question, and several short word lists have come out of that work. The most famous are probably the Swadesh lists, based mostly on intuition and refined over time. Later lists, such as the Leipzig-Jakarta list, have used more stringent methods to determine which vocabulary items are most resistant to borrowing and change over time.


The First 95 Items

Here, I’ve taken five such word lists (Swadesh 100, Ranked Swadesh 40, Swadesh-Yakhontov, Leipzig-Jakarta, and Woodward) and kept only the items that occur in at least two of them.
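For anyone who’d like to reproduce this filtering step, here’s a minimal Python sketch of the "at least two lists" rule. The function name `items_in_at_least` and the toy list contents are made up for illustration; they are not the real contents of the five lists. The sketch also assumes equivalent entries (say, "stone" in one list and "rock" in another) have already been merged under one label by hand.

```python
from collections import Counter

def items_in_at_least(lists, minimum=2):
    """Return the items that occur in at least `minimum` of the given lists."""
    counts = Counter()
    for items in lists:
        # set() so that a repeat within a single list only counts once
        counts.update(set(items))
    return {item for item, n in counts.items() if n >= minimum}

# Toy stand-ins for the five source lists (hypothetical contents).
toy_lists = {
    "Swadesh 100": ["water", "fire", "dog", "seed"],
    "Ranked Swadesh 40": ["water", "fire", "name"],
    "Swadesh-Yakhontov": ["water", "dog", "name"],
    "Leipzig-Jakarta": ["water", "fire", "ant"],
    "Woodward": ["water", "dog", "cat"],
}

core = items_in_at_least(toy_lists.values())
print(sorted(core))  # ['dog', 'fire', 'name', 'water']
```

In this toy run, "seed", "ant", and "cat" drop out because each one shows up in only a single list.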

Nouns

  1. name
  2. water
  3. blood
  4. fire
  5. stone/rock
  6. dog
  7. fish
  8. louse/flea
  9. hand/arm
  10. eye
  11. ear
  12. nose
  13. tongue
  14. tooth
  15. bone
  16. horn
  17. tail
  18. egg
  19. leaf
  20. night/evening
  21. star
  22. sun
  23. moon
  24. earth/soil
  25. salt
  26. mountain
  27. tree
  28. rain
  29. wind
  30. bird
  31. flesh/meat
  32. liver
  33. skin/hide
  34. knee
  35. breast/chest
  36. person/human
  37. man
  38. woman
  39. child
  40. hair/fur
  41. mouth
  42. neck
  43. foot/leg
  44. feather
  45. grease/fat
  46. smoke
  47. ash/soot
  48. sand
  49. wood
  50. root
  51. rope/cord
  52. path/road
  53. year

Verbs

  1. die
  2. see/look/watch
  3. hear/listen
  4. know
  5. drink
  6. give
  7. come
  8. stand
  9. sit/set
  10. lie/lay
  11. fly
  12. eat
  13. bite
  14. burn
  15. kill
  16. say/tell/speak/talk
  17. laugh

Adjectives

  1. new
  2. full
  3. good
  4. long
  5. red
  6. black
  7. white
  8. green
  9. yellow
  10. small/little
  11. big/large
  12. wide/broad
  13. heavy
  14. old
  15. dry

Other

  1. I/me
  2. you
  3. what/which
  4. who/whom
  5. one/a/an
  6. two
  7. not/no
  8. this/these
  9. we/us
  10. all/everything/everyone

You’ll notice that this list contains some very basic words but not necessarily the most frequent words, so it’s hardly an ideal place to start for language study. For the next phase of this project, I took word frequency into account to make the list more practical and well rounded.


The Next 105 Items

Since every language is different, with its own particular (often difficult to translate) grammatical words, I limited my analysis of word frequency to Mandarin, Spanish, and English. For each, I started with a reputable list of the language’s 100 most frequent words. Then, I compared it with another list generated from a corpus of subtitles from TV and movies. For a vocabulary item to make the cut, it had to appear in both lists. Once I had a list for each of these three languages, I threw out all the items that appeared in only one of the languages. What was left could tentatively be called the most frequently expressed concepts in three of the most widely spoken languages on the planet.
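The two-stage filter described above can be sketched roughly like this. Everything here is an assumption for illustration: the function name `shared_frequent_concepts`, the toy word lists, and the idea that each word has already been mapped by hand to a shared concept label (written in English here) so that entries can be compared across languages.

```python
from collections import Counter

def shared_frequent_concepts(reference, subtitles, minimum_languages=2):
    """Keep concepts on both lists within a language, then in enough languages."""
    language_counts = Counter()
    for language in reference:
        # Stage 1: a concept must be on both frequency lists for this language.
        in_both = set(reference[language]) & set(subtitles[language])
        language_counts.update(in_both)
    # Stage 2: drop concepts that made the cut in only one language.
    return {c for c, n in language_counts.items() if n >= minimum_languages}

# Toy data (hypothetical), already mapped to shared concept labels.
reference = {
    "Mandarin": ["be", "have", "not", "go"],
    "Spanish": ["be", "have", "and", "with"],
    "English": ["be", "and", "not", "time"],
}
subtitles = {
    "Mandarin": ["be", "have", "not", "want"],
    "Spanish": ["be", "have", "and", "want"],
    "English": ["be", "and", "not", "want"],
}

frequent = shared_frequent_concepts(reference, subtitles)
print(sorted(frequent))  # ['and', 'be', 'have', 'not']
```

Note that "want" never survives in this toy run: it is on every subtitle list but on none of the reference lists, so it fails the first stage in all three languages.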

Of course, there was some overlap with the original 95-item list (i.e., items 36, 55, 57, 69, 73, 86, 87, 88, 90, 92, 93, 94, and 95, counting straight through the nouns, verbs, adjectives, and other words above: person, see, know, say, good, I, you, what, one, not, this, we, and all), but most of the vocabulary items were new. I then took all the items that I’d filtered out of my first list for appearing in only one of the reference lists and added those back in, just to make sure all the really basic concepts were covered. Much to my delight, I wound up with 105 additional vocabulary items, bringing the total to an even 200.
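Here’s a rough sketch of how that combination step could look with Python sets. The function name `additional_items` and all three toy sets are hypothetical; `set_aside` stands in for the items dropped from the first list because they appeared in only one reference list.

```python
def additional_items(core, frequent, set_aside):
    """Return (overlap with the core list, items to add on top of it)."""
    overlap = frequent & core              # frequent concepts already on the core list
    new_frequent = frequent - core         # frequent concepts that are genuinely new
    additions = new_frequent | set_aside   # add the single-list items back in
    return overlap, additions

# Toy data (hypothetical), much smaller than the real lists.
core = {"water", "person", "know", "good"}
frequent = {"person", "know", "good", "be", "have", "and"}
set_aside = {"seed", "ant", "have"}

overlap, additions = additional_items(core, frequent, set_aside)
print(sorted(overlap))             # ['good', 'know', 'person']
print(sorted(additions))           # ['and', 'ant', 'be', 'have', 'seed']
print(len(core) + len(additions))  # 9
```

Using a set union means an item like "have", which turns up both as a frequent concept and as a set-aside item, is only counted once.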

Nouns

  1. time/instance/occurrence
  2. shade/shadow
  3. house/home
  4. head
  5. belly/abdomen
  6. navel/belly-button
  7. heart/core
  8. back
  9. thigh
  10. wing
  11. nail/claw
  12. animal/beast
  13. ant
  14. cat
  15. pig
  16. snake
  17. worm
  18. parent/father/mother
  19. sibling/brother/sister
  20. spouse/husband/wife
  21. day
  22. cloud
  23. snow
  24. ice
  25. river/stream
  26. sea/ocean
  27. seed
  28. grass/lawn
  29. flower/blossom
  30. bark/husk

Verbs

  1. do/make
  2. be
  3. become/get
  4. have
  5. want
  6. can/be-able-to
  7. think/consider
  8. go
  9. walk/run
  10. take
  11. carry/wear
  12. tie/bind
  13. hide
  14. fall
  15. cry/weep
  16. blow
  17. suck
  18. hit/beat
  19. crush/grind
  20. live
  21. sleep
  22. work
  23. play
  24. swim
  25. hunt
  26. dance
  27. sing
  28. count
  29. vomit

Adjectives

  1. right/correct/proper
  2. bad/wrong
  3. far/distant
  4. hard (not soft)
  5. thick
  6. thin
  7. narrow
  8. sweet
  9. bitter
  10. hot
  11. warm
  12. cold
  13. wet
  14. smooth
  15. sharp
  16. dull (not sharp)
  17. dirty
  18. short
  19. round

Other

  1. it/he/him/she/her/they/them/the
  2. itself/oneself/himself/herself/-self
  3. that/those
  4. other/another/others
  5. no/none/no-one
  6. and
  7. or
  8. but/yet/except
  9. at/in/on
  10. to/toward
  11. of/from/’s (possessive)
  12. by/through/per/for/because
  13. with
  14. over/above/on
  15. for/in-order-to
  16. so/then (effect)
  17. then/right-away (sequence)
  18. now/still/doing (ongoing)
  19. did/done (past)
  20. will/be-going-to (future)
  21. much/many
  22. very/really
  23. where
  24. when
  25. how/like/as
  26. if/whether
  27. yesterday

The List in Other Languages

I originally posted this list on the now-defunct Duolingo discussion forums back in 2014. I then posted some translations and asked for help translating the items into different languages. Here’s what we have so far:

Contributing

If you’d like to help with this project by offering feedback on my translations or translating the list into a new language, please hit me up on social media. I’d love to keep this list growing.