← All posts

How many words to read a novel.

The honest numbers — for Spanish, French, German, Japanese, and Chinese. And what those numbers actually mean.

People ask this all the time. The honest answer involves nuance. The shorter version: less than you think for casual fiction, more than you think for literary fiction, and the gap between "vocabulary you can recognize" and "vocabulary you need to read fluently" is bigger than you'd guess.

The basic numbers

Reading research generally finds that to read a text comfortably — that is, to follow what's happening without constant lookups — you need to know around 95-98% of the words on the page. Not 80%. Not 90%. 95% or better.

That sounds like a tiny gap, but it's not. At 90% known, you're stopping or guessing every 10 words. At 95%, every 20 words. At 98%, every 50. The reading experience is wildly different.

How many words is 95-98% in practice?

For most European languages, by the time you know around 5,000-7,000 word families, you can read most casual modern fiction at 95%+. Word "families" means base forms (run, ran, runs, running all count as one). For literary fiction or classics, you need more like 8,000-10,000.

Specific languages

  • Spanish: 5,000-6,000 word families gets you most modern fiction. ~10,000 for literary like García Márquez.
  • French: Similar — 5,000-7,000 for modern, more for literary. French literary tradition is older and ranges further.
  • German: 6,000-8,000 — German has a lot of compound nouns that are technically separate words but built from familiar parts.
  • Italian: 5,000-6,000 for modern.
  • Portuguese: Similar to Spanish.
  • Japanese: Roughly 8,000-10,000 vocab items + ~1,500-2,000 kanji for unrestricted novel reading. Less for manga and YA.
  • Chinese: 5,000-6,000 characters + 30,000-40,000 vocabulary items for serious literary reading. Much less for newspapers (~3,000 characters covers most).

These numbers come from corpus studies — counting how words distribute across actual texts.

Why the numbers feel intimidating

Because they sound enormous. But Zipf's law saves you: word frequency follows a power law. The top 1,000 words in any language cover 70-80% of any text. The top 5,000 cover 95%. So you don't need to learn 50,000 words. You need to learn the right 5,000.

The good news: those right 5,000 are exactly the ones you'll meet over and over in real reading. You don't need a frequency list. You just need to read enough text to keep meeting them.

The gap between "knowing" and "fluent"

Here's where most people stumble. Knowing a word on a flashcard is not the same as recognizing it in context fast enough not to lose the sentence. Fluent reading requires automatic recognition — you see the word, you know the meaning, in milliseconds.

This is exactly what extensive reading builds. Each repetition in real text drives the recognition closer to automatic. By the fifth time you've seen arrebatar in a Spanish novel, you don't need to think about it.

What this means for picking a first book

If you have around 3,000-4,000 word families, your "comfortable" reading is going to be at the simpler end of YA, contemporary literary fiction, or anything translated from a simpler-language original. You'll be tapping every paragraph but you'll keep up.

If you have around 5,000-6,000, most modern fiction is open to you. You'll tap a few times per page.

If you have around 8,000+, classics and literary fiction become realistic. You'll still meet new words, but they'll be literary ones, not basic vocabulary.

The single fastest way to get from 3,000 to 8,000 is to read books at your current level and let the next 5,000 accrue from them. Don't pre-load with frequency lists. Read.

Reading is the test and the practice

The numbers are useful as a sanity check, not as a target. If you're tapping every line, you're below the comfort threshold. If you're tapping every five pages, you're at it. If you're never tapping, you've drifted past it and the book is too easy.

For more on how vocabulary actually accumulates in this process, see the complete guide. The post on comprehensible input explains why context-based exposure works so much better than memorization for this.

Less than you think for the right book. More than you think for the wrong one. Either way: read.

Try it on a real book.

Free to download, free to try. iPhone & iPad, 14 languages.

Download on App Store
Next: Free EPUB Spanish classics → All posts