A question by a member of the Latin Best Practices FB group prompted me to look into text coverage, which ultimately led me to comprehension. These are two ideas that a lot of people have misinterpreted, much like the “4%er” figure, and even “90% target language use.” I’m thinking people have a hard time with mathematical concepts, and maybe we should avoid percentages moving forward. But first, we should take care of what damage has already been done by looking at simple examples right away:
Text Coverage
Text coverage is measured by tokens. There are five tokens in the sentence “the bird sees the cat.” Two of the tokens in that sentence happen to be the same word. Therefore, “the” represents 40% text coverage. If the reader doesn’t know “the,” they have a text coverage of 60%. The reader who knows everything except “cat” would have a text coverage of 80%.
Comprehension
Comprehension is a different idea entirely. If the reader who doesn’t know “cat” were asked “what does the bird see?” and it were scored, they’d have a comprehension score of zero. If they were asked two questions about the bird, and two questions about the cat, their score would be 50% comprehension with their 80% coverage of the text. Not the same thing.
Reading
Laufer et al.’s research shows that learners need a text coverage—not comprehension—of 98% ideally to read with ease (and 99-100% whenever possible), but that’s just getting through the reading. That 98% figure is just the start of comprehension.
Hold up.
Yeah, that’s right. Knowing 98% of a text—STOP!!—Remember the first section on tokens. It’s not 98 out of 100 different words, but 98 of 100 tokens (i.e. some words probably repeat). So, knowing 98% of a text doesn’t even guarantee comprehension of what is read. That’s quite the trip, isn’t it? It gets worse when we look at some findings from one of Eric Herman’s Acquisition Classroom Memos on exactly how [in]comprehensible reading can get with what seems like decent text coverage.
There’s a lot in that chart, but compare the text coverage to comprehension scores. Even 95% text coverage can get woefully low comprehension (55%). Keep in mind that the higher scores are still in the “most” range, as in learners are understanding most of what they read when they know 95%+ of a text. Also, those vocabulary sizes are incredibly high for what the majority of K-12 teachers should expect from their students. Eric also adds some context to the research:
“For the most part, the above reading studies were done with high proficiency students, ungraded and academic texts, and count word families. A reasonable prediction is that even higher text coverage and vocabulary size numbers are required to enable adequate comprehension of graded texts by lower level proficiency students. And this is not considering levels necessary for a confident and pleasurable reading experience, which would undoubtedly be even higher!“
Higher would be 100%. Let’s make sure we set the record straight:
- Students need to know 98% of a text to read it with ease.
- Reading with ease from knowing 98% of a text can still result in much lower comprehension scores, like 70%.
- Coverage ≠ comprehension
Providing students with texts of 98%…even 100% coverage of known words is step zero. It’s actually the minimum hope we could have for students reading with ease with high levels of comprehension. It turns out that text coverage isn’t very important to look at, because even knowing 100% of the words doesn’t guarantee 100% comprehension. It all goes back to vocab as top priority, sheltering whenever possible so gradual exposure to new words increases vocabulary without the burden of incomprehension. What does this mean for class? Probably using even fewer words than you think! Students can’t magically learn thousands of words, so if we expect them to comprehend high levels of what they read—especially during any kind of independent reading—we must use and create texts with a very limited number of words.