Tag Archives: extensive reading

I succeeded in acquiring more classic French readers recently. One of my new favourites is Dantès from Otto Bond’s Basic French Readings alternate series. It is a simplified extract from Dumas’s Comte de Monte Cristo. The story starts with an assumed knowledge of 97 frequent words, much like Sept-d’un-coup by the same publisher, but succeeds in having a higher proportion of cognates, leading to an impressive expected vocabulary for 95% coverage of 316 (based on my word list). This makes it the lowest I’ve seen so far, apart from my own series.

However, the important thing is also whether it was an enjoyable read. I definitely got hooked on the story, and then all of a sudden the extract ends, and I’m left wanting to read the rest of the story. That can only be a good thing.

I was less captured by the remaining stories in the five-story volume, but still enjoyed most of them.

Regarding the match between publicised vocabulary sizes of graded readers and the reality of reading them, I can say from my cursory investigations that there is not always a good match between the two. Perhaps it averages out across the books, as I only take the first chunk of text for my comparisons, but if the first few paragraphs are too challenging, then a language learner may lose interest.

I’ve developed a new estimate of readability now, which is more complex than ones I’ve previously used and seems to match the foreign language learner’s experience reasonably well. Based on this, and my more recent acquisitions, I now recommend the following as first reads for French beginners with English-speaking background.

Young children: Luc et Sophie series, or Bonjour Berthe, which I find more entertaining. Le Petit Napoléon series is also quite good, and suitable for all ages, for those who like cats.

Teenagers: I quite like the Teen Readers series. Catastrophe au Camping des Roses is rated as a vocabulary of 300 words, and my estimate has the 95% coverage vocabulary at 2421, which isn’t too bad. But Dantès mentioned above is easier vocabulary-wise.

Adults: Dantès is my current favourite as a first read. Becky Tucker’s Histoires pour les grandes appears to be easy, but I haven’t read enough to know whether it is interesting. I have yet to rate other ebooks.

However, the only stories that you can read immediately in French without having studied it is the Gnomeville series. There are some minor issues with it though, as have been brought to my attention recently. There are places where I have used “de la Fantasia” that should be “de Fantasia”. I was uncertain of the rule for this, but now I have discovered it. Mostly “de” is used with a country, but “de la” is used in expressions that have a temporal sense to them, such as “le gouvernement de la France” (since governments are not permanent), or if there is an adjective applied: l’Histoire de France but l’Histoire économique de la France. Very subtle indeed and I hope I can be forgiven for getting that subtlety wrong in my comic. I intend to make a second edition of Episode 2 at some point to rectify this. Another error in Episode 2 is the use of the verb “voyage” combined with “à” (“voyage à la Place des Roses”). Voyage doesn’t get used this way and “à” should probably be “vers” to communicate this idea. The sentence will be removed from the second edition.

In other news I attended the Applied Linguistics Association of Australia 2018 conference a couple of weeks ago. It was very inspiring, and also emphasised that the important thing with language acquisition is communication, not perfection. Perfection is unlikely to be achieved, but improvement is always possible. So let’s keep improving our language skills. Read, listen, write and speak. With practice comes improvement. Until next time.

Since my last graded reader update I’ve looked at a few more books, some of which are “classics”, in the sense they were from the “direct reading” era of the first half of the twentieth century, following the influence of Michael West’s constrained vocabulary for language teaching, the various word and idiom frequency lists created at the time, and the idea of readability. Some of these books I had already acquired earlier; but through reading some papers published at that time, I was able to compile a shopping list of other books written according to the same philosophy.

As a result, I have a new winner in terms of expected vocabulary size at the 95% threshold of reading comfort. A New French Reader by Ford and Hicks received a 95% vocabulary size of 3532, and Otto Bond’s Sept-d’un-Coup was a close second, with 3650. Bond’s book starts with a much smaller initial assumed vocabulary (97 words) than the Ford and Hicks book (523), so Bond’s book may be a better first read despite the slightly higher vocabulary score here. As seen in my first post on expected vocabulary size for 95% coverage, these are much higher scores than my Gnomeville comics, as my comics take readability criteria to the extreme.

So based on the current stats available on vocabulary, I recommend the following first graded readers for English speakers learning French.

Something I have been pondering lately is the enormous vocabulary load that occurs in some graded readers that are intended for beginners. The grammar is simple but the vocabulary load is huge. Sure, things are often glossed, which speeds up the process of finding out the meaning of words, but it still prevents fluent reading.

When I first found booklets from the Bibliobus series published by Mary Glasgow, I thought they were wonderful. I only had 3 of them, at levels 6 and 8. I also loved Le Chapeau Rouge released by the same publisher. I’ve since collected more Bibliobus stories, and also acquired a collection of Lire Davantage booklets. What is clear to me, and I have been reminded of it by a friend who has been reading them lately, is that there is not much text but a lot of vocabulary load. The stories do vary a little in terms of quality and difficulty within the published levels, so some are probably of greater value than others.

In theory, these high vocabulary load stories provide language exposure that will increase a learner’s skill, since there are things that are unknown. Provided the learner can read them quickly they have some value for extensive reading. However, if there is a choice between another story with lower vocabulary load, more text and a smoother gain in vocabulary, then that would be better. It’s all down to availability. However, ultimately what matters is whether the story appeals to you enough that you are keen to read it. If not, it is best to find something else to read. As long as you are reading at least 10 minutes per day at a level that allows you to fluently read, follow the story, but not already know all the language that you encounter then you will improve your language knowledge.

Here are a couple more books/stories I’ve analysed for general vocabulary size at the ~95% cut-off, based on the first 100 words. Note that just using this figure in isolation is a bit misleading, because books like the one by Ford and Hicks use a lot of repetition and a relatively small vocabulary overall, making it possible to learn relatively easily. It just isn’t necessarily all highly frequent vocabulary. This is where vocabulary density is also a useful guide, so I’ve included this figure as well.
If the 95% general vocabulary size is high and the vocabulary density is high, it means that you may be able to read comfortably, learning the vocabulary of the given text, but its relative usefulness will depend on whether it matches the vocabulary that you need for your language goals.

Title

Author

Publisher/Series

Gen Vocab Size at 95%

Vocab Density at (n) words

Reading approach to French

Ford and Hicks

J.M. Dent and Sons (Canada) Ltd.

12,059

2.77 (122)

Le Visiteur

Sue Finnie

Mary Glasgow/Bibliobus

11,260

2.05 (123)

This follows on from my previous table of figures. When using general vocabulary rank frequency lists it certainly seems normal for graded readers to effectively use a very wide vocabulary, leading to an expected general (raw) vocabulary size of 4,000 to 12,000. To do something considerably less requires careful vocabulary control, such as occurs in my Gnomeville series, which achieves this through exclusively using French-English cognates and the most frequently occurring words. Initially it may seem a little artificial, but becomes more natural and flowing as the stories progress. A similar approach is used in Si Nous Lisions, in that a very small vocabulary is used initially, and then a new word is added every 90 or so words. While I came up with the idea independently, the concept of vocabulary control is attributed to Michael West.

At some point I’ll publish a comprehensive list of graded readers with these statistics, but I’ll first need to automate the process a bit more and get rid of a few bugs. Meanwhile, let’s keep reading at least 10 minutes a day of easy but not too easy text in the languages we want to learn.

I’ve been tinkering with ways of comparing different easy readers for language learners. Previous posts I’ve used a type-token ratio or vocabulary density, which gives some idea of how likely it is you might learn new words through repetition from a text. But for something to be readable, the general consensus is that you need to know at least 95% of the words that you read. This is a level that allows people to guess the meaning of the words they don’t know.

So something I’ve been messing with recently is predicting the general vocabulary size needed for different beginner stories in French, assuming people know all cognates and all proper nouns. I’ve only been working with short samples of text so far, and there are many other assumptions and issues that make it not a perfect comparison – including bugs in my code…

Given a small set of extracts, and assuming you don’t learn the words via their introduction one at a time, as in my comic books, we have the following:

Note that this vocabulary size assumes that each conjugation of verbs is a separate vocabulary item, as are plurals etc. so will be much larger than word family figures normally used.

You can see that the one text written for native French speaking children (Martine) has a much richer vocabulary than the texts written for language learners. The figures for these look worse than they seem, because there are many words that are typically taught early to allow conversation, but which feature much lower on word frequency lists. For example, “maman” was at rank 6163 in my list. In contrast, my Gnomeville comics are designed to prioritise frequent words and cognates to optimally improve reading, at the expense of conversation. Hence the very small vocabulary sizes required.

Recently I’ve been reading a 1939 paper by Tharp that looked at measuring vocabulary difficulty. He appears to have had similar ideas about measuring vocabulary load based on the general frequency of the words, as well as a measure of density of difficulty words. I also recently acquired yet another very early graded reader, “Si nous lisions”, from 1930, which attempted to introduce new words every ~60 running words, in the style of Michael West, who seems to have been the first to use the approach. However, I have a graded reader published in 1909 in my collection, which was intended for “rapid reading”, and was part of a series that commenced with short easy texts. I’m not sure if they methodically introduced words at specific intervals as was done by West and others following his example.

In searches on-line, I found a French adapted reader from 1790, so we’ve been at it for quite a while. I’d like to say we know more about how to write graded readers these days, but I think West had it fairly right. The only thing we can do now is make them more interesting and relevant.

Anyway, I’ll finish off here with the usual things: we need 95% coverage to read comfortably (on average). To do that with native texts requires quite a large vocabulary. But vocabulary increases as you read more. So we should read as much as possible at the level that is right for us and of reading material that interests and motivates us. My Gnomeville comics are ideal first readers in French for those with an English language background and a good vocabulary in English. The Berthe and Luc et Sophie series are reasonable alternatives for children that are possibly too young for Gnomeville, as are the ELI A0 series. Until next time…

As someone who reads and writes books for language acquisition via extensive reading, I look with curiosity at any that are available. Today I came across books by Sylvie Lainé on Amazon.

The books aim to support language acquisition via extensive reading, and provide extensive glosses to prevent the need for consulting a dictionary. The stories are written by a native French speaker who has studied applied linguistics. Her biography on Amazon seems to indicate that she is unaware of the long history of graded readers for language learners, including those in French, published by both English publishers, such as Oxford University Press, as well as French publishers, such as Hachette. Or perhaps she was only referring to those available as ebooks when saying there were no suitable stories available. Nevertheless, it is great that she has published stories for learners of French. Given the amount of reading a learner should do, the more good stories of low to moderate difficulty available the better.

My assessment in this review is about the suitability of these stories for extensive reading. For this purpose, there should be at least 95% vocabulary coverage for fluent reading. That’s only 1 in 20 words that should be unfamiliar. This is difficult to do well for beginner readers. It took me years to come up with the initial story for Gnomeville, which takes a learner from zero French to a small vocabulary of common words and French-English cognates. Episode 1 of the comic doesn’t achieve the 95% coverage figure consistently until page 12 of the story, but maintains a 1 new word per page until then, with illustrations providing essential parts of the story. So the first few pages could be classed as intensive reading, rather than extensive. After that, and for all future episodes, the story has the 95% coverage level consistently, allowing people to practise guessing the meanings of words, with a reasonable chance of success, and still having a gloss to check guesses, for optimal vocabulary retention.

Let’s see how Sylvie Lainé‘s stories do. For this study I have assumed that the unglossed vocabulary is known, and that glossed vocabulary is unknown. To be generous in the analysis, I assume that a glossed multi-word expression is a single unknown vocabulary item, but still count the individual words of the expression when determining the total number of words read. This is a manual calculation, so I am using the first and last chunk of the free sample, where a “chunk” is a section of text that is followed by a set of glossed words. I do both of these chunks, as it is possible that the known vocabulary would be greater as the story progresses, as is the case with Gnomeville.

First up is Voyage en France. The first chunk has 16 glossed words out of 74, making a coverage of only 78%, which is very low, even when reading stories written for native speakers as a foreign language learner with a vocabulary size of 1000.

The second chunk is much better, having 92% coverage, but still a bit low.

Second story is Le Pendentif, which is actually the first volume in the series. This is truly simple French, and if we make the same assumptions of vocabulary knowledge as above, has only 2 glossed words for 74 words, giving a 97% coverage, which is perfect for extensive reading. The final chunk is a little low at 3 glossed out of 38, making 92% coverage, but overall, this story does seem to be a suitable beginner story.

Voyage à Marseille goes from 93% coverage to 91% at the end. Based on these three stories, I’d say that you can expect an average of 92% vocabulary coverage from the series, assuming your knowledge matches the assumption of the author. Comments from reviewers indicate that people reading these stories still need to consult a dictionary for some words, so these coverage figures are an over-estimate for some people.

So in summary, I believe that Le Pendentif (t. 1) should be a fairly comfortable read for beginners after a few lessons of French, and is good for those who are after stories written in authentic French for adult beginners. The other stories would be slow reads due to the vocabulary load, despite the glosses, so it would be better to come back to the series after reading other stories with a better-matched vocabulary level. Apart from my own comic that assumes zero vocabulary, there are series that go from vocabularies of 150 words onwards. Looking at my database of graded readers for French I can see that most beginner ones are indeed targeting children or adolescents. If you can’t face those (I quite enjoy some of them), there are some excellent simplified classics published by CLE International for vocabularies of 500 or more. Two of my favourites so far are Jacquou le Croquant (vocabulary = 600) and En Famille (vocabulary = 500).

A disclaimer for my review of these stories is that I haven’t read beyond the preview chunks available on Amazon, so I can’t comment on how they progress, or how entertaining they are. If you find them very interesting, then you may have the willpower to read on despite the vocabulary load. The Amazon reviews are mostly favorable.

People sometimes say “you can only have two” of the three for things. I think the original was probably the [good:cheap:fast] for projects, and I’ve seen [good grades:social life:enough sleep] for college life. I’ve also seen the somewhat dodgy [pretty:intelligent:sane] for girlfriends. The equivalent for graded readers seems to be: [easy:authentic:interesting]. The more authentic it is, the less easy it will be. The easier it is, the less interesting it’s likely to be. The more interesting, the more difficult, etc. So as authors of graded readers, we try to balance these three things. I also try to include a fourth element, related to “easy”, and that is, perhaps “effective”, in that I try to ensure that the stories are constructed in a way that ensures vocabulary growth, by using all the outcomes from extensive reading research:

95%+ vocabulary coverage,

focus on very frequent words (eg. “le”/”the”) to give best coverage sooner,

repetition of new vocabulary,

glosses,

images to enhance recall,

high interest story (I hope).

Anyway, that’s all for this review. I recommend Le Pendentif, based on extensive reading criteria. The others may be useful for more advanced learners.

I’ve been reading some more research on extensive reading and readability lately. One paper showed gains in reading rate, vocabulary and comprehension with students reading about 150K words over 15 weeks at an intermediate level. This was contrasted with another study where learners read ~65K words over 28 weeks and failed to show improvement. I think there is probably a threshold of some kind where you need to read a certain amount per week to improve language skill. The amount probably varies with the level of skill you already have. Someone still improving their knowledge of the most frequent 400 words of the language will not need to read as much to achieve vocabulary gain (assuming appropriate graded readers) as someone reading at the 2000 word level. The study that showed gains had students reading with vocabularies of 800+.

Given the 10K words per week guide, and the typical reading rate in foreign languages often being around 150 words per minute, that equates to about an hour of reading per week, or 10 minutes a day. That’s not a bad aim for maintaining and hopefully improving your language skills.

Three distinct sections in this reader, at different levels of difficulty.
1. Beginner French, with very simple grammar, but school vocabulary assumed. Progresses through the chapters. Not overly interesting.
2. History. Written in present tense. I enjoyed reading about the ancient history more than the modern. I had read some of these before in Roussy de Sales’s earlier publications, where these were separate books. Again, there is quite a bit of vocabulary here.
3. Famous short stories. These include perfect and imperfect tense, so grammatically suitable for the intermediate student. For some reason I don’t really enjoy these stories, though I think I understood more of them in my most recent reading than when I read them over 10 years ago in other editions.
There is still quite a vocabulary burden when reading these, so their suitability will depend on how comfortable people are with unknown words, and the size of their current vocabulary.

Further info on an extract of the text.

Chapter 1 is 87 words (tokens) and 43 distinct words (types), which makes a type-token ratio of 0.49, which is suitably low for beginners. This compares favourably with other beginner stories, like Bonjour Berthe, and Gnomeville Episodes 1 and 2, but is aimed at an older audience.

Chapter 1 gives a reasonable amount of repetition for de, est and il. Other words would need to be encountered more frequently to be acquired via reading.

In summary, it is good that these stories are still available, as they certainly have their place for French extensive reading.

I sold the first “I can’t believe I’m reading French” Gnomeville comic that I listed on ebay last week, and I’ve decided it’s worth putting my comics up there to provide somewhere for people to buy them easily until I move toward having my on-line shop. Currently sales are a little too low to warrant having a shop front, but it will come. So far I’ve sold about 20 comics, and given away 14 ebook issues, but things are on the increase.

Here are a couple of photos of a comic book page spread in Episodes 1 and 2.