Analyzing and Comparing Books
I have just noticed that Amazon has a “Text Stats” section on its book pages. I’m not sure how long it has been there, but it’s very interesting.
- The Fog Index was developed by Robert Gunning. It indicates the number of years of formal education required to read and understand a passage of text.
- The Flesch Index, developed in 1940 by Dr. Rudolph Flesch, is another indicator of reading ease. The score returned is based on a 100 point scale, with 100 being easiest to read. Scores between 90 and 100 are appropriate for 5th and 6th graders, while a college degree is considered necessary to understand text with a score between 0 and 30.
- The Flesch-Kincaid Index is a refinement to the Flesch Index that tries to relate the score to a U.S. grade level. For example, text with a Flesch-Kincaid score of 10.1 would be considered suitable for someone with a 10th grade or higher reading level.
Information Technology & Book Writing
I wonder how long before publishers and writers start to use this data to better zero in on certain targets in the hope of better reaching their target demographics. I’m sure that someday – if it hasn’t already happened – writers will get notes from editors asking them to “bring the Fog Index rating of their manuscript down by at least 20%” or “reduce the number of complex words by 10%”, all based on statistical analysis of the composition of recent best sellers.
A kind of Search Engine Optimization (SEO) for books, in a way.
Like all tools, it could be abused and lead to bad results. But if used properly, it could result in more readable books and reduce the variability in quality output between individual editors (probably not by much, but any improvement would be welcome).
The pic on top of this post is from the Amazon page for I Am A Strange Loop by Douglas R. Hofstadter (which I’m currently reading).
Here’s an excerpt that I liked:
Update: Just to make it clear, this post isn’t an ad for I Am A Strange Loop. It’s just the book I looked up on Amazon when I noticed the Amazon Text Stats feature, and I thought some people might be curious to know which book the Text Stats in the screenshot came from.
If you liked this post, please consider subscribing to my RSS feed. Thanks.