In its attempt to collect all of the information in the known universe, Google (somewhat secretly) began working on a project in 2002. The intention was to make books searchable, so that people could find what they were looking for. Like many of Google’s informational goals, it sounds like a noble one; however, it’s much more complicated than it first seems.
Some of the features of Google books include:
- Google search of contents of books
- Browse books by subject
- Log in to review books, mark favorites, create bookshelves and read/to-read lists (much like LibraryThing or GoodReads)
You may initially think how could this possibly be bad? I’ll admit that there have been plenty of times in my life that I’ve found myself flipping through an entire book, scanning for a specific passage that I vaguely remembered, but hadn’t marked. At those moments, I would have given just about anything to have been able to search the book like that. This, of course, is not the major issue. One big problem with Google books is copyright. While there is certainly no shortage of volumes that have passed into public domain, a huge portion of the books that Google would like to digitize are still subject to copyright. The rights to these books are still owned by their publishers, and in order to display them, Google first has to get permission. As a result, only portions of them are available—what publisher is going to allow free online access to an item they intend to sell? Some publishers *coughHarperCollinscough* don’t even like that libraries allow free access to their titles—they’ve actually capped the number of loans allowed on their eBooks, after which the library must re-purchase the title. Having only certain pages available, or none at all, severely limits the scope of the Google books project.
As a partial remedy for this issue, Google books calls the limited available pages of copyrighted items “previews,” and allows its users to follow a link to purchase the actual item. However, I feel that this makes them less of an information portal than a sales outlet for publishers and retail bookstores—a huge problem in the “free access to information” department. Perhaps if they included a link to WorldCat, people using Google books could find the nearest library that owned the item instead of being told that the only way to obtain access is by purchasing it.
Another issue to consider is the fact that Google books is a “metadata train wreck”. There are any number of sources that illuminate just how horrid, inconsistent, and inaccurate Google’s metadata records can be. There are books on the Internet dated 1900 and books supposedly written before the author’s birth, and that’s not to mention the scanning errors—pages that are upside-down or backwards, etc.
While quite a few libraries have partnered with Google for this project, many—including the Library of Congress—have remained wary, whether for copyright or metadata reasons. Although I am entirely in favor of the idea of such dramatically increased access to books and information, if it can’t be done in a consistent, accurate manner without breaking any laws, it may end up doing more harm than good.