September 10, 2008
Google Digitizes Newspaper Archives: Online Project Lets People Search and View Original Pages
By Miguel Helft
Brad Stone contributed reporting.
Google has begun scanning microfilm from some newspapers' historical archives to make them searchable online, first through Google News and eventually on the newspapers' own Web sites, the company said.
The new program, announced Monday, expands a two-year-old service that allows Google News users to search the archives of some major U.S. newspapers and magazines that were already available in digital form, including The New York Times, whose global edition is the International Herald Tribune, as well as The Washington Post and Time. Readers will be able to search the archives using keywords and view articles as they originally appeared on the printed page.
Under the expanded program, Google will shoulder the cost of digitizing newspaper archives, much as the company does with its book-scanning project. Google angered some book publishers because it had failed to seek permission to scan books that were protected by copyright. For the newspaper program, it will obtain permission from publishers.
Google, based in Mountain View, California, will place advertisements alongside search results and share the revenue with the publishers.
"This is really good for newspapers because we are going to be bringing online an old generation of contributions from journalists, as well as widening the reader base of news archives," said Marissa Mayer, Google's vice president for search products and user experience.
But many newspaper publishers view Google and other search engines as threats to their business. And those that see their archives as a potential source of revenue might not hand them over to Google.
"The concern is that Google, in making all of the past newspaper content available, can greatly commoditize that content, just like news portals have commoditized current news content," said Ken Doctor, an analyst with Outsell, a research company.
Google said it was working with more than 100 newspapers and with partners like Heritage Microfilm and ProQuest, which aggregate historical newspaper archives in microfilm. It has already scanned millions of articles.
Other companies are already working with newspapers to digitize archives and some sell those archives to schools, libraries and other institutions, helping newspapers earn money from their historical content.
The National Digital Newspaper Program, a joint program of the National Endowment for the Humanities and the Library of Congress, is creating a digital archive of historically significant newspapers in the United States from 1836 to 1922. It will be on the Internet; material published before 1923 is no longer protected by copyright.
Newspapers that are participating in the Google program say it is attractive.
Pierre Little, publisher of The Quebec Chronicle-Telegraph, which has been published since 1764 and calls itself "North America's Oldest Newspaper," said many readers visit the newspaper's Web site to look for obituaries and conduct research on their ancestors.
Tim Rozgonyi, research editor at The St. Petersburg Times in Florida, said that years ago it had looked at digitizing its archives.
"It appeared to be exceedingly costly," he said. "We wouldn't be talking about digitization if Google had not entered this arena."
The newspaper might be able to generate additional revenue from the digital archives by producing historical booklets or commemorative front pages. But Rozgonyi said that increasing sales was not the primary objective of the digitization program.
"Getting the digitized content available is a wonderful thing for people of this area," he said. "They'll be able to go to our site or Google's and tap into 100 years of history."
Originally published by The New York Times Media Group.
(c) 2008 International Herald Tribune. Provided by ProQuest LLC. All rights reserved.