June 2, 2009

Digital library/search engine created

U.S. researchers have created a digital library and search engine that now holds more than 1 million journal articles and can be easily accessed by anyone.

Pennsylvania State University scientists said the library, called CiteSeerX, is based in the university's College of Information Sciences and Technology. It is designed to enhance the dissemination of scientific literature by making papers and other documents easier to locate online, the school said.

The library provides resources such as algorithms, data, metadata, services, techniques and software that are transferable to other digital libraries, officials said. It also has the capability to search tables and is based on open-source software, meaning it can be adapted as needed to fit a user's requirements.

We won't keep it to ourselves, said Professor C. Lee Giles, who developed the technology. We'll give it to other people and they can build similar systems. Because it's modular, it can be changed to meet their needs.

Another feature, called MyCiteSeerX, is a customizable personal space where individual users can do tagging, make corrections, create a collection and monitor paper updates.

CiteSeerX was funded by the National Science Foundation, the Microsoft Corp., the National Aeronautics and Space Administration and the university.

The library is available at http://citeseerx.ist.psu.edu.