CiteSeer-API: towards seamless resource location and interlinking for digital libraries

We introduce CiteSeer-API, a public API to CiteSeer-like services. CiteSeer-API is SOAP/WSDL-based and gives programmatic access to the functionality specific to CiteSeer services, including full-text search of documents and citations and citation-based document discovery. To enable interoperability and interlinking with arbitrary software agents and digital library systems, CiteSeer-API uses digital content signatures to create system-independent handles for the Document, Citation and Group resources of CiteSeer servers. We discuss specific functionalities of CiteSeer-API that take advantage of these handles to enable seamless location of CiteSeer resources. Finally, we argue that the digital signature scheme used by CiteSeer-API is well suited to the creation of machine-usable semantic descriptions of digital library services, which is key to the seamless discovery and integration of services such as CiteSeer-API. CiteSeer-API is currently showcased on CiteSeer.IST, the CiteSeer server of the School of Information Science and Technology at the Pennsylvania State University.
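
As a rough illustration of how a content signature can serve as a system-independent handle, the sketch below hashes a resource's bytes and prefixes the digest with a resource type. This is not CiteSeer-API's actual scheme: the use of SHA-1, the `type:digest` layout, and the function name `content_handle` are all assumptions made for the example.

```python
import hashlib


def content_handle(resource_type: str, content: bytes) -> str:
    """Build a hypothetical handle from a resource type and a content digest.

    Assumption: SHA-1 over the raw bytes; the real signature scheme and
    handle format used by CiteSeer-API may differ.
    """
    digest = hashlib.sha1(content).hexdigest()
    return f"{resource_type}:{digest}"


# Two servers holding the same bytes derive the same handle independently,
# without sharing any local identifiers.
doc_bytes = b"Example document body"
handle_a = content_handle("document", doc_bytes)
handle_b = content_handle("document", doc_bytes)
handle_other = content_handle("document", b"A different document body")
```

Because the handle depends only on the content, any agent holding a copy of the resource can compute it and use it to locate the same resource on another server.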
