Open Access News

News from the open access movement


Wednesday, July 27, 2005

Compact OAI harvester from OCLC

How short can an OAI-compliant metadata harvester be? For the apparent record-holder, see OCLC's 2PageOAI. From the site:
"Amazing! Simply, bloody amazing!!" –Art Rhyno, University of Windsor.

These Python scripts are a demonstration of how short a compliant OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) implementation can be. The following files are available: [1] The harvester is a one-page OAI client that can download metadata from an OAI repository and put it into a file suitable for the repository. [2] The repository is a two-page OAI server that can read in an XML file, such as that written by the harvester above, and make its records harvestable in turn. [3] There is a short readme file explaining how to use the harvester and repository programs. These programs have been tested with Python 2.2.2 and 2.2.3, but should work with any 2.2 or later Python. They are completely self-contained, using standard Python XML and HTTP libraries. No additional libraries beyond those included in the standard Python distribution are needed. The Web server used by the repository is 'built-in'.
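To give a sense of why such a harvester can fit on a page, here is a minimal sketch of an OAI-PMH harvesting loop. It is not OCLC's code: it is written for current Python 3 rather than Python 2.2, and the function names (build_url, parse_page, fetch, harvest) are hypothetical ones chosen for illustration. It assumes the repository supports the oai_dc metadata prefix and collects only record identifiers, following resumption tokens until the repository reports no more pages.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Namespace used by OAI-PMH 2.0 response elements.
OAI_NS = "{http://www.openarchives.org/OAI/2.0/}"


def build_url(base_url, token=None, metadata_prefix="oai_dc"):
    """Build a ListRecords request URL (hypothetical helper).

    When a resumption token is supplied it is the only argument sent
    besides the verb, as the protocol requires.
    """
    if token:
        params = {"verb": "ListRecords", "resumptionToken": token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urllib.parse.urlencode(params)


def parse_page(xml_text):
    """Return (record identifiers, resumption token or None) for one page."""
    root = ET.fromstring(xml_text)
    identifiers = []
    for header in root.iter(OAI_NS + "header"):
        ident = header.find(OAI_NS + "identifier")
        if ident is not None and ident.text:
            identifiers.append(ident.text.strip())
    token_el = root.find(".//" + OAI_NS + "resumptionToken")
    token = None
    if token_el is not None and token_el.text and token_el.text.strip():
        token = token_el.text.strip()
    return identifiers, token


def fetch(url):
    """Download one response body using only the standard library."""
    with urllib.request.urlopen(url) as response:
        return response.read()


def harvest(base_url, fetch=fetch):
    """Collect identifiers from every page of a ListRecords result."""
    collected = []
    token = None
    while True:
        ids, token = parse_page(fetch(build_url(base_url, token)))
        collected.extend(ids)
        if not token:  # an absent or empty token marks the last page
            return collected
```

Calling harvest with the base URL of a real repository would return the list of record identifiers; passing a different fetch function makes it possible to try the loop against canned XML without touching the network. A complete harvester would also need to handle OAI-PMH error responses and write the metadata itself to a file, which this sketch leaves out.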