Open Access News

News from the open access movement


Friday, October 06, 2006

Python code for harvesting OAI-compliant repositories

Rufus Pollock, Accessing open access repositories using the python oaipmh package, miscellaneous factZ, October 6, 2006. Excerpt:

The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) is growing rapidly as the standard web protocol for making metadata, primarily bibliographic information, available online for programmatic access. I’ve long meant to write something that would allow me to pull information down from remote repositories into my local bibliographic database automatically (it would save an awful lot of typing).

I’ve mentioned the oaipmh package provided by infrae.com before; however, the documentation they provide has become rather out of date, and though I’ve made a few attempts I’ve never quite been able to get it to work. After a bit more effort recently with the newer v2.0+ of the package, though, I’ve managed to get something basic working, which you can find [here]....
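
Pollock's script is not reproduced in this excerpt. As a rough illustration of what a harvest involves at the protocol level, here is a minimal sketch that uses only the Python standard library rather than the oaipmh package. It builds a ListRecords request URL and pulls identifiers, titles and creators out of an oai_dc response. The function names, the example.org base URL and the sample response are all assumptions made up for illustration; a real harvester would also fetch the URL over HTTP and handle protocol errors.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# XML namespaces defined by OAI-PMH 2.0 and Dublin Core.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}


def build_list_records_url(base_url, metadata_prefix="oai_dc", resumption_token=None):
    """Build a ListRecords request URL (hypothetical helper).

    When continuing a partial list, the protocol expects resumptionToken
    to be the only argument besides the verb.
    """
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    return base_url + "?" + urlencode(params)


def parse_list_records(xml_text):
    """Return (records, resumption_token) from a ListRecords response."""
    root = ET.fromstring(xml_text.strip())
    records = []
    for record in root.iterfind(".//oai:record", NS):
        identifier = record.findtext(
            "oai:header/oai:identifier", default="", namespaces=NS
        )
        titles = [(el.text or "").strip() for el in record.iterfind(".//dc:title", NS)]
        creators = [(el.text or "").strip() for el in record.iterfind(".//dc:creator", NS)]
        records.append(
            {"identifier": identifier.strip(), "titles": titles, "creators": creators}
        )
    # An empty or missing token means the list is complete.
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS)
    token = (token or "").strip()
    return records, (token or None)


# Invented sample response, for illustration only.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header>
        <identifier>oai:example.org:1</identifier>
      </header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>An Example Title</dc:title>
          <dc:creator>Doe, Jane</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>token-2</resumptionToken>
  </ListRecords>
</OAI-PMH>
"""

if __name__ == "__main__":
    print(build_list_records_url("http://example.org/oai"))
    records, token = parse_list_records(SAMPLE_RESPONSE)
    for rec in records:
        print(rec["identifier"], rec["titles"], rec["creators"])
    print("next token:", token)
```

To walk a whole repository, one would fetch the first URL, parse the response, and keep requesting with the returned resumption token until none comes back. Each parsed record could then be mapped onto whatever fields a local bibliographic database expects.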