[Koha-bugs] [Bug 10662] Build OAI-PMH Harvesting Client

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Wed Nov 30 00:52:30 CET 2016


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=10662

--- Comment #109 from David Cook <dcook at prosentient.com.au> ---
I think part of the problem is that I don't want to use Koha's batch import
system. I want to handle importing individual records belonging to arbitrary
metadata schemas myself. At the moment, I'm abusing the batch system to do what
I want.

I want to filter the data, I want to check database-level OAI-PMH data, then I
want to check Zebra-level MARCXML data, and the batch import isn't really set
up for that. And I think it might be prohibitively difficult to update it to
handle that.

I could decompose C4::ImportBatch::BatchCommitRecords into individual
add/update/delete functionality. That would mostly work, although then I would
lose the history that the batches afford you. But then there's bug 14367 which
would make up for that in theory.

I suppose this might all just be idealistic thinking on my part, and perhaps
outside the scope of this enhancement.

I already have code that mostly works using just the OAI-PMH identifier. It's a
flawed concept, but it's the same one used by VuFind and DSpace, and they
haven't fallen over yet. 

I have a few fixes I need to make to C4::Matcher and C4::Search for the
matching to work as expected though. 

I think perhaps what I have so far will just have to be good enough for now.
It's far from ideal, but it's functional. 

I'm tempted to add an "originDescription" as per
https://www.openarchives.org/OAI/2.0/guidelines-provenance.htm in each
harvested record so that we preserve as much metadata as possible during the
OAI-PMH harvest, although I think that's an inappropriate use of that element,
as that's supposed to be created at dissemination time rather than harvest
time.

Perhaps I'll just capture all relevant data and store it in the RDBMS, and we
can use it at a later time if necessary.

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list