[Koha-bugs] [Bug 10662] Build OAI-PMH Harvesting Client

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Mon Sep 2 09:40:12 CEST 2013


http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=10662

--- Comment #5 from David Cook <dcook at prosentient.com.au> ---
Thoughts on things to include:

1) Add a preference/config for using identifier + datestamp OR identifier +
datestamp + metadataPrefix as the indicator of the highest order of uniqueness.
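To illustrate item 1, here is a minimal Python sketch of how such a preference might switch between the two keys. The names OaiHeader, uniqueness_key and include_prefix are hypothetical and are not part of Koha or of any existing patch.

```python
from typing import NamedTuple, Tuple


class OaiHeader(NamedTuple):
    """Hypothetical container for the header values of one harvested record."""
    identifier: str
    datestamp: str
    metadata_prefix: str


def uniqueness_key(header: OaiHeader, include_prefix: bool) -> Tuple[str, ...]:
    """Build the key used to decide whether two harvested records are the same.

    include_prefix stands in for the proposed preference: when it is true,
    metadataPrefix becomes part of the key.
    """
    if include_prefix:
        return (header.identifier, header.datestamp, header.metadata_prefix)
    return (header.identifier, header.datestamp)


# Same identifier and datestamp, harvested in two different metadata formats.
dc = OaiHeader("oai:example.org:1", "2013-09-02", "oai_dc")
marc = OaiHeader("oai:example.org:1", "2013-09-02", "marcxml")
```

With include_prefix off, dc and marc count as the same record; with it on, they are distinct.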

2) Add an email feature that tells library staff to check a report which
enumerates the status of records imported via OAI-PMH. The status can be
"create" (new records), "replace" (updated records), "deleted" (incoming
records with a status of deleted), or "ambiguous" (essentially a new record,
but one linked to multiple existing bib records; these are almost certainly
duplicates, but they require manual merging since it's tough to know which is
the real authoritative record).

I was thinking perhaps of sending an email containing a link to a Template
Toolkit page (so that translation would be possible), which would contain the
import/history log.
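As a rough sketch of how the four statuses in item 2 might be assigned, assuming the importer already knows whether the incoming record is flagged as deleted and which existing bib records it matched. The function name and the choice to check the deleted flag first are assumptions, not a description of the current code.

```python
from typing import Sequence


def classify_import(is_deleted: bool, matching_biblionumbers: Sequence[int]) -> str:
    """Return the report status for one incoming OAI-PMH record."""
    if is_deleted:
        # Assumption: a deleted flag takes precedence over any match count.
        return "deleted"
    if len(matching_biblionumbers) == 0:
        return "create"      # no match: a new record
    if len(matching_biblionumbers) == 1:
        return "replace"     # exactly one match: an updated record
    return "ambiguous"       # several matches: needs manual merging
```

The report page could then group the import log by these values.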

3) Improve error handling.

4) Make the import options more configurable? Although I think the hardcoded
options for always replacing a bib match, adding for no match, and ignoring
items are probably pretty good. There might be other use cases where people
want something different though, so configuration might be a good idea
(although ignoring items is fairly essential, as you could duplicate items if
you're importing updated records with items). Perhaps the MARC21 XSLT should
also strip 952 fields.
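The idea of stripping 952 fields is raised above for the MARC21 XSLT. Purely to show the effect, here is the same operation sketched in Python with the standard library. It assumes the incoming record is MARCXML in the standard MARC21 slim namespace; strip_item_fields is a hypothetical name.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"

# Serialise MARCXML with a default namespace instead of generated prefixes.
ET.register_namespace("", MARC_NS)


def strip_item_fields(marcxml: str, tag: str = "952") -> str:
    """Remove every datafield with the given tag from a MARCXML document."""
    root = ET.fromstring(marcxml)
    # iter() also yields the root itself, so this handles both a bare
    # <record> and a <collection> of records.
    for record in root.iter(f"{{{MARC_NS}}}record"):
        # findall() returns a list, so removing while looping is safe.
        for field in record.findall(f"{{{MARC_NS}}}datafield"):
            if field.get("tag") == tag:
                record.remove(field)
    return ET.tostring(root, encoding="unicode")


sample = (
    f'<record xmlns="{MARC_NS}">'
    '<datafield tag="245" ind1=" " ind2=" "><subfield code="a">Title</subfield></datafield>'
    '<datafield tag="952" ind1=" " ind2=" "><subfield code="p">BARCODE1</subfield></datafield>'
    '<datafield tag="952" ind1=" " ind2=" "><subfield code="p">BARCODE2</subfield></datafield>'
    '</record>'
)
stripped = strip_item_fields(sample)
```

After the call, stripped still holds the 245 field but no 952 fields.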

5) When using the cronjob, if the "from" date for a repository is "null", check
for existing records in Koha, and use the latest "datestamp"? This way we're
able to do selective harvesting automatically without having to update our
configuration. (I'll probably add this one soon.)
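A small sketch of the fallback in item 5, assuming the datestamps of previously harvested records can be read back as ISO 8601 UTC strings of a single granularity (so that string comparison matches chronological order). effective_from is a hypothetical name.

```python
from typing import Iterable, Optional


def effective_from(configured_from: Optional[str],
                   stored_datestamps: Iterable[str]) -> Optional[str]:
    """Pick the "from" argument for the next harvest of one repository.

    A configured value always wins. When it is null (None), fall back to the
    latest datestamp already stored; if nothing has been harvested yet,
    return None so that a full harvest is done.
    """
    if configured_from is not None:
        return configured_from
    return max(stored_datestamps, default=None)
```

Whether to harvest from exactly that datestamp or from just after it is a separate decision; OAI-PMH treats "from" as inclusive, so the latest record would be fetched again.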

6) Matching rules: Check which MARC field the remote system uses for its
biblionumber, then check whether Koha has a matching rule for that field. If
not, create one? Having this matching rule is essential for matching updated
records.

7) Improve the DC => MARC conversion. (I might look at this soon too. It will
always be a "best guess", but it has room for a lot of easy improvement.)

8) Make an OAI-PMH harvesting web UI. This would allow people to plug in the
baseURL for a remote OAI-PMH repository and use the 6 verbs on it.

I imagine it being a good way for people to get used to what an OAI-PMH repo
has, so that they can set up the automatic cronjob configuration. It could also
be a good idea to allow the "ImportRecordsIntoKoha" method for selective
harvesting.

That said, if it's too easy to use, it might also be abused by someone who
doesn't know what they're doing. If there were a GUI, it would need a
permission and/or system preference most likely.
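For reference, the six verbs mentioned in item 8 are Identify, ListMetadataFormats, ListSets, ListIdentifiers, ListRecords and GetRecord. The sketch below shows how a web UI might turn a baseURL and a verb into a request URL. build_request_url and the example.org address are made up for illustration.

```python
from typing import Dict, Optional
from urllib.parse import urlencode

# The six request types defined by OAI-PMH 2.0.
OAI_VERBS = (
    "Identify",
    "ListMetadataFormats",
    "ListSets",
    "ListIdentifiers",
    "ListRecords",
    "GetRecord",
)


def build_request_url(base_url: str, verb: str,
                      arguments: Optional[Dict[str, str]] = None) -> str:
    """Build the GET URL for one OAI-PMH request.

    arguments holds the verb-specific parameters (metadataPrefix, from,
    until, set, identifier, resumptionToken). It is a dict rather than
    keyword arguments because "from" is a reserved word in Python.
    """
    if verb not in OAI_VERBS:
        raise ValueError(f"unknown OAI-PMH verb: {verb}")
    params = {"verb": verb}
    params.update(arguments or {})
    return f"{base_url}?{urlencode(params)}"


url = build_request_url(
    "http://example.org/oai",  # hypothetical repository baseURL
    "ListRecords",
    {"metadataPrefix": "oai_dc", "from": "2013-09-01"},
)
```

This only builds the URL. Fetching, resumption tokens and error responses are left out.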
