[Koha-bugs] [Bug 35659] OAI Harvester

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Wed Feb 28 08:34:07 CET 2024


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35659

Andreas Roussos <a.roussos at dataly.gr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |a.roussos at dataly.gr

--- Comment #31 from Andreas Roussos <a.roussos at dataly.gr> ---
Just wanted to point out that I tried to harvest bibliographic
records from the first repository suggested in the Test plan
(http://staroai.theses.fr/OAIHandler, oai_dc data format) in a
MARC21 Koha instance (therefore using OAIDCtoMARC21XML.xsl to
transform the records prior to importing), and when I ran the
OAI harvester script in verbose mode I got many lines similar
to the following in the output:

no mapping found for [0xC9] at position 0 in Économie politique
g0=ASCII_DEFAULT g1=EXTENDED_LATIN at /usr/share/perl5/MARC/Charset.pm line
308.
no mapping found for [0xC9] at position 0 in Église et société g0=ASCII_DEFAULT
g1=EXTENDED_LATIN at /usr/share/perl5/MARC/Charset.pm line 308.

The script did actually finish and managed to add 91 records,
but I noticed that in the biblio details page in the OPAC some
(not all) words containing accented characters did not appear
correctly. I have uploaded a screenshot that illustrates this.

Upon further investigation, it looks like the script *does*
manage to populate database fields such as biblio.title and
biblio.abstract with the correctly accented characters, but
some MARC fields (such as 245 - Title, and 6xx - Subjects)
in the MARCXML representation in biblio_metadata.metadata
still contain weird characters wherever accents exist.

The fix is to apply Franck Theeten's solution from Bug 16488
(https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=16488#c24)
and modify the value of the MARC leader's 10th character
to 'a' in the XSLT. Then, the accented characters get imported
properly and the records appear correctly in the OPAC.

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list