[Koha-bugs] [Bug 35104] We should warn when attempting to save badly encoded MARC records

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Mon Oct 23 02:10:07 CEST 2023


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35104

--- Comment #2 from David Cook <dcook at prosentient.com.au> ---
But first a test...

I change the Library of Congress Z39.50 from utf8 to "ISO_8859-1" (note
changing to MARC-8 didn't seem to make a difference), and then I search
"bibliothecaire" for the Title.

I quickly notice obvious encoding problems like "Fonction peÌdagogique du
documentaliste-bibliotheÌcaire"

So I go to import that one... and actually I don't have any problems. I get a
record with the following title: "Fonction peÌdagogique du
documentaliste-bibliotheÌcaire : journeÌe acadeÌmique des
documentalistes-bibliotheÌcaires, le 7 feÌvrier 1977"

I suppose it's because Ì and  are valid UTF-8 characters. 

If I change the Z39.50 back to "utf8" then I get: "Fonction pédagogique du
documentaliste-bibliothécaire :"

I'm not sure why é would be interpreted as Ì though. If I download as MARCXML,
I see the title is "Fonction pe&#xCC;&#x81;dagogique du
documentaliste-bibliothe&#xCC;&#x81;caire :"

I guess I'm not confident how many conversions are happening and at which
points but this still looks odd...

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.


More information about the Koha-bugs mailing list