[Koha-bugs] [Bug 19506] New: Staged Marc Import should warn if the selected encoding does not match MARC 21 leader

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Sat Oct 21 21:08:30 CEST 2017


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=19506

            Bug ID: 19506
           Summary: Staged Marc Import should warn if the selected
                    encoding does not match MARC 21 leader
 Change sponsored?: ---
           Product: Koha
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: enhancement
          Priority: P5 - low
         Component: MARC Bibliographic record staging/import
          Assignee: koha-bugs at lists.koha-community.org
          Reporter: barton at bywatersolutions.com
        QA Contact: testopia at bugs.koha-community.org

Mis-matches between character encodings will cause diacritics be incorrectly
imported into Koha. If the imported marc record is MARC8 encoded and imported
as UTF-8, diacritics will be converted to the REPLACEMENT CHARACTER '�'. A
UTF-8 encoded file imported as MARC8 will try to render the multi-byte UTF-8
characters as MARC8, resulting in garbage text.

MARC21 stores its encoding in leader position 09 - Character coding scheme
(http://www.loc.gov/marc/bibliographic/bdleader.html). The accepted encodings
are '#' for MARC8 and 'a' for UTF-8 (although Koha exports ' ' for MARC8).

Koha should check the leader for MARC21 records and warn if the selected
encoding does not match the encoding of the marc record, giving the user the
opportunity to change the encoding or download fresh records with the correct
encoding.

[I couldn't tell if this was an issue for UNIMARC]

-- 
You are receiving this mail because:
You are watching all bug changes.
You are the assignee for the bug.


More information about the Koha-bugs mailing list