[Koha-bugs] [Bug 24123] New: bulkmarcimport.pl doesn't support UTF-8 encoded MARCXML records

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Tue Nov 26 15:08:58 CET 2019


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=24123

            Bug ID: 24123
           Summary: bulkmarcimport.pl doesn't support UTF-8 encoded
                    MARCXML records
 Change sponsored?: ---
           Product: Koha
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: P5 - low
         Component: Command-line Utilities
          Assignee: koha-bugs at lists.koha-community.org
          Reporter: joonas.kylmala at helsinki.fi
        QA Contact: testopia at bugs.koha-community.org
                CC: jonathan.druart at bugs.koha-community.org,
                    robin at catalyst.net.nz

Importing an UTF-8 encoded MARC21 MARCXML record that has leader value in
position 09 equal 'a' (unicode) makes non-ascii characters like 'ä' to be
replaced with character '�'.

The command being run is

> misc/migration_tools/bulkmarcimport.pl -biblios -file record.marcxml -m=MARCXML -v -v


I tracked this problem down to the line

> $MARC::File::XML::_load_args{BinaryEncoding} = 'utf-8';

If you add that line again as first thing in the RECORD label block further
down in the bulkmarcimport.pl the problem is fixed.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.


More information about the Koha-bugs mailing list