[Koha-bugs] [Bug 1567] New: MARC record parsing from and to XML
bugzilla-daemon at pippin.metavore.com
bugzilla-daemon at pippin.metavore.com
Tue Nov 6 22:07:56 CET 2007
http://bugs.koha.org/cgi-bin/bugzilla/show_bug.cgi?id=1567
Summary: MARC record parsing from and to XML
Product: Koha
Version: rel_3_0
Platform: PC
URL: http://support.tamil.fr/tako_test.pl.tgz
OS/Version: Linux - Debian
Status: NEW
Severity: blocker
Priority: P3
Component: MARC Import
AssignedTo: galen.charlton at liblime.com
ReportedBy: frederic at tamil.fr
QAContact: koha-bugs at nongnu.org
On a Debian/Lenny, parsing of XML marc records encoded in UTF-8 doesn't work
properly depending of SAX parser used : PurePerl, Expat or LibXML.
(1) Records are converted into XML string before being written in
bibliotems.marcxml. A function of MARC::File::XML us used in Biblio.pm:
$record->as_xml_record($encoding). With Expat or LibXML SAX Parser, encoding is
suppressed:
<?xml version="1.0" encoding=""?>
Consequently, records can't be read back from this biblioitems.marcxml. This is
a blocker bug.
(2) Records are created from XML string (biblioitems.marcxml). In Biblio.pm:
$record = eval {MARC::Record::new_from_xml( $marcxml, "utf8",
C4::Context->preference('marcflavour'))};
This function doesn't work properly with Expat or LibXML. UTF-8 extended
characters (é for example) are converted in their ISO-8859-1 counterpart.
Perl script attached allow to test those bugs.
------- You are receiving this mail because: -------
You are the QA contact for the bug, or are watching the QA contact.
More information about the Koha-bugs
mailing list