[Koha-bugs] [Bug 9496] New: rebuild_zebra.pl should handle invalid XML records better
bugzilla-daemon at bugs.koha-community.org
bugzilla-daemon at bugs.koha-community.org
Sat Jan 26 01:43:46 CET 2013
http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9496
Bug ID: 9496
Summary: rebuild_zebra.pl should handle invalid XML records
better
Classification: Unclassified
Change sponsored?: ---
Product: Koha
Version: master
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: P5 - low
Component: Command-line Utilities
Assignee: gmcharlt at gmail.com
Reporter: gmcharlt at gmail.com
If a bib or authority MARCXML record is not well-formed, rebuild_zebra.pl
should catch it so that invalid XML is not included in the file it passes to
zebraidx. This is particularly important when DOM mode indexing is used. This
is because in DOM mode zebraidx is processing a single XML document (namely,
the MARCXML import), and if it reaches a record that causes the XML file to be
invalid, it will not handle any records past that point. Worse, it doesn't
complain quite loudly enough when that happens, so a user could find their Koha
catalog to be partially indexed for no apparent reason.
In addition, when rebuild_zebra is run using the -nosanitize option, zebraidx
in DOM mode will refuse to process the output because there are extraneous XML
declarations.
--
You are receiving this mail because:
You are watching all bug changes.
More information about the Koha-bugs
mailing list