[Koha-bugs] [Bug 9496] New: rebuild_zebra.pl should handle invalid XML records better

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Sat Jan 26 01:43:46 CET 2013


http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9496

            Bug ID: 9496
           Summary: rebuild_zebra.pl should handle invalid XML records
                    better
    Classification: Unclassified
 Change sponsored?: ---
           Product: Koha
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: P5 - low
         Component: Command-line Utilities
          Assignee: gmcharlt at gmail.com
          Reporter: gmcharlt at gmail.com

If a bib or authority MARCXML record is not well-formed, rebuild_zebra.pl
should catch it so that invalid XML is not included in the file it passes to
zebraidx.  This is particularly important when DOM mode indexing is used.  This
is because in DOM mode zebraidx is processing a single XML document (namely,
the MARCXML import), and if it reaches a record that causes the XML file to be
invalid, it will not handle any records past that point.  Worse, it doesn't
complain quite loudly enough when that happens, so a user could find their Koha
catalog to be partially indexed for no apparent reason.

In addition, when rebuild_zebra is run using the -nosanitize option, zebraidx
in DOM mode will refuse to process the output because there are extraneous XML
declarations.

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list