[Koha-bugs] [Bug 18683] New: biblioitems.marcxml does not match MARC21slim.xsd

bugzilla-daemon at bugs.koha-community.org
Fri May 26 20:56:13 CEST 2017


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=18683

            Bug ID: 18683
           Summary: biblioitems.marcxml does not match MARC21slim.xsd
 Change sponsored?: ---
           Product: Koha
           Version: 17.05
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P5 - low
         Component: Database
          Assignee: gmcharlt at gmail.com
          Reporter: mail at casi.pw
        QA Contact: testopia at bugs.koha-community.org

I found the following record in a Koha installation that I manage, running on a
Debian server.

----

<record
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://www.loc.gov/MARC21/slim http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"
    xmlns="http://www.loc.gov/MARC21/slim">

  <leader>[...]</leader>
  <datafield tag="999" ind1=" " ind2=" ">
    <subfield code="c">[...]</subfield>
    <subfield code="d">[...]</subfield>
  </datafield>
  <controlfield tag="001">[...]</controlfield>
  [...]

----

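This does not comply with the MARC21slim.xsd schema definition. The sequence
in recordType requires the leader first, then any controlfield elements, then
any datafield elements, whereas the record above has datafield 999 before
controlfield 001. The schema says: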

<xsd:complexType name="recordType" id="record.ct">
        <xsd:sequence minOccurs="0">
                <xsd:element name="leader" type="leaderFieldType"/>
                <xsd:element name="controlfield" type="controlFieldType" minOccurs="0" maxOccurs="unbounded"/>
                <xsd:element name="datafield" type="dataFieldType" minOccurs="0" maxOccurs="unbounded"/>
        </xsd:sequence>
        <xsd:attribute name="type" type="recordTypeType" use="optional"/>
        <xsd:attribute name="id" type="idDataType" use="optional"/>
</xsd:complexType>
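
The ordering rule in that sequence can be checked without a schema validator.
What follows is only a minimal Python sketch, not a full XSD validation: it
compares the order of the top-level elements and nothing else (it does not
check that exactly one leader is present, for example). The names local_name
and follows_slim_order are made up for this illustration and are not part of
Koha.

```python
import xml.etree.ElementTree as ET

# Rank of each element in the order required by recordType's xsd:sequence.
ORDER = {"leader": 0, "controlfield": 1, "datafield": 2}


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def follows_slim_order(marcxml):
    """Return True if the children of <record> appear in schema order.

    Only leader, controlfield and datafield elements are considered;
    any other child element is ignored by this sketch.
    """
    record = ET.fromstring(marcxml)
    ranks = [ORDER[local_name(child.tag)]
             for child in record
             if local_name(child.tag) in ORDER]
    # In a valid record the ranks never decrease: 0, 1, 1, ..., 2, 2, ...
    return ranks == sorted(ranks)
```

Applied to a record shaped like the one above (leader, datafield 999,
controlfield 001), follows_slim_order returns False, because a datafield
appears before a controlfield.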

These schema-invalid XML strings were created by Koha over time. To locate the
affected records, I ran SQL queries on the database.

> select timestamp from biblioitems where marcxml like "%<datafield tag=\"999\"%<controlfield%" order by timestamp

This returns 962 rows, with timestamps starting on 24 October 2016.

Some of the marcxml values even contain field 999 a second time, at the end.

> select timestamp from biblioitems where marcxml like "%<datafield tag=\"999\"%<controlfield%<datafield tag=\"999\"%" order by timestamp

This returns 509 rows, with timestamps starting on 4 January 2017.
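
The two LIKE patterns match on raw text. As a cross-check, the same two
conditions can be tested on the parsed record. This is a hedged Python sketch
that assumes the marcxml value has already been fetched from biblioitems as a
string and that it declares the MARC21 slim namespace, as the record above
does. The function name like_state and its return codes are invented for this
example.

```python
import xml.etree.ElementTree as ET

NS = "{http://www.loc.gov/MARC21/slim}"


def like_state(marcxml):
    """Walk the top-level elements of <record> and report how far the
    pattern '999 ... controlfield ... 999' was matched.

    0: no datafield 999 found
    1: a datafield 999 was found
    2: a controlfield follows a datafield 999 (first query)
    3: another datafield 999 follows that controlfield (second query)
    """
    record = ET.fromstring(marcxml)
    state = 0
    for child in record:
        is_999 = (child.tag == NS + "datafield"
                  and child.get("tag") == "999")
        is_control = child.tag == NS + "controlfield"
        if state == 0 and is_999:
            state = 1
        elif state == 1 and is_control:
            state = 2
        elif state == 2 and is_999:
            state = 3
    return state
```

A value of 2 or more corresponds to the first query, and a value of 3 to the
second one. Unlike LIKE, this only looks at direct children of the record
element, so the counts could differ slightly from the SQL results.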
