[Koha-bugs] [Bug 35710] New: Indexing problem with "&" symbol in Zebra. Some biblio-records with "&" symbol are not searching

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Fri Jan 5 15:26:07 CET 2024


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35710

            Bug ID: 35710
           Summary: Indexing problem with "&" symbol in Zebra. Some
                    biblio-records with "&" symbol are not searching
 Change sponsored?: ---
           Product: Koha
           Version: 21.05
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P5 - low
         Component: Searching - Zebra
          Assignee: koha-bugs at lists.koha-community.org
          Reporter: yuriy.kotsyuk at oa.edu.ua

I find a problem in my academy koha instance. My librarians can't find some
biblio-records. I check database and figured out that records looks good. So, I
decided to rebuild zebra index.
During rebuilding where a lot of errors. Simple analyses show, that problem was
with "&" symbol. Example below:
...
Wide character in warn at /usr/share/koha/lib/C4/Biblio.pm line 1199.
 problem with :423324 : :42: parser error : xmlParseEntityRef: no name
    <subfield code="b">Norton & Company</subfield>
                               ^

<?xml version="1.0" encoding="UTF-8"?>
<record
    xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
    xsi:schemaLocation="http://www.loc.gov/MARC21/slim
http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"
    xmlns="http://www.loc.gov/MARC21/slim">

  <leader>00516nam a22002177a 4500</leader>
  <controlfield tag="003">UA-OsUOA</controlfield>
  <controlfield tag="001"> 133930 </controlfield>
  <controlfield tag="007">ta</controlfield>
  <controlfield tag="008">20181112b un ||||| |||| 00| 0 ukr d</controlfield>
  <datafield tag="090" ind1=" " ind2=" ">
    <subfield code="a">51</subfield>
  </datafield>
  <datafield tag="942" ind1=" " ind2=" ">
    <subfield code="c">BK</subfield>
  </datafield>
  <datafield tag="040" ind1=" " ind2=" ">
    <subfield code="a">UA-OsUOA</subfield>
    <subfield code="b">eng</subfield>
    <subfield code="c">UA-OsUOA</subfield>
    <subfield code="d">UA-OsUOA</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">The Principles of Mathematics</subfield>
    <subfield code="c">B. Russell</subfield>
  </datafield>
  <datafield tag="300" ind1=" " ind2=" ">
    <subfield code="a">535 p. с.</subfield>
  </datafield>
  <datafield tag="955" ind1=" " ind2=" ">
    <subfield code="a">2</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="080" ind1=" " ind2=" ">
    <subfield code="a">51</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="a">New York</subfield>
    <subfield code="b">Norton & Company</subfield>
    <subfield code="c">1951</subfield>
  </datafield>
  <datafield tag="852" ind1=" " ind2=" ">
    <subfield code="a"></subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="7">
    <subfield code="a">51 Математика</subfield>
    <subfield code="2">UDC</subfield>
  </datafield>
  <datafield tag="999" ind1=" " ind2=" ">
    <subfield code="c">423324</subfield>
    <subfield code="d">423324</subfield>
  </datafield>
  <datafield tag="901" ind1=" " ind2=" ">
    <subfield code="c">2</subfield>
    <subfield code="d">Юрій Анатолійович Коцюк</subfield>
  </datafield>
</record>
error retrieving biblio 423324 at
/usr/share/koha/bin/migration_tools/rebuild_zebra.pl line 686.
... 


One more example:
...
Wide character in warn at /usr/share/koha/lib/C4/Biblio.pm line 1199.
 problem with :423001 : :43: parser error : EntityRef: expecting ';'
    <subfield
code="u">http://zt.knteu.kiev.ua/index.php?option=com_content&view
                                                                               
^
:43: parser error : EntityRef: expecting ';'
ld
code="u">http://zt.knteu.kiev.ua/index.php?option=com_content&view=article&id
                                                                               
^
:43: parser error : EntityRef: expecting ';'
http://zt.knteu.kiev.ua/index.php?option=com_content&view=article&id=2349&Itemid
                                                                               
^
:43: parser error : EntityRef: expecting ';'
.knteu.kiev.ua/index.php?option=com_content&view=article&id=2349&Itemid=721&lang
                                                                               
^
...

It's looks like script try use ampersand sequence...

-- 
You are receiving this mail because:
You are watching all bug changes.
You are the assignee for the bug.


More information about the Koha-bugs mailing list