[Koha-bugs] [Bug 35710] New: Indexing problem with "&" symbol in Zebra. Some biblio-records with "&" symbol are not searching
bugzilla-daemon at bugs.koha-community.org
bugzilla-daemon at bugs.koha-community.org
Fri Jan 5 15:26:07 CET 2024
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35710
Bug ID: 35710
Summary: Indexing problem with "&" symbol in Zebra. Some
biblio-records with "&" symbol are not searching
Change sponsored?: ---
Product: Koha
Version: 21.05
Hardware: All
OS: Linux
Status: NEW
Severity: normal
Priority: P5 - low
Component: Searching - Zebra
Assignee: koha-bugs at lists.koha-community.org
Reporter: yuriy.kotsyuk at oa.edu.ua
I find a problem in my academy koha instance. My librarians can't find some
biblio-records. I check database and figured out that records looks good. So, I
decided to rebuild zebra index.
During rebuilding where a lot of errors. Simple analyses show, that problem was
with "&" symbol. Example below:
...
Wide character in warn at /usr/share/koha/lib/C4/Biblio.pm line 1199.
problem with :423324 : :42: parser error : xmlParseEntityRef: no name
<subfield code="b">Norton & Company</subfield>
^
<?xml version="1.0" encoding="UTF-8"?>
<record
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.loc.gov/MARC21/slim
http://www.loc.gov/standards/marcxml/schema/MARC21slim.xsd"
xmlns="http://www.loc.gov/MARC21/slim">
<leader>00516nam a22002177a 4500</leader>
<controlfield tag="003">UA-OsUOA</controlfield>
<controlfield tag="001"> 133930 </controlfield>
<controlfield tag="007">ta</controlfield>
<controlfield tag="008">20181112b un ||||| |||| 00| 0 ukr d</controlfield>
<datafield tag="090" ind1=" " ind2=" ">
<subfield code="a">51</subfield>
</datafield>
<datafield tag="942" ind1=" " ind2=" ">
<subfield code="c">BK</subfield>
</datafield>
<datafield tag="040" ind1=" " ind2=" ">
<subfield code="a">UA-OsUOA</subfield>
<subfield code="b">eng</subfield>
<subfield code="c">UA-OsUOA</subfield>
<subfield code="d">UA-OsUOA</subfield>
</datafield>
<datafield tag="245" ind1=" " ind2=" ">
<subfield code="a">The Principles of Mathematics</subfield>
<subfield code="c">B. Russell</subfield>
</datafield>
<datafield tag="300" ind1=" " ind2=" ">
<subfield code="a">535 p. с.</subfield>
</datafield>
<datafield tag="955" ind1=" " ind2=" ">
<subfield code="a">2</subfield>
</datafield>
<datafield tag="041" ind1=" " ind2=" ">
<subfield code="a">eng</subfield>
</datafield>
<datafield tag="080" ind1=" " ind2=" ">
<subfield code="a">51</subfield>
</datafield>
<datafield tag="260" ind1=" " ind2=" ">
<subfield code="a">New York</subfield>
<subfield code="b">Norton & Company</subfield>
<subfield code="c">1951</subfield>
</datafield>
<datafield tag="852" ind1=" " ind2=" ">
<subfield code="a"></subfield>
</datafield>
<datafield tag="650" ind1=" " ind2="7">
<subfield code="a">51 Математика</subfield>
<subfield code="2">UDC</subfield>
</datafield>
<datafield tag="999" ind1=" " ind2=" ">
<subfield code="c">423324</subfield>
<subfield code="d">423324</subfield>
</datafield>
<datafield tag="901" ind1=" " ind2=" ">
<subfield code="c">2</subfield>
<subfield code="d">Юрій Анатолійович Коцюк</subfield>
</datafield>
</record>
error retrieving biblio 423324 at
/usr/share/koha/bin/migration_tools/rebuild_zebra.pl line 686.
...
One more example:
...
Wide character in warn at /usr/share/koha/lib/C4/Biblio.pm line 1199.
problem with :423001 : :43: parser error : EntityRef: expecting ';'
<subfield
code="u">http://zt.knteu.kiev.ua/index.php?option=com_content&view
^
:43: parser error : EntityRef: expecting ';'
ld
code="u">http://zt.knteu.kiev.ua/index.php?option=com_content&view=article&id
^
:43: parser error : EntityRef: expecting ';'
http://zt.knteu.kiev.ua/index.php?option=com_content&view=article&id=2349&Itemid
^
:43: parser error : EntityRef: expecting ';'
.knteu.kiev.ua/index.php?option=com_content&view=article&id=2349&Itemid=721&lang
^
...
It's looks like script try use ampersand sequence...
--
You are receiving this mail because:
You are watching all bug changes.
You are the assignee for the bug.
More information about the Koha-bugs
mailing list