[Koha-bugs] [Bug 26472] Elasticsearch - ES - Authority record results not ordered correctly due to punctuation marks

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Fri Jun 24 19:21:23 CEST 2022


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=26472

Esther Melander <estherm at sodaspringsid.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |estherm at sodaspringsid.com

--- Comment #1 from Esther Melander <estherm at sodaspringsid.com> ---
We are using Elastic Search and have noticed a similar problem for authorities
with trailing periods. The trailing periods of an authority should be ignored
in the sort, but are instead showing at the end of the result list. This
problem appears in the advanced editor with the authority look-up
(ctrl-shift-L) search. If you were to search for "Cooking." with the search
parameter "contains" the result is not returned at the top of the list as
expected, but rather towards the bottom. Further complicating, if you use the
search parameter "starts with" or "exact" no results are returned. Do the same
search without the trailing period "Cooking" and you will get the expected
results.

Also, Library of Congress is moving toward minimally punctuated authorities in
which there is no closing punctuation on some authorities. As a result there
are now existing authorities following different punctuation rules. Regardless
of the source, trailing periods are throwing off the sort and I also suspect
auto linking.

Here are some additional papers on punctuation in authorities.
https://www.oclc.org/bibformats/en/6xx.html shows examples of the 600 subject
tags without punctuation at the end.

These are links to a power point and paper on minimally punctuated records put
out by Library of Congress.
https://www.loc.gov/aba/pcc/sct/documents/GuidelinesMinimallyPunctuatedMARC-SCT-2020-01.pptx
https://www.loc.gov/aba/pcc/documents/PCC-Guidelines-Minimally-Punctuated-MARC-Data.docx

In any event, Elastic Search appears to need some refinement in how punctuation
is handled to bring it into compliance with current practice.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.


More information about the Koha-bugs mailing list