[Koha-bugs] [Bug 26472] Elasticsearch - ES - Authority record results not ordered correctly due to punctuation marks

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Mon Sep 11 13:44:56 CEST 2023


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=26472

--- Comment #41 from Nick Clemens <nick at bywatersolutions.com> ---
(In reply to Victor Grousset/tuxayo from comment #37)
> > 1 - Create authorities with main headings like below and confirm they sort in the order shown
> > Hand blows
> > Hand book for Prospect Park
> > Hand (Fictitious character)
> > Hand in glove
> > Hand-ball
> > Handbok for sangere
> > Handbook for adventure
> > Hande im Pflug
> > Hands in the past
> > Handu
> 
> nope T_T
> 
> i get this:
>  Hand (Fictitious character) 
>  Hand blows 
>  Hand book for Prospect Park 
>  Hand in glove 
>  Hand-ball 
>  Handbok for sangere 
>  Handbook for adventure 
>  Hande im Pflug 
>  Hands in the past 
>  Handu 
> 
> With ES 7 and opensearch 1.x
> And I double checked not using zebra instead of ES.
> My koha (ktd) was started with the patch already applied.
> 
> Any idea of what could go wrong? Any implicit step before the 1st in the
> test plan ?

Retesting, this is what I get too, and on rereading, this is what is expected
with quaternary strength: it considers differences at the 4th level, meaning
accents and punctuation are both considered.

So "Hand (Fictitious character)" comes first because the space, and then the
'(', sort before the letters.

It also makes headings that differ only in case or accents sort in a fixed
order:
hand book for Prospect Park
Hand book for Prospect Park
Hand boôk for Prospect Park
Hand böok for Prospect Park

It seems what we want is "punctuation ignored, but whitespace not ignored", but
I can't find that option in the ICU documentation:
https://unicode-org.github.io/icu/userguide/collation/architecture.html#strength-level
https://unicode-org.github.io/icu/userguide/collation/customization/ignorepunct.html
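
To make the "punctuation ignored, but whitespace not ignored" rule concrete,
here is a rough sketch in plain Python. It is not ICU and not what the patch
does: sort_key is a hypothetical helper that drops Unicode punctuation and
combining accents, keeps whitespace, and folds case, which is only an
approximation of the ordering we are after. Run on the headings from the test
plan, it gives the order listed there.

```python
import unicodedata


def sort_key(heading: str) -> str:
    """Hypothetical key: ignore punctuation and accents, keep whitespace."""
    # Decompose so accents become separate combining marks (category Mn).
    decomposed = unicodedata.normalize("NFD", heading)
    kept = [
        ch
        for ch in decomposed
        # Drop punctuation (categories P*) and combining accents (Mn).
        if not unicodedata.category(ch).startswith("P")
        and unicodedata.category(ch) != "Mn"
    ]
    # Fold case and collapse whitespace runs left behind by removed characters.
    return " ".join("".join(kept).casefold().split())


# Headings from the test plan, in the order returned by the search in comment #37.
headings = [
    "Hand (Fictitious character)",
    "Hand blows",
    "Hand book for Prospect Park",
    "Hand in glove",
    "Hand-ball",
    "Handbok for sangere",
    "Handbook for adventure",
    "Hande im Pflug",
    "Hands in the past",
    "Handu",
]

ordered = sorted(headings, key=sort_key)
for heading in ordered:
    print(heading)
```

Symbols such as '+' or '$' are in the Unicode S* categories rather than P*, so
this sketch keeps them. Whether those should be ignored as well is a separate
decision.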


