[Koha-bugs] [Bug 26472] Elasticsearch - ES - Authority record results not ordered correctly due to punctuation marks

bugzilla-daemon at bugs.koha-community.org
Fri Aug 4 00:03:17 CEST 2023


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=26472

--- Comment #33 from Victor Grousset/tuxayo <victor at tuxayo.net> ---
(In reply to Nick Clemens from comment #18)
> Forgive the silly example, but "alternate: shifted" ignores punctuation and
> whitespace.
> 
> Do we want white space considered? i.e. is it correct for 'Santabna' to sort
> before 'Santa clarita'?

Thanks for catching this! :o

---

(In reply to Heather from comment #32)
> I don't think so.  In
> https://babel.hathitrust.org/cgi/pt?id=mdp.39015022080140&view=1up&seq=63 
> Rule 12 states, "Words connected by a hyphen [...]

So if I understand correctly, the current proposal should sort " " (space)
after "-", "," and other symbols?

How much of a problem is it? The first problematic cases on this ticket caused
the results to be grouped by punctuation. That was bad because it mixed up
entries that were semantically linked and should have been together. The
letters carry the real meaning here, not a hyphen.

Comment 18 also shows entries mixed in that split the group of "santa ", which
should have stayed all together.

So about sorting " " after "-" and "," => does it lead to examples of entries
ending up in the wrong place, as bad as the two cases above? Or is the rule
more about having a standardized, unambiguous order for the sake of
consistency and predictability, even if nothing would be semantically messed
up without that rule?
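
To make the comment 18 case concrete, here is a rough sketch in plain Python.
It is not the ICU collation Elasticsearch uses, only an approximation of the
two behaviours being compared. The sample headings and both key functions are
made up for illustration, and treating a hyphen like a space is an assumption,
not something taken from the cataloguing rules quoted above.

```python
# Hypothetical sample headings, not taken from real authority records.
headings = ["Santa Clarita", "Santana", "Santa-Ana", "Santander", "Santa, Zoe"]


def key_ignore_punct_and_space(heading):
    # Rough stand-in for "alternate: shifted" as described in comment 18:
    # punctuation and whitespace are dropped, only letters and digits remain.
    return "".join(ch for ch in heading.lower() if ch.isalnum())


def key_keep_space(heading):
    # Assumption: punctuation is turned into a space and runs of spaces are
    # collapsed, so word boundaries still count when comparing.
    cleaned = "".join(ch if ch.isalnum() else " " for ch in heading.lower())
    return " ".join(cleaned.split())


ignoring_space = sorted(headings, key=key_ignore_punct_and_space)
keeping_space = sorted(headings, key=key_keep_space)

print(ignoring_space)
print(keeping_space)
```

With the first key, "Santa, Zoe" lands after "Santander", so the "santa "
group gets split. With the second key, the three "santa " headings stay
together ahead of "Santana" and "Santander".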
