[Koha-bugs] [Bug 2629] Diacritics not being ignored when searching
bugzilla-daemon at bugs.koha-community.org
bugzilla-daemon at bugs.koha-community.org
Thu Oct 4 20:38:19 CEST 2012
http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=2629
wajasu <matted-34813 at mypacks.net> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |matted-34813 at mypacks.net
--- Comment #41 from wajasu <matted-34813 at mypacks.net> ---
As ICU is being persued, I am only noting an alternate possible idea in case
that route does not come about.
We might be able to write a perl script that uses Unicde::UCD or such routines
to scan the DB searchable fields, and convert them to NFD form to detect
diacritics(decomposed form) or another alogrithm, and compile a table
accordingly. Then we could generate the /etc/zebradb/etc/word-phrase-utf.chr
map. It may suffice as a stop gap measure, that once run in a large library,
it might cover 99.9% of the cases, catering to the context of each library. So
instead of a patch per character, the site would periodically run the script to
handle such cases.
But ICU is where we are persuing for now.
--
You are receiving this mail because:
You are watching all bug changes.
More information about the Koha-bugs
mailing list