[Koha-bugs] [Bug 2629] Diacritics not being ignored when searching

Thu Oct 4 20:38:19 CEST 2012

http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=2629

wajasu <matted-34813 at mypacks.net> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |matted-34813 at mypacks.net

--- Comment #41 from wajasu <matted-34813 at mypacks.net> ---
As ICU is being persued, I am only noting an alternate possible idea in case
that route does not come about.

We might be able to write a perl script that uses Unicde::UCD or such routines
to scan the DB searchable fields, and convert them to NFD form to detect
diacritics(decomposed form) or another alogrithm, and compile a table
accordingly.  Then we could generate the /etc/zebradb/etc/word-phrase-utf.chr 
map.   It may suffice as a stop gap measure, that once run in a large library,
it might cover 99.9% of the cases, catering to the context of each library.  So
instead of a patch per character, the site would periodically run the script to
handle such cases.

But ICU is where we are persuing for now.

-- 
You are receiving this mail because:
You are watching all bug changes.