[Koha-bugs] [Bug 2629] Diacritics not being ignored when searching

Thu Sep 24 17:14:16 CEST 2009

http://bugs.koha.org/cgi-bin/bugzilla3/show_bug.cgi?id=2629

Brian Harrington <brian at jhu.edu> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |brian at jhu.edu

--- Comment #3 from Brian Harrington <brian at jhu.edu>  2009-09-24 15:14:16 ---
Should we switch default.idx over to icuchain indexing to take advantage of
ICU's Unicode normaliztion?  I think that the yaz/zebra issues with it have
been fixed.  The following word.xml is working for me, but could use more
non-diacritic special characters.

<icu_chain locale="en">
        <transform rule="[:Control:] Any-Remove" />
        <casemap rule="l" />
        <tokenize rule="w" />
        <transliterate rule="\- > ' '" />
        <transform rule="NFKD; [[:WhiteSpace:][:Punctuation:][:Diacritic:]]
Remove" />
        <transliterate rule="ø > o"/>
        <transliterate rule="ð > th"/>
        <transliterate rule="ʒ > z"/>
        <transliterate rule="ȝ > g"/>
        <transliterate rule="æ > ae"/>
        <transliterate rule="œ > oe"/>
        <transliterate rule="þ > th"/>
        <display />
</icu_chain>

-- 
Configure bugmail: http://bugs.koha.org/cgi-bin/bugzilla3/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are watching all bug changes.