[Koha-bugs] [Bug 2629] Diacritics not being ignored when searching

bugzilla-daemon at kohaorg.ec2.liblime.com bugzilla-daemon at kohaorg.ec2.liblime.com
Thu Sep 24 17:14:16 CEST 2009


http://bugs.koha.org/cgi-bin/bugzilla3/show_bug.cgi?id=2629


Brian Harrington <brian at jhu.edu> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |brian at jhu.edu




--- Comment #3 from Brian Harrington <brian at jhu.edu>  2009-09-24 15:14:16 ---
Should we switch default.idx over to icuchain indexing to take advantage of
ICU's Unicode normaliztion?  I think that the yaz/zebra issues with it have
been fixed.  The following word.xml is working for me, but could use more
non-diacritic special characters.

<icu_chain locale="en">
        <transform rule="[:Control:] Any-Remove" />
        <casemap rule="l" />
        <tokenize rule="w" />
        <transliterate rule="\- > ' '" />
        <transform rule="NFKD; [[:WhiteSpace:][:Punctuation:][:Diacritic:]]
Remove" />
        <transliterate rule="ø > o"/>
        <transliterate rule="ð > th"/>
        <transliterate rule="ʒ > z"/>
        <transliterate rule="ȝ > g"/>
        <transliterate rule="æ > ae"/>
        <transliterate rule="œ > oe"/>
        <transliterate rule="þ > th"/>
        <display />
</icu_chain>


-- 
Configure bugmail: http://bugs.koha.org/cgi-bin/bugzilla3/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are watching all bug changes.


More information about the Koha-bugs mailing list