[Koha-devel] Stemming and zebra
David Cook
dcook at prosentient.com.au
Wed Aug 27 01:29:53 CEST 2014
Hi Francois
Writing from a tablet so I will be brief.
If you’re not using queryparser, I believe there is a Perl module that does the stemming: Lingua::Stem::Snowball or something like that. It would be easy to add to the queryparser too, but I haven’t gotten to it yet.
Oh… now that I think about it… I think we mangle keywords before stemming them. I can’t remember why, but there was some reason for it. Sorry this message is so incoherent. I will investigate and type a better response when I have a proper keyboard and not just thumbs on a small screen.
David
From: koha-devel-bounces at lists.koha-community.org [mailto:koha-devel-bounces at lists.koha-community.org] On Behalf Of Francois Charbonnier
Sent: Wednesday, 27 August 2014 2:09 AM
To: koha-devel at lists.koha-community.org
Subject: [Koha-devel] Stemming and zebra
Hello,
I have tested the QueryStemming system preference on Koha 3.14 (my local installation) and I'm wondering: does Zebra just right-truncate the words, or is there an algorithm to find the stems?
I use ICU and I have enabled "QueryWeightFields". I don't have automatic truncation or fuzzy search on. I use these words for my tests:
* ski, skiing, skills
* fish, fished, fishing, fisher, fishxsdfe
Each time, with QueryStemming on, "skills" and "fishxsdfe" come out in the search results. Is that what I should expect? "Skills", maybe, but "fishxsdfe"?
Do you know how it works, or do you have a good example that would help me understand?
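To make the two possibilities concrete, here is a small Python sketch of what I mean. It is only an illustration under my own assumptions: `toy_stem`, `match_by_truncation` and `match_by_stem` are hypothetical names, the suffix list is far cruder than a real stemming algorithm, and none of this is Zebra's or Koha's actual code.

```python
# Toy illustration only: neither function is Zebra's or Koha's actual code.

# Hypothetical suffix list, far simpler than a real stemming algorithm.
SUFFIXES = ("ing", "ed", "er", "s")


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def match_by_truncation(query, indexed_words):
    """Right truncation: any indexed word that starts with the query matches."""
    return [w for w in indexed_words if w.startswith(query)]


def match_by_stem(query, indexed_words):
    """Stem comparison: a word matches only if its stem equals the query's stem."""
    target = toy_stem(query)
    return [w for w in indexed_words if toy_stem(w) == target]


fish_words = ["fish", "fished", "fishing", "fisher", "fishxsdfe"]
ski_words = ["ski", "skiing", "skills"]

print(match_by_truncation("fish", fish_words))  # all five words, including "fishxsdfe"
print(match_by_stem("fish", fish_words))        # "fishxsdfe" is left out
print(match_by_truncation("ski", ski_words))    # all three words, including "skills"
print(match_by_stem("ski", ski_words))          # "skills" is left out
```

My results look like the truncation case rather than the stem-comparison case, which is why I suspect the words are being right-truncated at some point.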
Thanks!
--
François Charbonnier,
Professional librarian / Product manager
Tel.: (888) 604-2627
francois.charbonnier at inLibro.com
inLibro | pour esprit libre | http://www.inLibro.com