<html>
<head>
<meta content="text/html; charset=windows-1252"
http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
<div class="moz-cite-prefix">Hi,<br>
Thanks Brooke for the support. I did read the rfc by the way but
it's for queryparser which I'm not using now. I wish I'll find
time to test it!<br>
Thanks David, I haven't read the whole zebra documention (ahah,
sounds crazy) but did some researches. I had the impression zebra
didn't manage stemming but it wasn't clear. So thank you for your
answer. It will definitely help me to understand!<br>
Thanks Mathieu for the link. I'll have a look. For sure!<br>
Have a good day ! :^)<br>
François<br>
<br>
<div class="moz-signature">
<style type="text/css">
.moz-signature {
color: #FFFFFF;
}
.sig_inlibro {
padding-top : 2px;
color: #888888;
font-family : "Trebuchet MS", verdana;
font-size: 90%;
}
.sig_content {
border-top: 2px solid #DDDDDD;
border-bottom: 2px solid #BFD13D;
background-color : #F6F6F6;
padding-left:10px;
}
.sig_inlibro a:visited, .sig_inlibro a:hover, .sig_inlibro a:link {
text-decoration: none;
color: #005B85;
}
.nom {
color: #005B85;
font-weight : bold;
}
.inlibro, .in {
color: #BFD13D;
}
.libro {
color: #005B85;
}
.in, .libro {
font-size : 120%;
}
.desc {
margin-bottom: 0;
padding-bottom: 5px;
}
.small {
font-size: 80%;
}
.tagline {
color : #00BCE4;
}
.sig_footer {
padding-left : 10px;
background-color : #EEEFEA;
}
</style>
<div class="sig_inlibro">
<div class="sig_content"> <span class="nom">François
Charbonnier,</span><br>
<span class="tagline small">Bibl. prof. / Chef de produits</span><br>
<p class="desc small"> Tél. : (888) 604-2627<br>
<a href="mailto:francois.charbonnier@inLibro.com">francois.charbonnier@inLibro.com</a>
</p>
</div>
<div class="sig_footer"> <span class="in">in</span><span
class="libro">Libro</span> <span class="tagline small">|
pour esprit libre |</span> <a class="small"
href="http://www.inLibro.com">www.inLibro.com</a> </div>
</div>
</div>
Le 2014-08-27 05:48, David Cook a écrit :<br>
</div>
<blockquote
cite="mid:014c01cfc1dc$025a9350$070fb9f0$@prosentient.com.au"
type="cite">
<meta http-equiv="Content-Type" content="text/html;
charset=windows-1252">
<meta name="Generator" content="Microsoft Word 15 (filtered
medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:"Trebuchet MS";
panose-1:2 11 6 3 2 2 2 2 2 4;}
@font-face
{font-family:"Trebuchet MS \, sans-serif";
panose-1:0 0 0 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:black;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p
{mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:black;}
pre
{mso-style-priority:99;
mso-style-link:"HTML Preformatted Char";
margin:0cm;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Courier New";
color:black;}
p.moz-signature, li.moz-signature, div.moz-signature
{mso-style-name:moz-signature;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:white;}
p.siginlibro, li.siginlibro, div.siginlibro
{mso-style-name:sig_inlibro;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:11.0pt;
font-family:"Trebuchet MS","sans-serif";
color:#888888;}
p.sigcontent, li.sigcontent, div.sigcontent
{mso-style-name:sig_content;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
background:#F6F6F6;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:black;}
p.nom, li.nom, div.nom
{mso-style-name:nom;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:#005B85;
font-weight:bold;}
p.inlibro, li.inlibro, div.inlibro
{mso-style-name:inlibro;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:#BFD13D;}
p.in, li.in, div.in
{mso-style-name:in;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:14.5pt;
font-family:"Times New Roman","serif";
color:#BFD13D;}
p.libro, li.libro, div.libro
{mso-style-name:libro;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:14.5pt;
font-family:"Times New Roman","serif";
color:#005B85;}
p.desc, li.desc, div.desc
{mso-style-name:desc;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
margin-bottom:0cm;
margin-left:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:black;}
p.small, li.small, div.small
{mso-style-name:small;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:9.5pt;
font-family:"Times New Roman","serif";
color:black;}
p.tagline, li.tagline, div.tagline
{mso-style-name:tagline;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:#00BCE4;}
p.sigfooter, li.sigfooter, div.sigfooter
{mso-style-name:sig_footer;
mso-style-priority:99;
mso-margin-top-alt:auto;
margin-right:0cm;
mso-margin-bottom-alt:auto;
margin-left:0cm;
background:#EEEFEA;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:black;}
span.nom1
{mso-style-name:nom1;
color:#005B85;
font-weight:bold;}
span.tagline1
{mso-style-name:tagline1;
color:#00BCE4;}
span.in1
{mso-style-name:in1;
color:#BFD13D;}
span.libro1
{mso-style-name:libro1;
color:#005B85;}
span.EmailStyle33
{mso-style-type:personal;
font-family:"Calibri","sans-serif";
color:windowtext;}
span.HTMLPreformattedChar
{mso-style-name:"HTML Preformatted Char";
mso-style-priority:99;
mso-style-link:"HTML Preformatted";
font-family:Consolas;
color:black;}
span.EmailStyle36
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
/* List Definitions */
@list l0
{mso-list-id:1335691936;
mso-list-template-ids:-1684797560;}
@list l0:level1
{mso-level-number-format:bullet;
mso-level-text:"&\#61623";
mso-level-tab-stop:36.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l0:level2
{mso-level-number-format:bullet;
mso-level-text:o;
mso-level-tab-stop:72.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:"Courier New";
mso-bidi-font-family:"Times New Roman";}
@list l0:level3
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:108.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level4
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:144.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level5
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:180.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level6
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:216.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level7
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:252.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level8
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:288.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l0:level9
{mso-level-number-format:bullet;
mso-level-text:"&\#61607";
mso-level-tab-stop:324.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Wingdings;}
@list l1
{mso-list-id:2053722613;
mso-list-template-ids:-1348158560;}
@list l1:level1
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:36.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level2
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:72.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level3
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:108.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level4
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:144.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level5
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:180.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level6
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:216.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level7
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:252.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level8
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:288.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
@list l1:level9
{mso-level-number-format:bullet;
mso-level-text:\F0B7;
mso-level-tab-stop:324.0pt;
mso-level-number-position:left;
text-indent:-18.0pt;
mso-ansi-font-size:10.0pt;
font-family:Symbol;}
ol
{margin-bottom:0cm;}
ul
{margin-bottom:0cm;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
<div class="WordSection1">
<p class="MsoNormal"><a moz-do-not-send="true"
name="_MailEndCompose"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">Hi
Mathieu:<o:p></o:p></span></a></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">I
think many of us think certain things happen in Zebra when
they actually happen in Koha before the query ever reaches
Zebra ;).<br>
<br>
As for stemming, theoretically the language obtained via
“C4::Templates::getlanguage($cgi, 'intranet');” should
filter down into the Snowball stemming. If it isn’t working
in French, it might be because the right locale isn’t being
passed to Snowball correctly. That’s very possible as I
think we’re using arbitrary language codes rather than
standard locales in some cases. It looks like there is a
fallback to English in C4::Templates::getlanguage() as well.
If it’s not working for French, it probably just needs a
tweak!<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">Yeah,
I first heard about Snowball when reading through Zebra
docs, and I was pleasantly surprised when I saw that
Lingua::Stem::Snowball existed as a Perl interface for the C
program.<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<div>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">
</span><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">David
Cook<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">Systems
Librarian<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">Prosentient
Systems<o:p></o:p></span></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">72/330
Wattle St, Ultimo, NSW 2007<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<div style="border:none;border-left:solid blue 1.5pt;padding:0cm
0cm 0cm 4.0pt">
<div>
<div style="border:none;border-top:solid #E1E1E1
1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext"
lang="EN-US">From:</span></b><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext"
lang="EN-US">
<a class="moz-txt-link-abbreviated" href="mailto:koha-devel-bounces@lists.koha-community.org">koha-devel-bounces@lists.koha-community.org</a>
[<a class="moz-txt-link-freetext" href="mailto:koha-devel-bounces@lists.koha-community.org">mailto:koha-devel-bounces@lists.koha-community.org</a>] <b>On
Behalf Of </b>Mathieu Saby<br>
<b>Sent:</b> Wednesday, 27 August 2014 7:30 PM<br>
<b>To:</b> <a class="moz-txt-link-abbreviated" href="mailto:koha-devel@lists.koha-community.org">koha-devel@lists.koha-community.org</a><br>
<b>Subject:</b> Re: [Koha-devel] Stemming and zebra<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal" style="margin-bottom:12.0pt">Hi<br>
<br>
I had always thought stemming was made by Zebra, and only in
english!<br>
<br>
In fact the algorithm for french language is here:<br>
<a moz-do-not-send="true"
href="http://snowball.tartarus.org/algorithms/french/stemmer.html">http://snowball.tartarus.org/algorithms/french/stemmer.html</a><br>
<br>
(Lingua::Stem::Snowball is a Perl interface to the C version
of the Snowball stemmers)<br>
<br>
<br>
Mathieu Saby<br>
<br>
<br>
<o:p></o:p></p>
<div>
<p class="MsoNormal">Le 27/08/2014 10:22, David Cook a
écrit :<o:p></o:p></p>
</div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">Hi
Francois:</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">I
wrote an email earlier on my tablet, but not 100% sure
if it got sent. In any case, I’m writing again now!<br>
<br>
You’ll want to look at
C4::Search::_build_stemmed_operand().</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">Zebra
doesn’t actually do any stemming itself. If you read
through the Zebra docs (if you’re masochistic), you’ll
notice that they say explicitly that Zebra doesn’t do
any stemming, but that you can do stemming (using a
stemmer like Snowball) while building a query. That’s
exactly what we do in Koha.</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">The
Perl module that does the stemming is
Lingua::Stem::Snowball. </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">However,
you might notice that your query’s operands aren’t
always stemmed properly. I haven’t looked in a while,
but I think it’s because we don’t build our queries very
well at all (when not using QueryParser). </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">If
you want to understand why you’re getting “skills” and
“fishxsdfe” in your results, I would suggest running
some tests ( using “Data::Dumper” and “warn” ) so that
you can see your query as it is built.</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">I
have a lot of work I want to do on
C4::Search::buildQuery, but just don’t have the time :/.</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US">Unfortunately,
at the moment, there is no stemming when using the
QueryParser. However, fortunately, using
Lingua::Stem::Snowball with QueryParser would be really
really easy. I think that I’ve written a note on how to
do that somewhere on Bugzilla or maybe on Trello…<br>
<br>
I hope that helps! Feel free to send me an email or
shout at me on IRC if you want any clarification. I know
I probably didn’t make it any clearer but hopefully this
might help you on your path to understanding.</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<div>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">David
Cook</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">Systems
Librarian</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">Prosentient
Systems</span><o:p></o:p></p>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext">72/330
Wattle St, Ultimo, NSW 2007</span><o:p></o:p></p>
</div>
<p class="MsoNormal"><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext;mso-fareast-language:EN-US"> </span><o:p></o:p></p>
<div style="border:none;border-left:solid blue
1.5pt;padding:0cm 0cm 0cm 4.0pt">
<div>
<div style="border:none;border-top:solid #E1E1E1
1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext"
lang="EN-US">From:</span></b><span
style="font-size:11.0pt;font-family:"Calibri","sans-serif";color:windowtext"
lang="EN-US"> <a moz-do-not-send="true"
href="mailto:koha-devel-bounces@lists.koha-community.org">koha-devel-bounces@lists.koha-community.org</a>
[<a moz-do-not-send="true"
href="mailto:koha-devel-bounces@lists.koha-community.org">mailto:koha-devel-bounces@lists.koha-community.org</a>]
<b>On Behalf Of </b>Francois Charbonnier<br>
<b>Sent:</b> Wednesday, 27 August 2014 2:09 AM<br>
<b>To:</b> <a moz-do-not-send="true"
href="mailto:koha-devel@lists.koha-community.org">koha-devel@lists.koha-community.org</a><br>
<b>Subject:</b> [Koha-devel] Stemming and zebra</span><o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"> <o:p></o:p></p>
<p class="MsoNormal">Hello,<br>
<br>
I have tested the QueryStemming system preference on
Koha 3.14 (my local installation) and I'm wondering,
does zebra just right truncate the words or is there an
algorithm to find the stems?<br>
<br>
I use ICU and I have enabled "QueryWeightFields". I
don't have automatic truncation or fuzzy search on. I
use these words for my tests:<o:p></o:p></p>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;margin-left:36.0pt;text-indent:-18.0pt;mso-list:l0
level1 lfo3"><!--[if !supportLists]--><span
style="font-size:10.0pt;font-family:Symbol"><span
style="mso-list:Ignore"><span
style="font:7.0pt "Times New Roman"">
</span></span></span><!--[endif]-->ski, skiing,
skills<o:p></o:p></p>
<p class="MsoNormal"
style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto;margin-left:36.0pt;text-indent:-18.0pt;mso-list:l0
level1 lfo3"><!--[if !supportLists]--><span
style="font-size:10.0pt;font-family:Symbol"><span
style="mso-list:Ignore"><span
style="font:7.0pt "Times New Roman"">
</span></span></span><!--[endif]-->fish, fished,
fishing, fisher, fishxsdfe<o:p></o:p></p>
<p>Each time, with QueryStemming on, skills and fishxsdfe
come out in the search results. Is it what I should
expect? "Skills", maybe but "fishxsdfe"?<o:p></o:p></p>
<p>Do you know how it works? or have a good example that
would help me to understand?<o:p></o:p></p>
<p>Thanks!<o:p></o:p></p>
<div>
<p class="MsoNormal">-- <o:p></o:p></p>
<div>
<div style="border-top:solid #DDDDDD
1.5pt;border-left:none;border-bottom:solid #BFD13D
1.5pt;border-right:none;padding:0cm 0cm 0cm 0cm">
<p class="MsoNormal" style="background:#F6F6F6"><span
class="nom1"><span
style="font-size:11.0pt;font-family:"Trebuchet
MS , sans-serif","serif"">François
Charbonnier,</span></span><span
style="font-size:11.0pt"><br>
</span><span class="tagline1"><span
style="font-size:11.0pt;font-family:"Trebuchet
MS , sans-serif","serif"">Bibl.
prof. / Chef de produits</span></span><o:p></o:p></p>
<p class="desc" style="background:#F6F6F6"><span
style="font-size:11.0pt">Tél. : (888) 604-2627<br>
<a moz-do-not-send="true"
href="mailto:francois.charbonnier@inLibro.com"><span
style="color:#005B85;text-decoration:none">francois.charbonnier@inLibro.com</span></a>
</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal" style="background:#EEEFEA"><span
class="in1"><span
style="font-size:13.0pt;font-family:"Trebuchet
MS , sans-serif","serif"">in</span></span><span
class="libro1"><span
style="font-size:13.0pt;font-family:"Trebuchet
MS , sans-serif","serif"">Libro</span></span><span
style="font-size:11.0pt"> </span><span
class="tagline1"><span
style="font-size:11.0pt;font-family:"Trebuchet
MS , sans-serif","serif"">|
pour esprit libre |</span></span><span
style="font-size:11.0pt"> <a
moz-do-not-send="true"
href="http://www.inLibro.com"><span
style="font-size:9.0pt;color:#005B85;text-decoration:none">www.inLibro.com</span></a>
</span><o:p></o:p></p>
</div>
</div>
</div>
</div>
<p class="MsoNormal"><br>
<br>
<br>
<o:p></o:p></p>
<pre>_______________________________________________<o:p></o:p></pre>
<pre>Koha-devel mailing list<o:p></o:p></pre>
<pre><a moz-do-not-send="true" href="mailto:Koha-devel@lists.koha-community.org">Koha-devel@lists.koha-community.org</a><o:p></o:p></pre>
<pre><a moz-do-not-send="true" href="http://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel">http://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel</a><o:p></o:p></pre>
<pre>website : <a moz-do-not-send="true" href="http://www.koha-community.org/">http://www.koha-community.org/</a><o:p></o:p></pre>
<pre>git : <a moz-do-not-send="true" href="http://git.koha-community.org/">http://git.koha-community.org/</a><o:p></o:p></pre>
<pre>bugs : <a moz-do-not-send="true" href="http://bugs.koha-community.org/">http://bugs.koha-community.org/</a><o:p></o:p></pre>
</blockquote>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
Koha-devel mailing list
<a class="moz-txt-link-abbreviated" href="mailto:Koha-devel@lists.koha-community.org">Koha-devel@lists.koha-community.org</a>
<a class="moz-txt-link-freetext" href="http://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel">http://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-devel</a>
website : <a class="moz-txt-link-freetext" href="http://www.koha-community.org/">http://www.koha-community.org/</a>
git : <a class="moz-txt-link-freetext" href="http://git.koha-community.org/">http://git.koha-community.org/</a>
bugs : <a class="moz-txt-link-freetext" href="http://bugs.koha-community.org/">http://bugs.koha-community.org/</a></pre>
</blockquote>
<br>
</body>
</html>