[Koha-bugs] [Bug 3144] anti-spam for opac-suggestions

bugzilla-daemon at liblime.com
Thu Jun 18 16:31:59 CEST 2009


http://bugs.koha.org/cgi-bin/bugzilla3/show_bug.cgi?id=3144

--- Comment #5 from MJR <mjr at ttllp.co.uk>  2009-06-18 14:31:58 ---
I asked about this and was told it's essentially a result of using
French-language data with one of the plugins that's trained with
English-language ham data:

"I train the Bayesian filter using results that are known-good or
 known-spam, but in both cases the training is with English text.

  I'd never considered the use of foreign language to be an issue,
 but I can easily imagine if the corpus is always trained with English
 that French wouldn't be seen, and is more likely to weigh in as spam.

  If this is a common occurrence then the caller should disable the
 bayasian plugin when they submit their comments for testing, see
 here for details of how to do that:

    http://blogspam.net/api/testComment.html

  (You'll want to disable the plugin called "bayasian".)"
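
For anyone wiring this into the suggestions form, here is a rough sketch of
what "disable the plugin" might look like from the caller's side. It is not
taken from Koha or from the BlogSpam code. It assumes the API is XML-RPC, that
testComment takes a single struct with "ip" and "comment" fields, and that
plugins are skipped through an "options" string of "exclude=<plugin>" entries.
Those details are my reading of the page linked above and should be checked
there. build_comment and submit_comment are hypothetical helper names, and the
server URL is left to the caller.

```python
import xmlrpc.client


def build_comment(ip, comment, exclude=("bayasian",)):
    """Build the struct passed to testComment.

    ASSUMPTION: the field names ("ip", "comment", "options") and the
    "exclude=<plugin>" option syntax follow the API page linked above;
    verify them there before relying on this.
    """
    # One "exclude=..." entry per plugin, joined with commas.
    options = ",".join(f"exclude={name}" for name in exclude)
    return {"ip": ip, "comment": comment, "options": options}


def submit_comment(server_url, payload):
    """Send the struct to the server and return its reply unchanged.

    ASSUMPTION: the remote method is called testComment, as the URL of
    the API page suggests. server_url must be supplied by the caller.
    """
    server = xmlrpc.client.ServerProxy(server_url)
    return server.testComment(payload)
```

Calling build_comment("192.0.2.1", "Je suggère ce livre.") gives a struct
whose options field is "exclude=bayasian"; passing extra names in exclude
skips further plugins in the same request.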

Alternatively, Francophones may like to run their own spam filter, trained with
French-language ham.  
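
To make "trained with French-language ham" concrete, here is a toy
word-frequency Bayesian classifier. It is only an illustration of the idea,
not how BlogSpam or any production filter is implemented; the TinyBayes class
and the sample sentences are made up for this example, and a real filter would
need a far larger corpus.

```python
import math
import re
from collections import Counter


class TinyBayes:
    """Toy naive Bayes text classifier with add-one smoothing."""

    LABELS = ("ham", "spam")

    def __init__(self):
        self.counts = {label: Counter() for label in self.LABELS}
        self.docs = {label: 0 for label in self.LABELS}

    @staticmethod
    def tokens(text):
        # \w+ is Unicode-aware for str patterns, so accented words stay whole.
        return re.findall(r"\w+", text.lower())

    def train(self, label, text):
        self.counts[label].update(self.tokens(text))
        self.docs[label] += 1

    def log_score(self, label, text):
        vocab = len(set(self.counts["ham"]) | set(self.counts["spam"]))
        total = sum(self.counts[label].values())
        # Smoothed class prior, so an untrained class never yields log(0).
        prior = (self.docs[label] + 1) / (sum(self.docs.values()) + 2)
        score = math.log(prior)
        for tok in self.tokens(text):
            score += math.log((self.counts[label][tok] + 1) / (total + vocab))
        return score

    def classify(self, text):
        return max(self.LABELS, key=lambda label: self.log_score(label, text))


# Train with French ham (hypothetical sample) and some English spam.
french_filter = TinyBayes()
french_filter.train("ham", "je suggère ce livre pour la bibliothèque")
french_filter.train("spam", "buy cheap pills online now")
result = french_filter.classify("un livre pour la bibliothèque")
```

With this training data the French suggestion comes out as "ham", because
most of its words were seen in the ham corpus.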

(By the way, did you know that the documents used for reCAPTCHA are all taken
from US sources at the moment?  I hear it's great fun for UK/US and NZ/US
spelling differences.)

