[Koha-devel] Linking bibs to authorities

Ian Bays ian.bays at ptfs-europe.com
Thu Oct 17 19:06:50 CEST 2013


Hi Fridolyn,
Thank you for this.  I was beginning to wonder if nobody was interested 
in the question.

The bug you mention is a useful addition, but we achieved considerable 
success by correcting the contents of the default.idx 
(~/koha-dev/etc/zebradb/etc/default.idx) to use phrases-icu.xml for 
index p, and to ensure phrases-icu.xml and words-icu.xml have the 
correct stanzas for searching accent-blind.

With these both in place I believe we have a very good hit rate for 
linking bib to authorities as long as the authority terms have been 
de-duplicated.

I think Colin has submitted patches so that these are in place for all 
moving forwards, but it is good to know that others are using these 
features.

What we were seeing before was if the term in the bib was (say) "History 
of Art" it would only match on the first word so would match an 
authority of (say) "History Books".

All the best.

Ian
On 17/10/2013 16:53, Fridolyn SOMERS wrote:
> Hie,
>
> It is because in Zebra, exact search does not work.
> It can find a phrase (all words in same order) but not say if this 
> phrase fits the entire (sub)field. This is the purpose of PQF attribut 
> 6, but is does not work.
> So when linking bibs to authorities, linker will match more 
> authorities than needed. For example, in biblio record "History" will 
> match authorities with headings "History", "History of art", "History 
> of science", ...
>
> That is wy I created Bug 9072 : 
> http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9072
>
> Le 02/08/2013 17:32, Ian Bays a écrit :
>> Hi.
>> I mentioned we also have a problem with linking bibs to authorities.
>>
>> We use authorities and want to link them to bibs.  Over previous
>> versions of Koha this has sometimes worked quite well.  However on a few
>> systems we have at version 3.12 it seems that the Zebra search into the
>> authorities (used by link_bib_to_authorities) is returning all terms
>> that contain any of the words in the bib field.
>>
>> So if the linkage is set to "default" then most authority searches will
>> return multiple answers which means they do not link.
>> If the linkage syspref is set to first or last then most of the bibs
>> will link to the wrong authority as it is pot luck which will be first
>> (or last) of the many authorities found by searching all the words ORed
>> together.
>> The only terms that will link are single-word authorities that are not
>> used in any other authorities.
>>
>> If anyone has any pointers or knows how to get round or over this
>> problem I would love to hear from you.  More details on request...
>>
>> We are using Zebra dom indexing and icu chains and are using Koha 3.12
>> on Debian.  We are not yet using QueryParser.  Sysprefs and Help > About
>> are here:
>>
>> Sysprefs are:
>>
>> +--------------------------+-------+
>> | variable                 | value |
>> +--------------------------+-------+
>> | IncludeSeeFromInSearches | 0     |
>> | QueryAutoTruncate        | 0     |
>> | QueryFuzzy               | 0     |
>> | QueryStemming            | 0     |
>> | QueryWeightFields        | 1     |
>> | TraceCompleteSubfields   | 0     |
>> | TraceSubjectSubdivisions | 0     |
>> | UseICU                   | 1     |
>> | UseQueryParser           | 0     |
>> +--------------------------+-------+
>>
>>
>> Koha> About:
>>
>> Koha version:     3.12.01.000
>> OS version ('uname -a'):     Linux dclg.koha.ptfsadmin.uk0.bigv.io
>> 3.2.0-4-amd64 #1 SMP Debian 3.2.46-1 x86_64 GNU/Linux
>> Perl interpreter:     /usr/bin/perl
>> Perl version:     5.014002
>> Perl @INC:     /home/koha/kohaclone
>> /etc/perl
>> /usr/local/lib/perl/5.14.2
>> /usr/local/share/perl/5.14.2
>> /usr/lib/perl5
>> /usr/share/perl5
>> /usr/lib/perl/5.14
>> /usr/share/perl/5.14
>> /usr/local/lib/site_perl
>> .
>> MySQL version:     mysql Ver 14.14 Distrib 5.5.31, for debian-linux-gnu
>> (x86_64) using readline 6.2
>> Apache version:     Server version: Apache/2.2.22 (Debian)
>> Zebra version:     Zebra 2.0.55 (C) 1994-2013, Index Data ApS Zebra is
>> free software, covered by the GNU General Public License, and you are
>> welcome to change it and/or distribute copies of it under certain
>> conditions. SHA1 ID: bd2bc9360225e695bbaba2c2d1cd6925c4eb23a5 Using ICU
>>
>> Many thanks folks.
>> Ian
>>
>>
>


-- 
Ian Bays
Director of Projects, PTFS Europe Limited
Content Management and Library Solutions
+44 (0) 800 756 6803 (phone)
+44 (0) 7774 995297 (mobile)
+44 (0) 800 756 6384 (fax)
skype: ian.bays
email: ian.bays at ptfs-europe.com



More information about the Koha-devel mailing list