[Koha-bugs] [Bug 9352] Zebra indexes useless subfields in UNIMARC 7XX

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Wed Aug 28 03:10:55 CEST 2013


http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9352

Bernardo Gonzalez Kriegel <bgkriegel at gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #17165|0                           |1
        is obsolete|                            |

--- Comment #14 from Bernardo Gonzalez Kriegel <bgkriegel at gmail.com> ---
Created attachment 20677
  -->
http://bugs.koha-community.org/bugzilla3/attachment.cgi?id=20677&action=edit
[SIGNED-OFF] Bug 9352 : More specific indexing of UNIMARC 7XX fields (GRS1)

1st patch : for GRS1 indexing
Problem :
Zebra indexes all subfields of UNIMARC 7XX fields in author index, including
$9, $3, $4 (function code), $f (dates of authors), $c (additions other than
dates), $p (address).
It causes Koha to give too much results.
For example, if an author is born in 1984 and is a teacher, searching "1984" or
"teacher" in simple search will return all work by this author if these pieces
of informations are in 7XX field. This is not how most ILS work, and it should
be corrected.

Solution :
This patch makes indexing of unimarc 7XX fields more specific. For each field,
useless subfields are not indexed.
70X : Do not index $f (dates),$c (additions other than dates),$p
(affiliation/address),$3,$4. Index $9 only in Koha-Auth-Number.
710-712 : Do not index $p (affiliation/address),$3,$4. Index $9 only in
Koha-Auth-Number. (I kept all other subfields : even if some may be useless I
am not sure of it)
716 : Do not index $f (dates),$c (additions other than dates),$3,$4. Index $9
only in Koha-Auth-Number.
72X : Do not index $f (dates),$3,$4. Index $9 only in Koha-Auth-Number.
730 : Do not index $4. Index $9 only in Koha-Auth-Number.
Additionnaly, this patch indexes 205$f/$g in Author index (author of the
edition of the work)
Testing :
a/ Create a record with :
700$a Doe $b John $f1950 $cteacher $4070
710$a Corporate $b Meeting $p Address
716$a Trademark $f1960 $cgreat $4340
720$a Family $f1980 $4651
205$a 1st edition $fBy some guy $gAnd other guys

b/ Before applying patch, make several searches :
Search in author index : "teacher", "great", "Address", "1950", "1960", "070",
"340", "651"
=> you will see the record among the results
Search in author index : "Doe", "John Doe", "Trademark", "Family","corporate",
"meeting"
=> you will see the record among the results
Search in author index : "guy", "guys"
=> you will see the record among the results with simple search (because 205 is
indexed in title)
=> you will not see the record among the results with author search

c/ Apply patch
d/ Rebuild Zebra with ./misc/migration_tools/rebuild_zebra.pl -b -x -v -reset

e/ Try the same searches as before :
Search in author index : "teacher", "great", "Address", "1950", "1960", "070",
"340", "651"
=> you will not see the record among the results
Search in author index : "Doe", "John Doe", "Trademark", "Family","corporate",
"meeting"
=> you will see the record among the results
Search in author index : "guy", "guys"
=> you will see the record among the results in both simple search and author
search

Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel at gmail.com>
Comment: Work as described with GRS1.
No koha-qa errors.

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list