[Koha-bugs] [Bug 9352] Zebra indexes useless subfields in UNIMARC 7XX

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Fri Sep 5 10:49:24 CEST 2014


http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=9352

Martin Renvoize <martin.renvoize at ptfs-europe.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #20677|0                           |1
        is obsolete|                            |

--- Comment #52 from Martin Renvoize <martin.renvoize at ptfs-europe.com> ---
Created attachment 31385
  -->
http://bugs.koha-community.org/bugzilla3/attachment.cgi?id=31385&action=edit
[SIGNED OFF] Bug 9352 : More specific indexing of UNIMARC 7XX fields (GRS1)

1st patch : for GRS1 indexing
Problem :
Zebra indexes all subfields of UNIMARC 7XX fields in author index, including
$9, $3, $4 (function code), $f (dates of authors), $c (additions other than
dates), $p (address).
This causes Koha to return too many results.
For example, if an author was born in 1984 and is a teacher, searching "1984" or
"teacher" in simple search will return all works by this author if these pieces
of information are in the 7XX field. This is not how most ILSs work, and it
should be corrected.

Solution :
This patch makes indexing of UNIMARC 7XX fields more specific. For each field,
subfields that are useless for searching are no longer indexed.
70X : Do not index $f (dates), $c (additions other than dates), $p
(affiliation/address), $3, $4. Index $9 only in Koha-Auth-Number.
710-712 : Do not index $p (affiliation/address), $3, $4. Index $9 only in
Koha-Auth-Number. (I kept all other subfields : some of them may be useless,
but I am not sure.)
716 : Do not index $f (dates), $c (additions other than dates), $3, $4. Index
$9 only in Koha-Auth-Number.
72X : Do not index $f (dates), $3, $4. Index $9 only in Koha-Auth-Number.
730 : Do not index $4. Index $9 only in Koha-Auth-Number.
Additionally, this patch indexes 205$f/$g in the Author index (author of the
edition of the work).
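
The rules above can be modelled as a small lookup table. The following is only
an illustrative Python sketch, not part of the patch: the real change is made
in the Zebra GRS1 configuration. All names here (EXCLUDED, INCLUDED_205,
rule_key, author_terms) are hypothetical, and "70X" / "72X" are assumed to mean
any tag starting with 70 or 72. The sample data reuses the record from the test
plan in this message.

```python
# Illustrative model only: the real patch edits Zebra GRS1 configuration.
# Every name in this sketch is hypothetical.

# Subfields kept OUT of the Author index for each field group.
# $9 appears in every set because it is indexed only in Koha-Auth-Number.
EXCLUDED = {
    "70X": {"f", "c", "p", "3", "4", "9"},
    "710-712": {"p", "3", "4", "9"},
    "716": {"f", "c", "3", "4", "9"},
    "72X": {"f", "3", "4", "9"},
    "730": {"4", "9"},
}

# 205 is the opposite case: only $f and $g are added to the Author index.
INCLUDED_205 = {"f", "g"}


def rule_key(tag):
    """Map a concrete tag such as '701' to its rule group, or None."""
    if tag.startswith("70"):  # assumption: 70X means any 70x tag
        return "70X"
    if tag in ("710", "711", "712"):
        return "710-712"
    if tag == "716":
        return "716"
    if tag.startswith("72"):  # assumption: 72X means any 72x tag
        return "72X"
    if tag == "730":
        return "730"
    return None


def author_terms(tag, subfields):
    """Return the subfield values that would reach the Author index.

    subfields is a list of (code, value) pairs.
    """
    if tag == "205":
        return [value for code, value in subfields if code in INCLUDED_205]
    key = rule_key(tag)
    if key is None:
        return []
    return [value for code, value in subfields if code not in EXCLUDED[key]]


# The record from the test plan, as (tag, subfields) pairs.
record = [
    ("700", [("a", "Doe"), ("b", "John"), ("f", "1950"),
             ("c", "teacher"), ("4", "070")]),
    ("710", [("a", "Corporate"), ("b", "Meeting"), ("p", "Address")]),
    ("716", [("a", "Trademark"), ("f", "1960"), ("c", "great"), ("4", "340")]),
    ("720", [("a", "Family"), ("f", "1980"), ("4", "651")]),
    ("205", [("a", "1st edition"), ("f", "By some guy"),
             ("g", "And other guys")]),
]

indexed = [term for tag, sf in record for term in author_terms(tag, sf)]
print(indexed)
```

In this model, dates, function codes, "teacher", "great" and "Address" are
dropped, while names and the 205$f/$g statements are kept, which matches the
results expected in step e/ of the test plan.
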
Testing :
a/ Create a record with :
700$a Doe $b John $f1950 $cteacher $4070
710$a Corporate $b Meeting $p Address
716$a Trademark $f1960 $cgreat $4340
720$a Family $f1980 $4651
205$a 1st edition $fBy some guy $gAnd other guys

b/ Before applying patch, make several searches :
Search in author index : "teacher", "great", "Address", "1950", "1960", "070",
"340", "651"
=> you will see the record among the results
Search in author index : "Doe", "John Doe", "Trademark", "Family", "corporate",
"meeting"
=> you will see the record among the results
Search in author index : "guy", "guys"
=> you will see the record among the results with simple search (because 205 is
indexed in title)
=> you will not see the record among the results with author search

c/ Apply patch
d/ Rebuild Zebra with ./misc/migration_tools/rebuild_zebra.pl -b -x -v -reset

e/ Try the same searches as before :
Search in author index : "teacher", "great", "Address", "1950", "1960", "070",
"340", "651"
=> you will not see the record among the results
Search in author index : "Doe", "John Doe", "Trademark", "Family", "corporate",
"meeting"
=> you will see the record among the results
Search in author index : "guy", "guys"
=> you will see the record among the results in both simple search and author
search

Signed-off-by: Bernardo Gonzalez Kriegel <bgkriegel at gmail.com>
Comment: Works as described with GRS1.
No koha-qa errors.

Signed-off-by: Martin Renvoize <martin.renvoize at ptfs-europe.com>


