[Koha-bugs] [Bug 11202] Improve UNIMARC biblio indexing

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Fri Nov 22 22:08:12 CET 2013


http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=11202

mathieu saby <mathieu.saby at univ-rennes2.fr> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
  Attachment #22750|0                           |1
        is obsolete|                            |
  Attachment #22973|0                           |1
        is obsolete|                            |
  Attachment #23041|0                           |1
        is obsolete|                            |

--- Comment #36 from mathieu saby <mathieu.saby at univ-rennes2.fr> ---
Created attachment 23108
  -->
http://bugs.koha-community.org/bugzilla3/attachment.cgi?id=23108&action=edit
[PATCH] Bug 11202 - Improve UNIMARC biblio indexing

All previous patches (non signed off) squashed, with little changes
Test plan updated

This patch makes the following changes to UNIMARC biblio indexing :
A. Changes to record.abs and biblio-koha*/biblio-zebra* conf files
1. Changes with no expected functional effects
* add comments to biblio-koha-indexdefs.xml
* make biblio-koha-indexdefs.xml more compact by grouping some declarations
Ex : 200$f and 200$g => one declaration for 200$fg
* reorganizes definitions of 5XX fields
* suppress unneeded declarations (indexing of some 4XX fields and 6XX fields
not in UNIMARC format)
* change the way 308 field is indexed

2. Changes with functional effects
* index new fields added in UNIMARC format in 2012 to make it more FRBR
compliant : 506, 507, 576, 577.
These fields are experimental in most UNIMARC countries. They are not currently
used in France but will probably be in some months or years.
* unindex some (sub)fields unneeded by users (207,230,215,4XXd)
* index 208 in Material-Type -- see bug 11119
* index 100 pos 8-9 and 9-12 in pubdate:y and pubdate:n
* index 100 pos 8-9 in pubdate:s instead of 210$d
* Index all subfields of note 334 and 327 in note index
* Index 304 and 327 in title index as well as note index
327 can contain a list of titles included in a work
304 can contain the title of the original work in case of a translation
* Index 314 in author index as well as note index
314 can contain authors not mentionned in 200$f/g (the 4th, 5th etc author)
* Index 328 note in Dissertation-information as well as note
* Index 328$t in Title
* replace Title-host with Host-item -- see bug 11119

B. Changes to ccl.properties :
1. add a new index Dissertation-information (1056)
2. fix EAN, pubdate and acqdate (they were not linked with bib1 attributes)

C. Changes to Search.pm
1. add Dissertation-information and suppress Title-host and UPC

D. Changes to QP config file queryparser.yaml
1. add Dissertation-information
2 fix EAN, pubdate and acqdate

Test plan :
If you cannot test in GRS1, test only in DOM, as GRS will be deprecated...

1. Prepare Koha :
* Apply the patch in a UNIMARC Koha running with DOM and ICU
* copy src/etc/searchengine/queryparser.yaml into the main config directory of
QP
* copy src/etc/zebradb/ccl.properties into the main config directory of Zebra
* copy src/etc/zebradb/marc_defs/unimarc/biblio/* into the main config
directory of Zebra
* reindex biblios (rebuild_zebra.pl -r -b -x -v)

2. test note index : make some searches on 334$b or 327$b
3. test author index : make some searches on 314 field
4. test title index : make some searches on 304 and 327 field, make a search on
328$t subfield
5. test Dissertation-information index : make some searches on 328 field
6. test 506,507,576,577 fields (as they are mostly experimental, and as index
definition is trivial, this test can be skipped)
7. In a record, put in the dates of 100 fields the values "1000" (1st date) and
"1001" (2d date) ; try to search a book written in year 1000, you should find
the record ; idem for year 1001
8. make some searches and sort by date. It should work better as before,
especially if you have values like "c2009" or "impr. 2010" in 210 field
9. Regression test : make some searches on several indexes, like EAN, etc. It
should work as before

Test 2-9 with and without Queryparser activated.

Of course, to test search and sort by dates, you need to have full records,
with dates in 100 field as well as 210 field.
To search by date, you need to use advanced search, click on "More" and use
date filter. Do not use "yr" index in simple search, it does not work. Do not
use "date" index in dropdown list, it does not work either.

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list