[Koha-bugs] [Bug 28781] New: Elasticsearch facets may be inaccurate

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Thu Jul 29 15:21:46 CEST 2021


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=28781

            Bug ID: 28781
           Summary: Elasticsearch facets may be inaccurate
 Change sponsored?: ---
           Product: Koha
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: normal
          Priority: P5 - low
         Component: Searching - Elasticsearch
          Assignee: koha-bugs at lists.koha-community.org
          Reporter: nick at bywatersolutions.com

To recreate:
1 - Set system preferences:
    SearchEngine: Elasticsearch
    FacetMaxCount: 20
    displayFacetCount: Show
2 - Using koha testing docker default data search for '*'
3 - Note the author facets like
 Beckett, Samuel (5)
 Aristophanes (3)
 De Niese, Danielle (3)
 Euripides (3)
 Aristotle (2) 
4 - Set FacetMaxCount:400
5 - Repeat search
6 - Note facets like:
 Beckett, Samuel (5)
 Stevens, W. Richard (5)
 Plato (4)
 Aristophanes (3)
 De Niese, Danielle (3) 
7 - Some top terms are missing when FacetMaxCount is passed to ES as 'size'

>From the docs:
The size parameter can be set to define how many term buckets should be
returned out of the overall terms list. By default, the node coordinating the
search process will request each shard to provide its own top size term buckets
and once all shards respond, it will reduce the results to the final list that
will then be returned to the client. This means that if the number of unique
terms is greater than size, the returned list is slightly off and not accurate
(it could be that the term counts are slightly off and it could even be that a
term that should have been in the top size buckets was not returned).
https://www.elastic.co/guide/en/elasticsearch/reference/6.8/search-aggregations-bucket-terms-aggregation.html

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.


More information about the Koha-bugs mailing list