[Koha-bugs] [Bug 13665] Retrieve facets from zebra is slow

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Tue May 24 01:20:04 CEST 2022


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=13665

--- Comment #8 from David Cook <dcook at prosentient.com.au> ---
(In reply to Martin Renvoize from comment #2)
> Have we reported this upstream at all?

I reported something in this area yesterday:
https://github.com/indexdata/idzebra/issues/35

As Hugo notes, stop words seem to cause big problems, although it doesn't seem
tied to the result set but rather the initial records fetched and filtered. 

I had 2 nearly identical queries on a 1,000,000+ record database. The first
search returned instantly and its 1st facet returned in 2 seconds. The second
search (including "the") returned after 30 seconds and its 1st facet returned
after 60 seconds. Both had result sets of 3323-3325 records. Since "the" had
matched over 1,000,000 records, it seemed clear that the facet calculation
involved more than just the result set (although my initial skimming through
the Zebra C code couldn't confirm that).

I'm going to see if I can compile Zebra and figure out what's going on there,
so that I can either send a pull request or at least put Indexdata onto the
root cause and rough solution. 

But I'm also not going to commit heaps of time to this. It's fresh in my mind,
so now is a good time to do it, but we'll see...

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list