[Koha-bugs] [Bug 26996] New: Elasticsearch: Multiprocess reindexing sometimes doesn't reindex all records

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Wed Nov 11 11:50:58 CET 2020


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=26996

            Bug ID: 26996
           Summary: Elasticsearch: Multiprocess reindexing sometimes
                    doesn't reindex all records
 Change sponsored?: ---
           Product: Koha
           Version: master
          Hardware: All
                OS: All
            Status: NEW
          Severity: minor
          Priority: P5 - low
         Component: Searching - Zebra
          Assignee: koha-bugs at lists.koha-community.org
          Reporter: bjorn.nylen at ub.lu.se

We've discovered that rebuild_elasticsearch.pl will not index all records in
some cases using multiple cpu's. Sometimes one (or more) child proc. will loop
through their records without ever committing them to ES.

Symptoms are that the process will work faster than others, not log "Committing
xxx records" and consume ever growing amount of memory. 

The problem appears to be in the fudging of commmit sizes for the childs. It
generates a float which will in some cases never be == 0 as the buffer counter
is decreased.

This set of param will fail for me everytime
./rebuild_elasticsearch -v -b -p 2 -c 400

Will uplaod a patch shortly.

-- 
You are receiving this mail because:
You are watching all bug changes.
You are the assignee for the bug.


More information about the Koha-bugs mailing list