Bug #5730

Occasional discrepancy between institutional facet count and actual results

Added by Tim Hutchinson over 8 years ago. Updated about 7 years ago.

Status:NewStart date:10/01/2013
Priority:MediumDue date:
Assignee:José Raddaoui Marín% Done:

0%

Category:Search / Browse
Target version:-
Google Code Legacy ID: Tested version:
Sponsored:No Requires documentation:

Description

It's not clear to me how to reproduce this, but both in our test environment and the AC beta site there are some discrepancies between the listed number of results in the institutional facet, and the actual result you get when you choose a particular institution.

E.g. in Archives Canada beta:
- browse by archival descriptions
- The Presbyterian Church in Canada shows 249 records
- click on The Presbyterian Church in Canada
- facet now lists 469 results, consistent with actual results

There are similar results if you first filter to fonds level or series level.

This does not seem to be a culture issue, since the number of results shown is the same in English and French.

Probably a fluke, but the incorrect results all seem to be in the last two facet positions (but conversely they are not always wrong).

institution-facet.png - Prior to applying the facet (26 KB) Dan Gillean, 10/01/2013 02:17 PM

institution-facet-filtered.png - Results of applying the facet (78.2 KB) Dan Gillean, 10/01/2013 02:17 PM

History

#1 Updated by Dan Gillean over 8 years ago

Confirmed in 2x. See attached screenshots

#2 Updated by Jesús García Crespo over 8 years ago

  • Assignee changed from Jesús García Crespo to José Raddaoui Marín

#3 Updated by Jesús García Crespo over 8 years ago

Tim, we'll take a look. Thanks!

#4 Updated by José Raddaoui Marín over 8 years ago

  • Status changed from New to In progress
  • Target version changed from Release 2.0.0 to Release 2.1.0

Hi,

This is a known Elasticsearch bug. Where making a query with facets over an index with more than one shard, if there is more results for a facet than the specified in the size parameter, the count value for the last terms of the facet is not accurate.

https://github.com/elasticsearch/elasticsearch/issues/1305

As there isn't a proper workaround for it, we'll review it in the 2.1 target.

#5 Updated by Jesús García Crespo over 8 years ago

  • Status changed from In progress to New

#6 Updated by Jesús García Crespo over 7 years ago

  • Target version changed from Release 2.1.0 to Release 2.2.0

#7 Updated by Sarah Romkey about 7 years ago

  • Target version deleted (Release 2.2.0)

Also available in: Atom PDF