ISDIAH Region facet: Two-word terms are broken up into two seperate regions
|Assignee:||Jesús García Crespo||% Done:|
|Target version:||Release 2.1.0|
|Google Code Legacy ID:||Tested version:|
- Create a new repository or edit an existing one
- In the contact area, edit the contact information - in the Physical location tab of the Contact modal, enter "British Columbia"
- Click submit on the contact modal, and save the record
- Navigate to Browse > Archival institutions and look at the Region facet filter results.
2 filters appear as "british" and "columbia"
1 filter for "british columbia" appears
This will be important to fix before any BC data is imported into 2.x! Or anywhere else with a 2 part region name.
#2 Updated by Tim Hutchinson over 8 years ago
Related to this, is it possible to retain the capitalization e.g. British Columbia rather than british columbia? This seems to be the only facet where that happens, but your use of "british columbia" in the issue description suggests that maybe there's a reason for that...
#4 Updated by Tim Hutchinson over 8 years ago
I'm guessing the two issues are related. This reminded me of a former database where capitalization got stripped out in the index - but in this case it seems to be behaving like a keyword index. I also noticed that a hyphenated name gets split into two entries.
#6 Updated by Jesús García Crespo over 8 years ago
- Status changed from New to QA/Review
Fixed in 74fe1c7bb39ac759c9cbbb3591b059846c157090. Capitalization is also respected now. Pretty easy fix, we were running the facet against the analyzed version of the field in ElasticSearch. But the non_analyzed version is the one just containing "British Columbia". FYI: we also use non_analyzed fields for sorting and things like that, that's why we keep them.