Details
- Task
- Status: Draft
- TBD
- Resolution: Unresolved
- CP: Roadmap backlog
- Core: Platform
Description
Note: This is a potential post-MVP feature.
normalizeDigits (see RMB-499) only allows ISBN normalization on a dedicated ISBN index, to avoid over-normalizing numbers that are merely similar to ISBNs. This ticket covers an extended feature that allows normalization across many fields (e.g. in indexes like "keyword" and "identifiers").
Strategy: Use a different normalisation at index and query time:
"fullTextIndex": [
  {
    "fieldName": "keyword",
    "multiFieldNames": "title,contributors[*].name,identifiers[*].value",
    "normalizeLeadDigits": "yes"
  },
  { "fieldName": "title" },
  { "fieldName": "languages" },
  {
    "fieldName": "contributors",
    "arraySubfield": "name",
    "arrayModifiers": ["contributorNameTypeId"]
  },
  {
    "fieldName": "identifiers",
    "arraySubfield": "value",
    "arrayModifiers": ["identifierTypeId"],
    "normalizeLeadDigits": "yes"
  }
]
normalizeLeadDigits_index keeps the original series of digits and additionally adds a normalised token. It runs only on the first series of digits, and, as the second example shows, only when that series leads the value:
"11-22 33 yyy" -> "11-22 33 112233 yyy"
"xxx 11-22 33 yyy" -> "xxx 11-22 33 yyy"
normalizeLeadDigits_query replaces the original series of digits with a normalised token, under the same restriction:
"11-22abc yyy" -> "1122 abc yyy"
"xxx 11-22abc yyy" -> "xxx 11-22abc yyy"
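A record is selected where normDigits_index(field) @@ normDigits_query(query). In the examples, fv is the field value, nv its normalised form, q the query and qn the normalised query:
fv "11-22 33abc" -> nv "11-22 33 112233 abc"
q "112233abc" -> qn "112233 abc"
q "11-22 33abc" -> qn "112233 abc"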
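The strategy above can be sketched in Python to check the examples by hand. This is only an illustration, not the RMB implementation: the function names, the regular expression, and the assumption that a series of digits is made of digit groups joined by spaces or hyphens are all hypothetical. The @@ match (PostgreSQL's text-search match operator) is approximated here by plain token-set containment, which ignores stemming, stop words and other parser behaviour.

```python
import re

# Assumption: a "series of digits" is one or more digit groups joined by
# spaces or hyphens, and it only counts when it starts the value.
LEAD_DIGITS = re.compile(r"^\s*(\d+(?:[ -]+\d+)*)\s*(.*)$", re.DOTALL)


def split_lead_digits(text):
    """Return (original series, normalised token, rest), or None when the
    text does not start with a digit."""
    m = LEAD_DIGITS.match(text)
    if m is None:
        return None
    series, rest = m.groups()
    return series, re.sub(r"[ -]", "", series), rest


def normalize_lead_digits_index(value):
    """Index time: keep the original series and add the normalised token."""
    parts = split_lead_digits(value)
    if parts is None:
        return value  # no leading digits: leave the value untouched
    return " ".join(p for p in parts if p)


def normalize_lead_digits_query(query):
    """Query time: replace the original series with the normalised token."""
    parts = split_lead_digits(query)
    if parts is None:
        return query
    _, token, rest = parts
    return " ".join(p for p in (token, rest) if p)


def matches(field_value, query):
    """Rough stand-in for `@@`: every query token must be an indexed token."""
    indexed = set(normalize_lead_digits_index(field_value).split())
    wanted = set(normalize_lead_digits_query(query).split())
    return wanted <= indexed


print(normalize_lead_digits_index("11-22 33abc"))  # 11-22 33 112233 abc
print(normalize_lead_digits_query("11-22 33abc"))  # 112233 abc
print(matches("11-22 33abc", "112233abc"))         # True
print(matches("xxx 11-22 33 yyy", "112233"))       # False: digits do not lead
```

The last call shows the trade-off of the lead-digits restriction: a number that is not at the start of the field value is never normalised at index time, so a normalised query cannot find it.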
Issue Links
relates to:
- MSEARCH-307 BE - Normalize ISBNs for ISBN searching. Search option: Identifier (all) in the Instance segment (Closed)
- UIIN-647 Normalize ISBNs for ISBN searching in Instance, holdings, item segments. New search option: ISBN, normalized (Closed)
- UIIN-998 Normalize ISBNs for ISBN searching. Search option: Keyword (Instance, Holdings, Item) (Closed)
- UIIN-999 Normalize ISBNs for ISBN searching. Search option: Identifier (all) in the Instance segment (Closed)
- RMB-499 Add "normalizeDigits" function (Closed)