Uploaded image for project: 'RAML Module Builder'
  1. RAML Module Builder
  2. RMB-574

allow to normalize ISBNs in any field

    XMLWordPrintable

Details

    • Task
    • Status: Draft (View Workflow)
    • TBD
    • Resolution: Unresolved
    • None
    • None
    • CP: Roadmap backlog
    • Core: Platform

    Description

      Note This is a potential post-MVP feature.

      normalizeDigits (see RMB-499) only allows ISBN normalization on a dedicated ISBN index to avoid over-normalizing number that are only similar to ISBN. This ticket is about an extended feature that allows normalization across many fields (e.g in indexes like "keyword" and "identifers").

      Strategy: Use a different normalisation at index and query time:

       "fullTextIndex": [
              {
                "fieldName": "keyword",
                "multiFieldNames": "title,contributors[*].name,identifiers[*].value”,
                “normalizeLeadDigits” : “yes”  
              },
              {
                "fieldName": "title"
                                        “
              },
              {
                "fieldName": "languages"
              },
              {
                "fieldName": "contributors",
                "arraySubfield" : "name",
                "arrayModifiers": ["contributorNameTypeId"]
              },
              {
                "fieldName": "identifiers",
                "arraySubfield" : "value",
                "arrayModifiers": ["identifierTypeId"] 
                “normalizeLeadDigits” : “yes”  
              }
            ]
          },
      
      
      normalizeLeadDigits_index — keeps the original series of digits and additionally add a normalised token, it runs only on the first occurrence of the series of digits
                   “11-22 33 yyy” -> “ 11-22 33 112233 yyy”
                   “xxx 11-22 33 yyy” -> “ xxx 11-22 33 yyy”
      
      
      normalizeLeadDigits_query replaces the original series of digits with a normalised token, it runs only on the first occurrence of the series of digits
      
                  “11-22abc yyy” -> “1122 abc yyy”
                  “xxx 11-22abc yyy” -> “xxx 11-22abc yyy”
      

      selected where normDigits_index(field) @@ normDigits_query(query)

      fv “11-22 33abc” -> nv “11-22 33 112233 abc”
      q “112233abc” -> qn “112233 abc”
      q “11-22 33abc -> qn “112233 abc”

      TestRail: Results

        Attachments

          Issue Links

            Activity

              People

                Unassigned Unassigned
                jakub Jakub Skoczen
                Votes:
                0 Vote for this issue
                Watchers:
                1 Start watching this issue

                Dates

                  Created:
                  Updated:

                  TestRail: Runs

                    TestRail: Cases