Uploaded image for project: 'RAML Module Builder'
  1. RAML Module Builder
  2. RMB-761

UTF-8 encoded characters are decoded when posting to SRS

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed (View Workflow)
    • Priority: P2
    • Resolution: Cannot Reproduce
    • Affects Version/s: None
    • Fix Version/s: None
    • Labels:
      None
    • Environment:

      folio-testing, Goldenrod

    • Template:
    • Sprint:
      Folijet Sprint 101, Folijet Sprint 102
    • Story Points:
      3
    • Development Team:
      Core: Platform
    • Release:
      R1 2021
    • Confidence factor:
      Medium

      Description

      Overview:
      When posting a record to SRS via POST /source-storage/batch/records or POST /source-storage/records, special characters which are UTF-8 encoded (e.g. \u030b) in the request payload become decoded. This has implications for OAI-PMH as these characters will cause failures when mapping the marc json -> marc21 XML.

      Steps to Reproduce:

      1. Note that the attached record includes: "Original Scale 1 \u030b= 14 miles (copy reduced in size.)"
      2. Post the attached record to /source-storage/batch/records or /source-storage/records
      3. Note that the response has: "Original Scale 1 ̋= 14 miles (copy reduced in size.)"

      Expected Results:
      The encoded characters remain encoded

      Actual Results:
      The encoded characters are decoded

      Additional Information:
      I think this is a problem for marc4j which is used in mod-oai-pmh to map the json representation of the marc record into marc21 XML. An uncaught JAXB exception is thrown in such cases and the client never receives a response unless something else times the request out (e.g. proxy, load balancer, okapi, etc.)

      Interested parties:
      Theodor Tolstoy Sobha Duvvuri Magda Zacharska

        TestRail: Results

          Attachments

          1. Adlam.mrc
            0.7 kB
          2. db.PNG
            db.PNG
            44 kB
          3. encoded.json
            8 kB
          4. Encoded characters.mrc
            0.8 kB
          5. EncodedCharacters - Data Export.mrc
            0.9 kB
          6. EncodedCharacters - Data Import.PNG
            EncodedCharacters - Data Import.PNG
            26 kB
          7. EncodedExample - Instance view.PNG
            EncodedExample - Instance view.PNG
            33 kB
          8. postman.PNG
            postman.PNG
            45 kB
          9. SearchInstanceUUIDs2020-11-19T00 01 48-05 00-2.mrc
            1 kB

            Issue Links

              Activity

                People

                Assignee:
                Unassigned Unassigned
                Reporter:
                cmcnally Craig McNally
                Votes:
                0 Vote for this issue
                Watchers:
                5 Start watching this issue

                  Dates

                  Created:
                  Updated:
                  Resolved:

                    TestRail: Runs

                      TestRail: Cases