Details
-
Bug
-
Status: Closed (View Workflow)
-
P3
-
Resolution: Done
-
None
-
None
-
Prokopovych
-
Cornell, Duke
Description
Overview: Searching for title in keyword does not work when vernacular title appears in the paired 880 (245) field.
Steps to Reproduce:
- Log into some FOLIO Snapshot as admin
- Ensure that Settings/Inventory/Z39.50 Target Profiles – OCLC WorldCat – Authentication is set to 100473910/PAOLF
- Open Inventory
- Use "Actions/Import" five times using these OCLC numbers: 85202630, 43564886, 1091030365 1153193551, 1089848331
- Search for a title in Russian, Chinese, Japanese, or Hebrew using the actual title in it's script (see example record's 880 $6 245 in Title (all). Record is displayed in results list
- Searching the transliterated title (in 245 $a) without ligature and other diacritics brings back no results in some cases. When I let the system replace ligature with none, by backspacing between the characters, results were as expected. However, when I removed other diacritics by hand, because the system has no replacement character, no results displayed
- These data are not normalized for indexing, which is an industry standard. There are results in neither Keyword nor Titles (all)
- Staff will not usually include diacritics when searching due to keyboard limitations.
Expected Results: Searching by title without including diacritics should result in finding the record.
Actual Results: Searching a Latin-based alphabet where a title includes diacritics, such as Portuguese, omitting said diacritics showed that these are normalized in the index (as expected by industry standard)
Additional Information:
These searches seem to work as expected when testing using the Inventory ES app.
As of 3/25/2021, these examples were in Snapshot Inventory: HRID in00000000082, HRID in00000000083, HRID in00000000085, HRID in00000000088, HRID in00000000090 because I successfully used the single record import function. Here are are the associated OCLC system numbers used in these tests:
Language Russian (no Cyrrilic) | https://www.worldcat.org/oclc/85202630 | Leti︠a︡shchai︠a︡ tufelʹka, ili, golyĭ nasmeshnik : sibirskie skazki i misticheskie bylichki |
Language Russian (includes Cyrillic) | https://www.worldcat.org/oclc/43564886 | Народные русские сказки А.Н. Афанасьева : в пяти томах. |
Language Hebrew (includes Hebrew alphabet) | https://www.worldcat.org/oclc/1091030365 | כך להישאר לעולם = Forever this way |
Language: Japanese translation of Chinese test (includes kanji and hanzi) | https://www.worldcat.org/oclc/1153193551 | 唐人如何吟诗 : 带你走进汉语音韵学 |
Language: Portuguese (Latin-based alphabet with diacritics) | https://www.worldcat.org/oclc/1089848331 | Fernão de Magalhães : um agente secreto ao serviço do rei D. Manuel I de Portugal? |
Interested parties: Everyone
TestRail: Results
Attachments
Issue Links
- relates to
-
MODINVSTOR-490 Combining characters (diacritics, umlauts) not found
-
- Closed
-
-
RMB-605 Combining characters (diacritics, umlauts) not found
-
- Closed
-
-
UXPROD-1468 Management of diacritics characters
-
- Closed
-
-
MSEARCH-305 BE- Inventory. Keyword search should use normalization of Resource titles and Contributor names
-
- Closed
-
-
MSEARCH-362 BE: Inventory. Keyword search should use normalization of Resource titles and Contributor names
-
- Draft
-