[Koha-bugs] [Bug 10662] Build OAI-PMH Harvesting Client

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Fri Nov 2 05:16:39 CET 2018


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=10662

--- Comment #233 from David Cook <dcook at prosentient.com.au> ---
(In reply to Josef Moravec from comment #209)
> Created attachment 81786 [details]
> Encoding problem + datatable
> 
> In submitted requests table I encountered a encoding problem:
> 
> "Knihovna Ãstí" should be "Knihovna Ústí".
> 
> Also, the heading of datatable is usually formatted in one line, see patron
> circulation history for example.

I am stumped by this one. 

I've added a Encode::Decode("UTF-8",$json_message") to the client used by the
web page, and that gets the characters to render as Knihovna Ústí on the web
page... but when I look at the actual hex code in the variable... it's not
valid UTF8. It's Windows 1252/Latin-1. 

The hex is 4b6e69686f766e6120da7374ed, and the internal representation in Perl
is PV = 0xa6bce80 "Knihovna \303\232st\303\255"\0 [UTF8 "Knihovna
\x{da}st\x{ed}"].

So DA and ED are Unicode code points that match Ú and í
(https://www.utf8-chartable.de/unicode-utf8-table.pl).

However, DA and ED are also the Latin-1 hex for Ú and í
(https://en.wikipedia.org/wiki/Windows-1252)(https://en.wikipedia.org/wiki/ISO/IEC_8859-1).

I think maybe I need to try with some Chinese characters that don't exist in
Latin-1...

-- 
You are receiving this mail because:
You are watching all bug changes.


More information about the Koha-bugs mailing list