[Koha-bugs] [Bug 10662] Build OAI-PMH Harvesting Client
bugzilla-daemon at bugs.koha-community.org
bugzilla-daemon at bugs.koha-community.org
Fri Nov 2 05:16:39 CET 2018
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=10662
--- Comment #233 from David Cook <dcook at prosentient.com.au> ---
(In reply to Josef Moravec from comment #209)
> Created attachment 81786 [details]
> Encoding problem + datatable
>
> In submitted requests table I encountered a encoding problem:
>
> "Knihovna ÃstÃ" should be "Knihovna Ústí".
>
> Also, the heading of datatable is usually formatted in one line, see patron
> circulation history for example.
I am stumped by this one.
I've added a Encode::Decode("UTF-8",$json_message") to the client used by the
web page, and that gets the characters to render as Knihovna Ústí on the web
page... but when I look at the actual hex code in the variable... it's not
valid UTF8. It's Windows 1252/Latin-1.
The hex is 4b6e69686f766e6120da7374ed, and the internal representation in Perl
is PV = 0xa6bce80 "Knihovna \303\232st\303\255"\0 [UTF8 "Knihovna
\x{da}st\x{ed}"].
So DA and ED are Unicode code points that match Ú and í
(https://www.utf8-chartable.de/unicode-utf8-table.pl).
However, DA and ED are also the Latin-1 hex for Ú and í
(https://en.wikipedia.org/wiki/Windows-1252)(https://en.wikipedia.org/wiki/ISO/IEC_8859-1).
I think maybe I need to try with some Chinese characters that don't exist in
Latin-1...
--
You are receiving this mail because:
You are watching all bug changes.
More information about the Koha-bugs
mailing list