[Koha-bugs] [Bug 1634] Incorrect encoding convertion in MARC record form

bugzilla-daemon at pippin.metavore.com bugzilla-daemon at pippin.metavore.com
Tue Dec 4 23:13:46 CET 2007


http://bugs.koha.org/cgi-bin/bugzilla/show_bug.cgi?id=1634





------- Comment #9 from sanyok_ko at mail.ru  2007-12-04 14:13 -------
This MARC record was produced by Koha export utility. Sorry, but I don't know
MARC record format, and can't set the LEADER/09 to 'a'.

Using this steps, you can check, that this MARC record is in UNICODE/UTF-8:
1. Rename Koha.mrc into Koha.txt
2. Open Koha.txt with your web browser (I used Mozilla/Firefox 2.0.0.11 on
Windows)
3. Set browser display encoding to UNICODE/UTF8
4. Compare text in the browser with the screenshot 'That's how should be'
5. You can find the title of the book by string 'aПриемы' with find in
page dialog and copy/paste technics.

You can check the encoding of this record using Cyrillic UTF8 encoding table
(http://www.utf8-chartable.de/unicode-utf8-table.pl?start=1024)
Just compare some first letters of the title.

This is title of the book in UTF-8 encoding
[root at server1 ~]# echo "Приемы
объектно-ориентированного
проектирования." | hexdump
0000000 9fd0 80d1 b8d0 b5d0 bcd0 8bd1 d020 d0be
0000010 d1b1 d08a d0b5 d1ba d082 d0bd 2dbe bed0
0000020 80d1 b8d0 b5d0 bdd0 82d1 b8d0 80d1 bed0
0000030 b2d0 b0d0 bdd0 bdd0 bed0 b3d0 bed0 d020
0000040 d1bf d080 d0be d0b5 d1ba d082 d1b8 d080
0000050 d0be d0b2 d0b0 d0bd d1b8 2e8f 000a
000005d

Additionally, I have found, that this linux commands will produce the same
malformed title, as you can see at screenshot:
echo 'title of the book in UTF8' | iconv -f latin-1 -t utf-8

[root at server1 ~]# echo "Приемы
объектно-ориентированного
проектирования." | iconv -f latin1 -t utf-8 | hexdump
0000000 90c3 9fc2 91c3 80c2 90c3 b8c2 90c3 b5c2
0000010 90c3 bcc2 91c3 8bc2 c320 c290 c3be c290
0000020 c3b1 c291 c38a c290 c3b5 c290 c3ba c291
0000030 c382 c290 c3bd c290 2dbe 90c3 bec2 91c3
0000040 80c2 90c3 b8c2 90c3 b5c2 90c3 bdc2 91c3
0000050 82c2 90c3 b8c2 91c3 80c2 90c3 bec2 90c3
0000060 b2c2 90c3 b0c2 90c3 bdc2 90c3 bdc2 90c3
0000070 bec2 90c3 b3c2 90c3 bec2 c320 c290 c3bf
0000080 c291 c380 c290 c3be c290 c3b5 c290 c3ba
0000090 c291 c382 c290 c3b8 c291 c380 c290 c3be
00000a0 c290 c3b2 c290 c3b0 c290 c3bd c290 c3b8
00000b0 c291 2e8f 000a
00000b5




------- You are receiving this mail because: -------
You are the QA contact for the bug, or are watching the QA contact.





More information about the Koha-bugs mailing list