[Koha-zebra] Re: Unimarc, marc21, Unicode, and MARC::File::XML
Adam Dickmeiss
adam at indexdata.dk
Tue Mar 21 20:57:30 CET 2006
Tümer Garip wrote:
> Hi Adam,
> You seem a bit offended. That was not my intention; frustration
> sometimes makes me use harsh words, and translating them into English
> may make them sound too harsh.
>
> I do not need to send you any config+examples because I tested this with
> your default config files. I am attaching an XML record in UTF-8.
If you're to receive help from me, you need to tell me which zebra.cfg
you're using, and show me the record plus the way you indexed it (zebraidx
update?).
>
> Briefly, I had the default configuration files and built Zebra with XML
> records. When I noticed the problem
> I used yaz-client to see what was going on. In my log I could see that
> the data going into Zebra was encoded in UTF-8,
> while yaz-client was returning XML with a header saying iso-8859-1, even
> though I could see the UTF-8 characters themselves, shown as hex in
> yaz-client.
I also need to know what you see, and what you'd expect to see.
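
To make that comparison concrete, a rough sketch along the following lines
could be run on the raw bytes of a retrieved record. It is plain Python and
not part of Zebra or YAZ; the function names and the sample record are made
up for illustration. It only checks whether the encoding named in the XML
declaration agrees with the bytes that follow, and it is a heuristic: short
ISO-8859-1 text can occasionally also be valid UTF-8.

```python
import re

# Matches the encoding pseudo-attribute of an XML declaration, e.g.
# <?xml version="1.0" encoding="UTF-8"?>
_DECL = re.compile(
    rb'^\s*<\?xml[^>]*?encoding\s*=\s*["\']([A-Za-z0-9._-]+)["\']'
)


def declared_encoding(raw: bytes) -> str:
    """Return the encoding named in the XML declaration, lower-cased.

    Without an explicit declaration, UTF-8 is assumed (BOMs are ignored).
    """
    m = _DECL.match(raw)
    return m.group(1).decode("ascii").lower() if m else "utf-8"


def looks_like_utf8(raw: bytes) -> bool:
    """True if the bytes hold non-ASCII data that decodes cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return any(b > 0x7F for b in raw)


def header_mismatch(raw: bytes) -> bool:
    """True when the header names a non-UTF-8 encoding but the data is UTF-8."""
    return declared_encoding(raw) not in ("utf-8", "utf8") and looks_like_utf8(raw)


# Hypothetical record: the header claims iso-8859-1, the bytes are UTF-8.
record = (
    '<?xml version="1.0" encoding="iso-8859-1"?>'
    "<record><title>Tümer</title></record>"
).encode("utf-8")

print(declared_encoding(record), header_mismatch(record))
# prints: iso-8859-1 True
```

A record for which header_mismatch() returns True would be the kind of
example I am asking for, together with the zebra.cfg that produced it.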
/ Adam
> I have retried this procedure just now and the result is the same. Just
> by adding encoding:UTF-8 to zebra.cfg and restarting the server you get
> the correct header and correct data. Please note that the server has to be
> restarted but the zebradb does not have to be rebuilt.
>
> Thanks
> Tumer
>
> -----Original Message-----
> From: Adam Dickmeiss [mailto:adam at indexdata.dk]
> Sent: Tuesday, March 21, 2006 9:00 PM
> To: Tümer Garip
> Cc: paul.poulain at free.fr; koha-zebra at nongnu.org
> Subject: Re: [Koha-zebra] Re: Unimarc, marc21, Unicode, and
> MARC::File::XML
>
>
> Tümer Garip wrote:
>
>>Hi,
>>
>>This problem, if I understood it correctly, has nothing to do with
>>MySQL or Perl; it has to do with Zebra, unless it is to do with UNIMARC,
>>which I am not familiar with. As you know (Paul), I have a UTF-8
>>version working.
>>
>>I had the same problem with records coming from Zebra and found out
>>that it does not do what it is supposed to do unless you explicitly
>>set it to UTF-8. You have to explicitly put "encoding utf-8" in all
>>your Zebra config files, especially zebra.cfg and your .abs files.
>>Contrary to the documentation, which says that Zebra's character encoding
>>is set automatically from the XML encoding, it is NOT.
>
> I can't reproduce this (bug). Care to share a config+example that
> illustrates this (inserts an XML record from Perl in UTF-8)?
>
>
>>Perl sends XML to Zebra with encoding utf-8 in the header and UTF-8
>>data in it. Zebra saves all the data in UTF-8 but returns XML
>>saying encoding iso8859-1 in the header with UTF-8 characters in the data.
>>No module can correct this as it is stupid.
>
> Just need to know when the stupidity starts:-)
>
> / Adam
>
>
>>I corrected the problem by adding encoding:UTF-8 in zebra.cfg,
>>record.abs, sort-string.chr
>>
>>Hope it solves yours,
>>
>>Tumer
>>
>>
>>
>>_______________________________________________
>>Koha-zebra mailing list
>>Koha-zebra at nongnu.org
>>http://lists.nongnu.org/mailman/listinfo/koha-zebra
>>