[Koha-devel] Re: MARC character encoding

Ed Summers ehs at pobox.com
Thu Jan 23 06:19:04 CET 2003


> My conclusion is that the following code:
>        s/\xe1/\xc1/gm;
>        s/\xe2/\xc2/gm;
>        s/\xe3/\xc3/gm;
>        s/\xe4/\xc4/gm;
>        s/\xe8/\xc8/gm;
>        s/\xe9/\xc9/gm;
>        s/\xf0/\xd0/gm;
> is enough to migrate from MARC21 to UNIMARC char coding. I tried this 
> on my marc21->unimarc script, on 30 000 records, and it works fine.
> 
> So, I think we have 2 complete tables (marc21 and unimarc) in Biblio.pm, 
> that I committed a few minutes ago.

This is pretty sketchy, and I would guess far from complete. Perhaps
it's OK for your purposes but I wouldn't be comfortable sending my data
through it. In the absence of any documentation on UNIMARC character
encoding I guess you've decided that it will have to do.
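
For reference, a minimal sketch of how the seven quoted substitutions could be
written as a single tr/// pass, together with a way to check how much of a
file they actually cover. The sub names marc21_to_unimarc_bytes and
unmapped_high_bytes are hypothetical and are not taken from Biblio.pm. The
byte pairs are copied from the quoted code only, so this makes no claim to be
a complete MARC21 to UNIMARC table.

```perl
use strict;
use warnings;

# Hypothetical helper: apply the seven byte substitutions from the quoted
# code in one pass. Source and target bytes do not overlap, so a single
# tr/// gives the same result as running the seven s///g in sequence.
sub marc21_to_unimarc_bytes {
    my ($data) = @_;
    (my $out = $data) =~ tr/\xe1\xe2\xe3\xe4\xe8\xe9\xf0/\xc1\xc2\xc3\xc4\xc8\xc9\xd0/;
    return $out;
}

# Hypothetical helper: count every byte >= 0x80 that the table above does
# NOT translate. Returns a hash ref of lowercase hex byte => occurrences.
sub unmapped_high_bytes {
    my ($data) = @_;
    my %seen;
    for my $byte ($data =~ /([\x80-\xff])/g) {
        next if $byte =~ /[\xe1-\xe4\xe8\xe9\xf0]/;    # covered by the table
        $seen{ sprintf '%02x', ord $byte }++;
    }
    return \%seen;
}
```

Running unmapped_high_bytes over the raw records before conversion lists the
high bytes that would pass through unchanged. An empty result on a given file
only shows that the table is sufficient for that file, not that it is
complete.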

//Ed
