<html>
<head>
<meta content="text/html; charset=UTF-8" http-equiv="Content-Type">
</head>
<body bgcolor="#FFFFFF" text="#000000">
Thanks Galen, Paul,<br>
<br>
I'm sorry I just noticed your answers in my inbox.<br>
<br>
Galen, I tried your solution (the record is in fact a utf8 encoded
xml document), but it made no difference. But it did lead me to
some nice places in the code (and documentation) where I now specify
the BinaryEncoding => 'utf8' in the 'use Marc::File::XML'. It
will probably break another migration, but this fixes my problem for
now.<br>
<br>
Thanks again!<br>
Philippe<br>
<br>
<br>
<br>
<div class="moz-cite-prefix">On 06/04/2014 06:49 PM, Galen Charlton
wrote:<br>
</div>
<blockquote
cite="mid:CAPLnt65+uuy7_TQgJt-QD1LyCktMiGCtRKhikPkgPXUFAQeYGQ@mail.gmail.com"
type="cite">
<div dir="ltr">Hi,
<div class="gmail_extra"><br>
<div class="gmail_quote">On Wed, Jun 4, 2014 at 11:56 AM,
Philippe Blouin <span dir="ltr"><<a
moz-do-not-send="true"
href="mailto:philippe.blouin@inlibro.com"
target="_blank">philippe.blouin@inlibro.com</a>></span>
wrote:<br>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;padding-left:1ex">
<div bgcolor="#FFFFFF" text="#000000">We're using the MARC
library for some migration, as usual, but we encountered
some new issue with some arabic title: the key code 703
<br>
<table border="2">
<tbody>
<tr>
<td>0x02BF</td>
<td>703</td>
<td>MODIFIER LETTER LEFT HALF RING</td>
<td><font size="+3">ʿ</font></td>
</tr>
</tbody>
</table>
is not part of the Table db, which cause the whole
subfield to disappear and causing us headaches.<br>
</div>
</blockquote>
</div>
<div><br>
</div>
<div><br>
</div>
<div>What is the source character encoding of the records? If
the records are already in UTF-8, then it is not necessary
to transcode them to MARC8, then back to UTF8 for loading
into Koha. Adding the following line to whatever code
you're using to pre-process the records might help:</div>
<div><br>
</div>
<div>MARC::Charset->assume_unicode(1);<br>
</div>
<div><br>
</div>
<div>As an alternative, you could adjust change the records to
use 0x02bb rather than 0x02bf. I'm assuming that the
strings in question are transliterated Arabic following the
ALA-LC Arabic romanization. If so, back in 1999, the
mapping of the "ayn" character was changed from 0x02bf to
0x02bb. [1]</div>
<div><br>
</div>
<div>[1] <a moz-do-not-send="true"
href="http://www.loc.gov/marc/marbi/2005/2005-05.html">http://www.loc.gov/marc/marbi/2005/2005-05.html</a></div>
<div><br>
</div>
<div>Regards,</div>
<div><br>
Galen</div>
-- <br>
<div dir="ltr">
<div>Galen Charlton</div>
<div>Manager of Implementation</div>
<div>Equinox Software, Inc. / The Open Source Experts</div>
<div>email: <a moz-do-not-send="true"
href="mailto:gmc@esilibrary.com" target="_blank">gmc@esilibrary.com</a></div>
<div>direct: +1 770-709-5581</div>
<div>cell: +1 404-984-4366</div>
<div>skype: gmcharlt</div>
<div>web: <a moz-do-not-send="true"
href="http://www.esilibrary.com/" target="_blank">http://www.esilibrary.com/</a></div>
<div>Supporting Koha and Evergreen: <a
moz-do-not-send="true" href="http://koha-community.org"
target="_blank">http://koha-community.org</a> & <a
moz-do-not-send="true" href="http://evergreen-ils.org"
target="_blank">http://evergreen-ils.org</a></div>
</div>
</div>
</div>
</blockquote>
<br>
<div class="moz-signature">-- <br>
<style type="text/css">
.moz-signature {
color: #FFFFFF;
}
.sig_inlibro {
padding-top : 2px;
color: #888888;
font-family : "Trebuchet MS", verdana;
font-size: 90%;
}
.sig_content {
border-top: 2px solid #DDDDDD;
border-bottom: 2px solid #BFD13D;
background-color : #F6F6F6;
padding-left:10px;
}
.sig_inlibro a:visited, .sig_inlibro a:hover, .sig_inlibro a:link {
text-decoration: none;
color: #005B85;
}
.nom {
color: #005B85;
font-weight : bold;
}
.inlibro, .in {
color: #BFD13D;
}
.libro {
color: #005B85;
}
.in, .libro {
font-size : 120%;
}
.desc {
margin-bottom: 0;
padding-bottom: 5px;
}
.small {
font-size: 80%;
}
.tagline {
color : #00BCE4;
}
.sig_footer {
padding-left : 10px;
background-color : #EEEFEA;
}
</style>
<div class="sig_inlibro">
<div class="sig_content"> <span class="nom">Philippe Blouin,</span><br>
<span class="tagline small">Responsable du développement
informatique</span><br>
<p class="desc small"> Tél. : (888) 604-2627<br>
<a href="mailto:philippe.blouin@inLibro.com">philippe.blouin@inLibro.com</a>
</p>
</div>
<div class="sig_footer"> <span class="in">in</span><span
class="libro">Libro</span> <span class="tagline small">|
pour esprit libre |</span> <a class="small"
href="http://www.inLibro.com">www.inLibro.com</a> </div>
</div>
</div>
</body>
</html>