[Koha-bugs] [Bug 35104] We should warn when attempting to save MARC records that contain characters invalid in XML

bugzilla-daemon at bugs.koha-community.org
Wed Nov 1 01:48:25 CET 2023


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35104

--- Comment #19 from David Cook <dcook at prosentient.com.au> ---
(In reply to David Cook from comment #18)
> That being said... the XML 1.0 spec is pretty forgiving. After review, it's
> really just excluding *some* ASCII control characters, Unicode surrogates,
> U+FFFE, and U+FFFF. That's a really small number of characters and none of
> them are printable characters.

In other words, for a UTF-8 encoded document, bug 34549 would only strip
non-printable characters.
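
As a rough sketch of how small that excluded set is, here is a Python check
built from the complement of the XML 1.0 Char production
(#x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]).
The function names are made up for illustration, and this is not the code
from bug 34549 or anything in Koha itself; it only shows which code points
would be flagged or stripped under that rule.

```python
import re

# Complement of the XML 1.0 "Char" production:
#   C0 controls other than TAB (U+0009), LF (U+000A) and CR (U+000D),
#   the surrogate block U+D800-U+DFFF, and U+FFFE / U+FFFF.
_INVALID_XML_CHARS = re.compile(
    r"[\x00-\x08\x0b\x0c\x0e-\x1f\ud800-\udfff\ufffe\uffff]"
)


def find_invalid_xml_chars(text):
    """Return (offset, code point label) pairs for characters XML 1.0 rejects.

    Hypothetical helper: useful for warning the user before saving.
    """
    return [
        (match.start(), f"U+{ord(match.group()):04X}")
        for match in _INVALID_XML_CHARS.finditer(text)
    ]


def strip_invalid_xml_chars(text):
    """Return text with every character XML 1.0 rejects removed.

    Hypothetical helper: illustrates the "strip" behaviour only.
    """
    return _INVALID_XML_CHARS.sub("", text)
```

A warning could be driven by find_invalid_xml_chars() returning a non-empty
list, while strip_invalid_xml_chars() leaves ordinary printable text,
accented letters included, untouched.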

I might have another crack at trying to get a Latin-1 encoded document into
Koha... as I know that I've had Latin-1 encoded data in Koha before (although
it's very possible that it came through side-loading using non-Koha tools).
