[Koha-bugs] [Bug 35104] We should warn when attempting to save MARC records that contain characters invalid in XML

bugzilla-daemon at bugs.koha-community.org bugzilla-daemon at bugs.koha-community.org
Wed Nov 1 00:26:56 CET 2023


https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35104

--- Comment #16 from David Cook <dcook at prosentient.com.au> ---
(In reply to Martin Renvoize from comment #15)
> Wow, that's an impressive find. Man this stuff ends up in fun "exciting"
> places.

Thanks! I did feel pretty good working that one out!

During this process, I've been thinking there might be an alternative to bug
34549 as well.

Instead of erasing the invalid characters, surely we could just escape them.

So I'm going to use my discovery from
https://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=35104#c14 to see if I
can do an alternate version of bug 34549 where we escape the bad characters
using numeric character references or an entity.

So the STX character from that example would be converted to something like
 instead of just erased.

Of course, it would be good to try it out with a variety of examples and not
just this sort of data corruption example.

But maybe it's also one of those ones where we can just do our best and see how
it evolves over time...

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.


More information about the Koha-bugs mailing list