[Koha-devel] Strange characters

Didier Gautheron didier.gautheron at biblibre.com
Wed Dec 16 17:02:54 CET 2020


Hi,

On 16 December 2020 at 16:23, "Fridolin SOMERS" <fridolin.somers at biblibre.com> wrote:

> Hi,
> 
> I found some strange characters in the sources:
> 
> https://git.koha-community.org/Koha-community/Koha/src/branch/master/tools/koha-news.pl#L7
> 
> I see a <?>:
> Casta?eda, Carlos Sebastian
> 
> Do you see that?

It seems to be valid UTF-8. The bytes are:
ef bf bd
which is the character named REPLACEMENT CHARACTER (U+FFFD).
It likely comes from an old Windows-encoded file, with the ñ being the culprit.
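
As a rough illustration of the bytes involved, here is a minimal sketch, assuming a POSIX shell with od, tr and iconv available. The function names are made up for this example, and the choice of ISO-8859-1 as the original encoding is only an assumption (the ñ is the same byte, f1, in Windows-1252).

```shell
# U+FFFD REPLACEMENT CHARACTER encoded as UTF-8 is the three bytes ef bf bd.
# printf octal escapes (\357\277\275) are portable across POSIX shells.
replacement_bytes() {
    printf '\357\277\275' | od -An -tx1 | tr -d ' \n'
}

# In ISO-8859-1 (assumed original encoding), "ñ" is the single byte f1
# (octal 361). A correct conversion to UTF-8 gives the two bytes c3 b1.
n_tilde_utf8_bytes() {
    printf '\361' | iconv -f ISO-8859-1 -t UTF-8 | od -An -tx1 | tr -d ' \n'
}

replacement_bytes; echo      # prints efbfbd
n_tilde_utf8_bytes; echo     # prints c3b1
```

So a file that still had the raw f1 byte would be invalid UTF-8, while a file where some tool already swapped it for ef bf bd is valid UTF-8 but has lost the original letter.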

> Is this non-UTF-8?
> Can we build a command to find them all?
> I've tried with 'grep -P' but it was impossible.
git grep �
finds them, with false positives. Or use iconv:
iconv -f utf8 -t utf8
should complain if there are invalid sequences, e.g.:
LANG=C iconv -f utf8 -t utf8 ./misc/cronjobs/automatic_renewals.pl > /dev/null
iconv: illegal input sequence at position 81
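
The two checks above could be wrapped so they run over many files. This is only a sketch, assuming a POSIX shell with iconv and grep; the function names find_invalid_utf8 and contains_replacement_char are hypothetical, not something that exists in Koha.

```shell
# Print the name of every given file that is NOT valid UTF-8.
# iconv exits with a non-zero status on an illegal input sequence.
find_invalid_utf8() {
    for f in "$@"; do
        if ! LANG=C iconv -f UTF-8 -t UTF-8 "$f" > /dev/null 2>&1; then
            printf '%s\n' "$f"
        fi
    done
}

# Print the name of every given file that contains U+FFFD, i.e. the
# UTF-8 bytes ef bf bd. These files are valid UTF-8, so iconv is silent.
contains_replacement_char() {
    fffd=$(printf '\357\277\275')
    for f in "$@"; do
        # LC_ALL=C makes grep compare raw bytes; -F takes the pattern literally.
        if LC_ALL=C grep -qF -e "$fffd" "$f"; then
            printf '%s\n' "$f"
        fi
    done
}
```

For example, from the top of a checkout, something like
find_invalid_utf8 $(git ls-files '*.pl' '*.pm')
would list the Perl files iconv rejects (this simple form breaks on file names containing spaces). The second function covers the case from the original question, where the file is valid UTF-8 but already contains the replacement character.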

