Friday, January 30, 2009 06:01 AM
I am tryin to parse an xml file with Encoding at UTF-8. The parser fails with the following error:
parser error : Input is not proper UTF-8, indicate encoding !
Bytes: 0xC7 0x41 0x49 0x53
AMITIÉS FRAN╟AISES ANTWERPEN
However, when I Change the XML header to ISO-8859-1 encoding, the parsing works fine.What caused this? The character that it finds illegal is "C-Cedilla" (Ç). As far as I know, this is a legal character in UTF-8.