Tell that to the Windows computer I used last week that insisted Euro should be stored as 0x80... :-)
Anyway, this is off-topic but I'm curious to find out how diff would recognize which charset the file is using. If I have another string with the currency sign (0xA4 in ISO-8859-1) I'm quite certain it would report it as identical to Euro (0xA4 in ISO-8859-15). But for the sake of this argument I'll say "\x2122" (trademark) instead. (Gotcha! :-)
/ Jonas Walldén
Previous text:
2003-09-23 00:24: Subject: Re: wish: string with other quoting then \
Well, since HTML builds on SGML it's not clearly stated in the HTML specification,
HTML 2.0 _does_ cleary state that ' is valid. RFC 1866, section 3.2.4.
Take the Euro character as an example. "\x20AC" is what I'd use today, but that wouldn't work. True wide-string doesn't work well with tools such as mail, cvs, diff etc so that's not an option either. UTF-8 escaping is even less of an option due to more non-printing characters.
Just encode the source code as iso-8859-15. Works fine with tools such as mail, cvs, diff etc.
/ Marcus Comstedt (ACROSS) (Hail Ilpalazzo!)