I just noticed that Locale.Charset treats US-ASCII and Latin-1 as equivalent encodings. I'd like to implement a separate encoder for the former so that the custom substitution string/callback gets invoked for every character above 127. Any objections, or should I reserve a new name (suggestions welcome) for this behavior?
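To make the intended difference concrete, here is a minimal sketch of the behavior, written in Python purely as an illustration and not as the Locale.Charset implementation. The function name `encode_ascii` and the parameters `replacement` and `repcb` are hypothetical stand-ins for the substitution string and callback mentioned above.

```python
def encode_ascii(text, replacement=None, repcb=None):
    """Encode text as strict 7-bit ASCII (illustrative sketch only).

    Every character above 127 is passed to the callback `repcb` if one
    is given, otherwise replaced by the string `replacement`. With
    neither, the character is reported as an error.
    """
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp < 0x80:
            # Plain ASCII: emit the byte unchanged.
            out.append(cp)
            continue
        sub = repcb(ch) if repcb is not None else replacement
        if sub is None:
            raise ValueError("character U+%04X is not encodable" % cp)
        # The substitution itself must be ASCII, or encoding fails here.
        out.extend(sub.encode("ascii"))
    return bytes(out)


# A Latin-1 encoder passes U+00EF through as a single byte ...
latin1_bytes = "na\u00efve".encode("latin-1")
# ... whereas a strict ASCII encoder has to substitute it.
ascii_bytes = encode_ascii("na\u00efve", replacement="?")
entity_bytes = encode_ascii("na\u00efve", repcb=lambda c: "&#%d;" % ord(c))
```

With an encoder that treats the two charsets as equivalent, the second and third calls would never reach the substitution path for characters in the 128-255 range.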
There is already a FIXME for implementing this, so go right ahead. :-) (Please also make sure that other aliases for ASCII such as iso-ir-6 and iso646-us are handled the same way.)
I found these aliases:
ascii us usascii isoir6 iso646us iso646irv1991 367 ansix341968
Looks ok?
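For reference, a small sketch of how such an alias lookup could work, again in Python as an illustration only. It assumes that names are normalized by lowercasing and dropping everything except letters and digits, which is how the entries in the list above appear to be written; the helper names `normalize` and `is_ascii_alias` are hypothetical.

```python
import re

# The aliases from the list above, already in normalized form.
ASCII_ALIASES = frozenset(
    "ascii us usascii isoir6 iso646us iso646irv1991 367 ansix341968".split()
)


def normalize(name):
    """Lowercase the name and drop everything but letters and digits.

    This normalization rule is an assumption made for the sketch.
    """
    return re.sub(r"[^a-z0-9]", "", name.lower())


def is_ascii_alias(name):
    """Return True if the name normalizes to one of the ASCII aliases."""
    return normalize(name) in ASCII_ALIASES
```

Under this assumption, "ISO-IR-6", "ISO646-US" and "ANSI_X3.4-1968" all resolve to the ASCII encoder, while "ISO-8859-1" does not.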
ISO4873 is:
0x00 - 0x7f  Identical to US-ASCII (ISO646-US).
0x80 - 0x9f  Reserved for control characters.
0xa0 - 0xff  Reserved for graphical characters.
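The three ranges can be expressed as a simple classifier. This is only a sketch of the layout quoted above, in Python for illustration; the function name `iso4873_region` and the returned labels are made up for the example.

```python
def iso4873_region(byte):
    """Classify a byte value according to the ISO4873 layout above."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("not a byte value: %r" % (byte,))
    if byte <= 0x7F:
        return "ascii"    # identical to US-ASCII (ISO646-US)
    if byte <= 0x9F:
        return "control"  # reserved for control characters
    return "graphic"      # reserved for graphical characters
```

Only the first region overlaps with US-ASCII, so a strict ASCII encoder would substitute anything that falls into the other two.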
pike-devel@lists.lysator.liu.se