Hi,
What should be the correct behavior of string_to_utf8() in case if
source string is not wide-string? Currently it just converts all
8-bit values to UTF-8 representation, which is not (really) Right
Thing (tm) - imagine UTF-8 string passed on... It will be
scrambled...
Yes, I know, no one would do this intentionally, but "to err is
human" - one live example is SQLite module, where this conversion is
performed by default, and UTF-8 string may (and eventually will) be
passed to SQLite, will be scrambled, and... :)
May be it makes sense to do nothing (return source string) in case
if source is 8-bit string? Or (at least) check that source is valid
UTF-8 stream (I wouldn't choose this way, though)?
Regards,
/Al