why is pike not using utf-8 internally?

16 Dec 2006


      ...
...
explaining that Pike doesn't advocate using UTF8 as an internal
format, and why
that is a good question.
why is that?
UTF8 is an encoding format; an element (byte) of a UTF8 byte string
doesn't necessarily represent a whole character in the string.
The answer is more or less the same as why you don't use gzipped
strings as an internal format: an indexed character in your data
doesn't represent a character in the text.
Both gzip and UTF8 are good transfer encoding formats (and
orthogonal), but neither is very useful for string manipulation.
(Ok, gzip is worse, I'll give you that. But you get my point.)
Nothing stops you from using either gzipped or UTF-8-encoded strings in
Pike though, if you find it useful.
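
The indexing argument can be made concrete with a small sketch. It is written in Python rather than Pike, purely as an illustration: Python's str is indexed by character and its bytes by byte, which mirrors the distinction made above. The sample text and the helper char_at are made up for this example and are not part of any library.

```python
# Illustration only (Python, not Pike): indexing decoded text vs. UTF-8 bytes.
text = "na\u00efve caf\u00e9"       # decoded text, one element per character
encoded = text.encode("utf-8")     # transfer form, one element per byte

print(len(text))       # 10 characters
print(len(encoded))    # 12 bytes: the two accented letters take two bytes each

print(text[2])         # the third character, found directly by index
print(encoded[2:4])    # the two bytes that together encode that character


def char_at(utf8_bytes: bytes, index: int) -> str:
    """Hypothetical helper: return the index-th character of UTF-8 data.

    It has to scan from the start, because byte offsets and character
    offsets diverge as soon as a multi-byte character appears.
    """
    count = -1
    for pos, byte in enumerate(utf8_bytes):
        if byte & 0xC0 != 0x80:    # not a continuation byte: a character starts here
            count += 1
            if count == index:
                end = pos + 1
                # Collect the continuation bytes that belong to this character.
                while end < len(utf8_bytes) and utf8_bytes[end] & 0xC0 == 0x80:
                    end += 1
                return utf8_bytes[pos:end].decode("utf-8")
    raise IndexError(index)


print(char_at(encoded, 9) == text[9])   # same character, but found by a linear scan
```

On the decoded string, a lookup by index is direct. On the UTF-8 bytes, the same lookup needs a scan from the beginning, which is the practical cost the answer above is pointing at.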
