ah, i see where this is getting at, because of the variable length you have to do extra work to keep track of the position of each character, so working directly on utf-8 makes no sense and any code using utf-8 internally will have to keep converting back and forth.
greetings, martin.