utf8_char_index

24 Sep 2003

      It would also be possible to generate a byte-offset -> index table
when generating the UTF-8 from the widestring. Then it would be O(1),
but it would use 4 bytes more memory for each byte.
/ Per Hedbor ()
Previous text:
...
2003-09-24 12:28:
Subject: utf8_char_index

Actually, I wish to do it on a whole list of numbers (at least 2),
unfortunately unsorted. That could be optimized.
It's the exec() function in pcre that gives useful indexes, and I was
pondering to give the widestring wrapper object the correct result
(= character offsets) from that function, not byteoffsets.
I also need the reverse function for continued search (start_index).
/ Mirar

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

utf8_char_index