And what happens with the performance if all strings are converted to UTF-8 and then back again?
I think a better alternative might be to attempt to change 'char' to 'int32' in the PCRE-code.
/ Per Hedbor ()
Previous text:
2003-01-29 17:36: Subject: Re: Real-life PCRE vs Regexp benchmarking
On Wed, Jan 29, 2003 at 05:30:04PM +0100, Marcus Comstedt (ACROSS) (Hail Ilpalazzo!) @ Pike (-) developers forum wrote:
If we intend this to be a permanent solution, the fact that PCRE probably doesn't handle wide strings is a problem.
It has some support for UTF-8, and basically it is 8bit clean. This part is not well tested, though.
Regards, /Al
/ Brevbäraren