[Israel.pm] about utf8

Shmuel Fomberg semuelf at 012.net.il
Sun Jan 18 11:41:13 PST 2009


Yuval Kogman wrote:
> Why are you decoding utf8 manually?

I'm planing to add to Data::ParseBinary encoding ability. because it's 
part of binary stream, I can't relay on Perl to take the correct number 
of bytes for me. only after the text is separated from the binary 
stream, I can give it to Perl for decoding.

So I try to define a character. And debating with myself if surrogate 
pair count as one character or two. And how to detect such pair - if 
it's little or big endian, in the most elegant way.

Shmuel.



More information about the Perl mailing list