[Israel.pm] about utf8
semuelf at 012.net.il
Sun Jan 18 11:41:13 PST 2009
Yuval Kogman wrote:
> Why are you decoding utf8 manually?
I'm planing to add to Data::ParseBinary encoding ability. because it's
part of binary stream, I can't relay on Perl to take the correct number
of bytes for me. only after the text is separated from the binary
stream, I can give it to Perl for decoding.
So I try to define a character. And debating with myself if surrogate
pair count as one character or two. And how to detect such pair - if
it's little or big endian, in the most elegant way.
More information about the Perl