Hi. I've been reading a bit about utf8, and I learned that when reading a utf8 character, for each byte I need to check: (byte & 0xC0 ) == 0xC0 means that there is another byte for this character. Otherwise, it's the last byte of the character. Shmuel.