On Mon, 30 Aug 2004, Anatoly Vorobey wrote:

> Turns out there's a file where the characters are already broken down
> by classes:
>
> http://www.unicode.org/Public/UNIDATA/extracted/DerivedGeneralCategory.txt

Thanks. This is what I needed.

--
Shlomo Yona
shlomo at cs.haifa.ac.il
http://cs.haifa.ac.il/~shlomo/