[antlr-interest] Unicode character classes

Johannes Luber jaluber at gmx.de
Sun Mar 4 05:15:42 PST 2007


Is it possible to specify Unicode character classes such as Zs, Lu, Ll, Lt,
Lm, Lo, Nl and others without having to spell out every single
character? As
http://www.fileformat.info/info/unicode/category/Lu/list.htm shows, many
characters aren't part of a contiguous range. If not, this would be a
useful addition to ANTLR. In that regard, it seems one can't create
arbitrary sets of tokens and then exclude other arbitrary tokens from
them. Or does anyone know a
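
To make the "spell out every character" problem concrete, here is a
minimal sketch of one possible workaround: generate the ranges for a
general category once and paste them into a lexer rule. It relies only on
java.lang.Character.getType. The class name CategoryRanges, both helper
methods, and the 'x'..'y' alternative syntax they emit are assumptions made
for illustration, not an existing ANTLR feature. The result also depends on
the Unicode version shipped with the JDK in use.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of ANTLR.
final class CategoryRanges {

    // Collect contiguous BMP ranges [first, last] whose general category
    // equals the given java.lang.Character type constant.
    static List<int[]> ranges(int category) {
        List<int[]> out = new ArrayList<>();
        int start = -1;
        for (int cp = 0; cp <= 0xFFFF; cp++) {
            boolean in = Character.getType(cp) == category;
            if (in && start < 0) {
                start = cp;                          // a range opens here
            } else if (!in && start >= 0) {
                out.add(new int[] {start, cp - 1});  // a range closed just before cp
                start = -1;
            }
        }
        if (start >= 0) {
            out.add(new int[] {start, 0xFFFF});      // range running to the end of the BMP
        }
        return out;
    }

    // Render the ranges as lexer alternatives (assumed syntax) using
    // four-digit hexadecimal character escapes.
    static String toAlternatives(List<int[]> ranges) {
        StringBuilder sb = new StringBuilder();
        for (int[] r : ranges) {
            if (sb.length() > 0) {
                sb.append(" | ");
            }
            if (r[0] == r[1]) {
                sb.append(String.format("'\\u%04X'", r[0]));
            } else {
                sb.append(String.format("'\\u%04X'..'\\u%04X'", r[0], r[1]));
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Lu = UPPERCASE_LETTER; Zs would be Character.SPACE_SEPARATOR.
        System.out.println(toAlternatives(ranges(Character.UPPERCASE_LETTER)));
    }
}
```

The sketch only scans the Basic Multilingual Plane, and the generated list
would have to be regenerated whenever the underlying Unicode data changes,
which is why built-in support would still be preferable.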

Best regards,
Johannes Luber

