In the realm of future improvements, I'm thinking about changing the generate code for lexer grammars. My thoughts are here: http://www.antlr.org/wiki/display/~admin/2010/01/19/ANTLR+v4+lexers Ter