[antlr-interest] antlr3, international symbols

Andrew Gaydenko a at gaydenko.com
Fri Oct 5 15:59:20 PDT 2007


Hi!

I have such tokens defenitions (fragment):

------------------
fragment FOO
	:	'$' KEY ';' { doSomethingWith($KEY.text); }
	;
	
fragment KEY
	:	 ~( ' ' | '\t' | '\r' | '\n' | '\f' | '$' | ';' )+
	;
------------------


When KEY is "normal" word (say, '123.456' or 'a.b.c.d.'), all works fine.
But when KEY contains international symbols, $KEY.text returns "something
else" (abracadabra). A project works inside UTF8 enviroment (Linux/Eclipse).

On the contrary, such RAW token works fine with international symbols:

------------------
BAR
	:	(
			(BLA)=> BLA
		|	RAW  { doSomethingWith($RAW.text); }
		)*
	;

fragment RAW 
	:	.
	;
------------------

Have I missed something?


Andrew


More information about the antlr-interest mailing list