[antlr-interest] Characters ISO-8859-1 in grammars

Jim Idle jimi at temporal-wave.com
Fri Sep 3 09:07:15 PDT 2010


You can use any character that Java can handle, but when such characters
appear in the lexer, you are best advised to spell them out as Unicode
code point escape sequences, such as:

CHARS: '\u1234\u3455' ;

The reason is that when you cut and paste grammar text between documents
and so on, various programs will transform non-ASCII characters into invalid
sequences or otherwise corrupt them.
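
If you have a lot of such characters, typing the escapes by hand gets
tedious. Here is a minimal sketch of a helper that produces them for you.
The class and method names (UnicodeEscapes, escapeNonAscii) are just made
up for illustration and are not part of ANTLR. It assumes every character
fits in a single Java char, which is true for all of ISO-8859-1.

```java
// Hypothetical helper for illustration only; not part of ANTLR.
final class UnicodeEscapes {
    private UnicodeEscapes() {}

    // Returns the text with every char above 0x7F rewritten as a
    // backslash-u escape with four hex digits; ASCII passes through.
    static String escapeNonAscii(String text) {
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c > 0x7F) {
                out.append(String.format("\\u%04X", (int) c));
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // The input is itself written with escapes so that this source
        // file stays pure ASCII (A-grave, E-acute, N-tilde).
        System.out.println(escapeNonAscii("\u00C0\u00C9\u00D1"));
    }
}
```

The string it returns should be suitable for pasting between the quotes of
a lexer literal like the CHARS rule above.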

Jim

> -----Original Message-----
> From: antlr-interest-bounces at antlr.org [mailto:antlr-interest-
> bounces at antlr.org] On Behalf Of Prados Valiente Enrique
> Sent: Friday, September 03, 2010 2:48 AM
> To: antlr-interest at antlr.org
> Subject: [antlr-interest] Characters ISO-8859-1 in grammars
> 
> Hi all,
> 
> Can I use characters like those listed at
> 
> http://www.parthia.com/fonts/crossfont_test_ISO-8859-1.htm
> 
> À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô Õ Ö × Ø Ù Ú Û Ü Ý Þ ß
> 
> in my grammar?
> 
> Any samples, please?
> 


