[antlr-interest] Re: Problems with Unicode support in ANTLR
micheal_jor
open.zone at virgin.net
Thu May 16 22:59:19 PDT 2002
--- In antlr-interest at y..., "Matthew Ford" <Matthew.Ford at f...> wrote:
> This approach would not work for me as I need
>
> IDENT
> options {testLiterals=true;
> paraphrase = "an identifier";}
> : ('a'..'z'|'_'|'$'|'\u0080'..'\uFFFE')
> ('a'..'z'|'_'|'0'..'9'|'$'|'\u0080'..'\uFFFE')*
> ;
>
> So rather then sub-blocks, what I need is an efficient compression
method to
> store these bitsets in the Antlr.
The \00800..\uFFFE range might be overkill as many characters in that
range would not [normally] be usable as parts of an IDENT.
You are right that more efficient BitSet representation are needed
for ANTLR's Unicode support in general.
Micheal
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
More information about the antlr-interest
mailing list