[antlr-interest] C++ and Unicode

Ruslan Zasukhin sunshine at public.kherson.ua
Mon Aug 16 04:22:36 PDT 2004


On 8/16/04 12:57 PM, "Ric Klaren" <klaren at cs.utwente.nl> wrote:

> On Mon, Aug 16, 2004 at 12:36:16PM +0300, Ruslan Zasukhin wrote:
>> Yes we also think that UTF8 should be the first step to unicode world.
>> And it looks to be relatively easy step.
> 
> I got it it going and indeed it was easier as I expected. But there's
> probably still a few 'sore' spots where wrong assumptions are made or the
> codegenerator needs tweaking. Generating grammars with a big charVocab is
> slow due to the bitset generating. Also the space bitsets currently take is
> a bit too much. I guess I can get a nice initial reduction by stripping
> leading/trailing zero's of the bitsets.

Yes, you can make such bitset class easy enough.

I wonder which char set you use?
    32000 chars ?


-- 
Best regards,
Ruslan Zasukhin      [ I feel the need...the need for speed ]
-------------------------------------------------------------
e-mail: ruslan at paradigmasoft.com
web: http://www.paradigmasoft.com

To subscribe to the Valentina mail list go to:
http://lists.macserve.net/mailman/listinfo/valentina
-------------------------------------------------------------



 
Yahoo! Groups Links

<*> To visit your group on the web, go to:
    http://groups.yahoo.com/group/antlr-interest/

<*> To unsubscribe from this group, send an email to:
    antlr-interest-unsubscribe at yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
    http://docs.yahoo.com/info/terms/
 



More information about the antlr-interest mailing list