[antlr-interest] Encoding of grammar files

Loring Craymer lgcraymer at yahoo.com
Sun Jun 29 22:29:28 PDT 2008


Try replacing the charVocabulary line in antlr.g with

charVocabulary='\u0000'..'\uFFFE';and rebuilding ANTLR 3 from source.

That should work; if not, I'm quite sure that Ter can be bribed with vodka and beer.

--Loring




----- Original Message ----
> From: Петров Александр <gmdidro at gmail.com>
> To: antlr-interest at antlr.org
> Sent: Sunday, June 29, 2008 8:16:08 AM
> Subject: [antlr-interest] Encoding of grammar files
> 
> Hello,
> 
> I want to add comments in my grammar files (.g) in Russian.
> 
> But it's seems to me, that antlr 3.0.1 have never been in the country
> of vodka and bears :)
> 
> grammar test;
> 
> options{ output = AST; ASTLabelType=CommonTree; language = CSharp; }
> 
> //Привет всем! -- Comment in Russian
> program : 'program';
> 
> >java -classpath "D:\antlr\antlr-3.0.1\lib\antlr-2.7.7.jar;D:\antlr\antlr-3.0
> .1\lib\antlr-3.0.1.jar;D:\antlr\antlr-3.0.1\lib\antlr-runtime-3.0.1.jar;D:\antlr
> \antlr-3.0.1\lib\stringtemplate-3.1b1.jar;" org.antlr.Tool test.g
> 
> ANTLR Parser Generator  Version 3.0.1 (August 13, 2007)  1989-2007
> error(10):  internal error: test.g : test.g:10:3: unexpected char: 0x41F
> org.antlr.tool.ANTLRLexer.nextToken(ANTLRLexer.java:321)
> antlr.TokenStreamRewriteEngine.nextToken(TokenStreamRewriteEngine.java:161)
> antlr.TokenBuffer.fill(TokenBuffer.java:69)
> antlr.TokenBuffer.LA(TokenBuffer.java:80)
> antlr.LLkParser.LA(LLkParser.java:52)
> org.antlr.tool.ANTLRParser.grammar(ANTLRParser.java:338)
> org.antlr.tool.Grammar.setGrammarContent(Grammar.java:521)
> org.antlr.tool.Grammar.(Grammar.java:456)
> org.antlr.Tool.getGrammar(Tool.java:331)
> org.antlr.Tool.process(Tool.java:267)
> org.antlr.Tool.main(Tool.java:70)
> 
> How to change the encoding of input .g-files and tell antlr about it ?
> Thank you,
> Petrov Alexander



      



More information about the antlr-interest mailing list