[antlr-interest] ASCII versus UNICODE

Wed Jun 19 00:51:12 PDT 2002

Hi,

I'm parsing source files containing ASCII characters. 
One of these files contains a special character causing a 
NoViableAltForCharException.

I specified "charVocabulary = '\0'..'\377' " which is the full
ASCII range so there should not be a problem.

When printing the ASCII code for the character I got 8218
which is definitely not within the range of ASCII.

Is it ANTLR which is converting the ASCII character to
UNICODE, or is Java doing that? Is it because I'm using a
Reader class instead of an InputStream?

Any suggestions would be welcome.

Silvain

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/