[antlr-interest] ASCII versus UNICODE

Silvain Piree s.piree at enneya.com
Wed Jun 19 00:51:12 PDT 2002


I'm parsing source files containing ASCII characters. 
One of these files contains a special character causing a 

I specified "charVocabulary = '\0'..'\377' " which is the full
ASCII range so there should not be a problem.

When printing the ASCII code for the character I got 8218
which is definitely not within the range of ASCII.

Is it ANTLR which is converting the ASCII character to
UNICODE, or is Java doing that? Is it because I'm using a
Reader class instead of an InputStream?

Any suggestions would be welcome.



Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 

More information about the antlr-interest mailing list