[antlr-interest] Accented chars in Brazilian Portuguese

Nilo Roberto C Paim nilopaim at gmail.com
Wed Jun 1 13:53:52 PDT 2011


Hi all,

I'm new to ANTLR and I'm having a problem parsing text that contains
accented characters in Brazilian Portuguese.

I've defined a word rule in my grammar as follows:

		WORD : ( '\u00c0'..'\u00ff' | 'a'..'z' | 'A'..'Z' | '-' )+ ;

But parsing fails. Words like "não" ("no" in Portuguese) cause the lexer
to throw "Antlr.Runtime.NoViableAltException".
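
To make the intent of the rule concrete, here is a rough sketch in Python
(not ANTLR or C#). The regular expression is only an approximation of the
WORD rule, and the helper name is_word is made up for illustration. It
checks that every character of "não" falls inside the ranges the rule
allows, and then shows what the same word looks like if its UTF-8 bytes
were decoded as Latin-1. That mismatch is purely an assumption about one
possible cause, not something I have confirmed in my setup.

```python
import re

# Approximation of the WORD rule: U+00C0..U+00FF, ASCII letters, and '-'.
WORD = re.compile(r"[\u00c0-\u00ffa-zA-Z-]+")


def is_word(text: str) -> bool:
    """Return True if the whole string matches the approximated WORD rule."""
    return WORD.fullmatch(text) is not None


word = "n\u00e3o"  # "não"; the a-tilde is U+00E3, inside U+00C0..U+00FF
print(is_word(word))  # True

# Assumption for illustration only: the text was saved as UTF-8 but
# decoded as Latin-1. The a-tilde then becomes two characters,
# U+00C3 and U+00A3, and U+00A3 is outside every range in the rule.
misdecoded = word.encode("utf-8").decode("latin-1")
print([hex(ord(c)) for c in misdecoded])  # ['0x6e', '0xc3', '0xa3', '0x6f']
print(is_word(misdecoded))  # False
```

If the characters really do reach the lexer as U+00E3 and so on, the rule
itself should accept them, so the encoding of the input is one thing
worth checking.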

I'm trying to use C#.

Any hints?

TIA

Nilo, from Brasil...


