[antlr-interest] Accentuated chars in brazilian portuguese
Bart Kiers
bkiers at gmail.com
Wed Jun 1 13:59:46 PDT 2011
Hi Nilo,
The grammar:
grammar Brasil;

parse
  : WORD EOF
  ;

WORD
  : ('\u00c0'..'\u00ff' | 'a'..'z' | 'A'..'Z' | '-')+
  ;
parses the input "não" just fine in ANTLRWorks.
I'm not really familiar with C#, but for the benefit of those who are, could
you post *how* you are testing it? (A test rig that reproduces the behavior
you describe would help.)
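In the meantime, here is a minimal sketch in Python (not ANTLR or C#) of one thing worth checking. It mirrors the WORD rule as a regular expression and shows that "não" is inside the range, while the same text read with the wrong decoder is not. The encoding mismatch is only an assumption about what might be happening on your side, and the names is_word and mis_decoded are made up for this illustration.

```python
import re

# Mirrors the WORD lexer rule: U+00C0..U+00FF, ASCII letters, and '-'.
WORD = re.compile(r"[\u00c0-\u00ffa-zA-Z-]+")


def is_word(text: str) -> bool:
    """Return True if the whole string consists of characters WORD accepts."""
    return WORD.fullmatch(text) is not None


def mis_decoded(text: str) -> str:
    """Simulate UTF-8 bytes being read with a Latin-1 decoder.

    Hypothetical failure mode, assumed for illustration only.
    """
    return text.encode("utf-8").decode("latin-1")


if __name__ == "__main__":
    word = "n\u00e3o"  # "não", written with an escape to avoid source-encoding issues
    print(is_word(word))               # correctly decoded input
    print(is_word(mis_decoded(word)))  # same bytes, wrong decoder
```

If the second check fails while the first succeeds, the grammar is fine and the problem would be in how the input stream is created, so the encoding used to read the input is worth including in the test rig.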
Regards,
Bart.
On Wed, Jun 1, 2011 at 10:53 PM, Nilo Roberto C Paim <nilopaim at gmail.com> wrote:
> Hi all,
>
> I'm a newbie with ANTLR and I'm facing a problem when trying to parse text
> that contains accented characters in Brazilian Portuguese.
>
> I've put a word definition in my grammar as follows:
>
> WORD : ( '\u00c0'..'\u00ff' | 'a'..'z' | 'A'..'Z' | '-' )+ ;
>
> But parsing fails. Words like "não" ("no" in Portuguese) cause the lexer
> to throw "Antlr.Runtime.NoViableAltException".
>
> I'm trying to use C#.
>
> Any hint?
>
> TIA
>
> Nilo, from Brasil...