[antlr-interest] Recognizing 5-th hex digit [correction]

David-Sarah Hopwood david-sarah at jacaranda.org
Wed Aug 26 17:21:31 PDT 2009


David-Sarah Hopwood wrote:
> Kieran Beltran wrote:
>> A review of the UCS documentation
>> http://unicode.org/Public/UNIDATA/UnicodeData.txt indicates that indeed
>> there is a 5-th hex digit that is used "publicly", albeit infrequently -
>> primarily for mathematics, musical symbols and other areas.
> 
> Strictly speaking the code unit range

code point range, I meant.

> goes up to U+10FFFF (not all of which
> are valid characters). The \u notation isn't typically used for characters
> above U+FFFF, because it would be ambiguous with a four-digit escape
> followed by an unescaped hex digit.
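
To make the ambiguity concrete, here is a small sketch. It uses Python
purely as an illustration (an assumption on my part; it is not ANTLR or
Java), because Python's string literals follow the same four-digit \u
convention and add a separate eight-digit \U form. U+1D11E (MUSICAL
SYMBOL G CLEF) is just a convenient example of a code point above U+FFFF.

```python
# Illustration only: Python string escapes, not ANTLR grammar syntax.

# Five hex digits after \u: only the first four are consumed, so this is
# U+1D11 followed by the ordinary letter 'E'.
four_digit = "\u1D11E"

# Python's eight-digit \U form names the single code point U+1D11E.
eight_digit = "\U0001D11E"

print(len(four_digit), [hex(ord(c)) for c in four_digit])
print(len(eight_digit), [hex(ord(c)) for c in eight_digit])

# Code points vs. code units: in UTF-16 this one code point occupies two
# 16-bit code units (a surrogate pair).
utf16_units = eight_digit.encode("utf-16-be").hex()
print(utf16_units)

# The code point range ends at U+10FFFF.
MAX_CODE_POINT = 0x10FFFF
```

The first literal is two characters long and the second is one, which is
the reason a five-digit \u escape can't simply be allowed without changing
the meaning of existing text.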

-- 
David-Sarah Hopwood  ⚥  http://davidsarah.livejournal.com


