[antlr-interest] Recognizing 5-th hex digit [correction]
David-Sarah Hopwood
david-sarah at jacaranda.org
Wed Aug 26 17:21:31 PDT 2009
David-Sarah Hopwood wrote:
> Kieran Beltran wrote:
>> A review of the UCS documentation
>> http://unicode.org/Public/UNIDATA/UnicodeData.txt indicates that indeed
>> there is a 5-th hex digit that is used "publically", albeit infrequently -
>> primarily for mathematics, musical symbols and other areas.
>
> Strictly speaking the code unit range
code point range, I meant.
> goes up to U+10FFFF (not all of which
> are valid characters). The \u notation isn't typically used for characters
> above U+FFFF, because it would be ambiguous with a four-digit escape
> followed by an unescaped hex digit.
--
David-Sarah Hopwood ⚥ http://davidsarah.livejournal.com
More information about the antlr-interest
mailing list