[antlr-interest] unicode support

Pete Forman pete.forman at westerngeco.com
Wed Dec 18 00:43:43 PST 2002


At 2002-12-17 13:31 -0800, John Lambert wrote:
>oops! I meant UCS4 not UCS2 (16bit)
>
>JOhn
>
>-----Original Message-----
>From: John Lambert [mailto:jlambert at nwlink.com]
>Sent: Tuesday, December 17, 2002 12:50 PM
>To: antlr-interest at yahoogroups.com
>Subject: RE: [antlr-interest] unicode support
>
>
>
>I would also recommend the use of ICU, I think this is now the de
>facto standard package for both C++ and java.
>It would also be quite probable that anyone producing a Unicode
>application would already be using the ICU package.
>
>
>Please allow the full Unicode 3.2 specification range, it can be
>represented in any of 3 formats:
>UCS2, UTF16 and UTF8.

UCS-4 is from ISO/IEC 10646.  In Unicode UTF-32 is a better format to
avoid variable length encoding though the differences are minor.

http://www.unicode.org/unicode/reports/tr19/

That said, UTF-16 is probably the best format for us within the
program.

-- 
Pete Forman                -./\.-  Disclaimer: This post is originated
WesternGeco                  -./\.-   by myself and does not represent
pete.forman at westerngeco.com    -./\.-   opinion of Schlumberger, Baker
http://petef.port5.com           -./\.-   Hughes or their divisions.


 

Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/ 



More information about the antlr-interest mailing list