[antlr-interest] unicode support
Pete Forman
pete.forman at westerngeco.com
Wed Dec 18 00:43:43 PST 2002
At 2002-12-17 13:31 -0800, John Lambert wrote:
>oops! I meant UCS4 not UCS2 (16bit)
>
>JOhn
>
>-----Original Message-----
>From: John Lambert [mailto:jlambert at nwlink.com]
>Sent: Tuesday, December 17, 2002 12:50 PM
>To: antlr-interest at yahoogroups.com
>Subject: RE: [antlr-interest] unicode support
>
>
>
>I would also recommend the use of ICU, I think this is now the de
>facto standard package for both C++ and java.
>It would also be quite probable that anyone producing a Unicode
>application would already be using the ICU package.
>
>
>Please allow the full Unicode 3.2 specification range, it can be
>represented in any of 3 formats:
>UCS2, UTF16 and UTF8.
UCS-4 is from ISO/IEC 10646. In Unicode UTF-32 is a better format to
avoid variable length encoding though the differences are minor.
http://www.unicode.org/unicode/reports/tr19/
That said, UTF-16 is probably the best format for us within the
program.
--
Pete Forman -./\.- Disclaimer: This post is originated
WesternGeco -./\.- by myself and does not represent
pete.forman at westerngeco.com -./\.- opinion of Schlumberger, Baker
http://petef.port5.com -./\.- Hughes or their divisions.
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
More information about the antlr-interest
mailing list