[antlr-interest] Unicode character classes

Douglas Godfrey douglasgodfrey at gmail.com
Fri Dec 16 07:24:04 PST 2011


there all in the sql2003 grammar including all characters valid in  an
Latin-1 and/or Unicode Identifier


On Fri, Dec 16, 2011 at 2:11 AM, Douglas Godfrey <douglasgodfrey at gmail.com>
wrote:
> 
> A set of rules for all UNICODE code pages can be found in the SQL2003
> grammar on the Antlr downloads page.
> 

I think that ideally a proper Unicode library should deal with valid and
invalid characters, or other checks.
For example, glib has a validate function,
http://developer.gnome.org/glib/2.30/glib-Unicode-Manipulation.html#g-unicha
r-validate
Other programming environments have similar libraries.

Simos





More information about the antlr-interest mailing list