[antlr-interest] Multipass parsing

Sun Mar 28 09:02:22 PST 2004

What's currently the best way to do multipass parsing (while retaining
all location information so that error messages are informative)?

Basically, I would like to be able to evolve a parser by starting from
recognizing and processing a coarse-grained structure and then
refining it, e.g., suppose I have a language that can be thought of at
a first cut as consisting of space-delimited tokens, and then I wish
also to parse within those tokens eventually.

Say I have a stream of text
    It was long-lived - 1810-1910 had 3+4-1.
and I wish to parse this by first splitting into 
    It
    was
    long-lived
    -
    1810-1910
    had
    3+4-1.
and refining eventually to what one might expect: a parse tree
containing words, compound words, dashes (delimiters in sentences),
durations of dates, arithmetic expressions, sentence terminator.

It seems difficult or monolithic to attempt to do this by creating
tokens in one pass and then parsing that stream of tokens.  Rather, it
seems it would be best to create token streams, which are then
manipulated to create refined token streams, etc.

Any ideas on how I should proceed?

-- 
Franklin

Yahoo! Groups Links

<*> To visit your group on the web, go to:
     http://groups.yahoo.com/group/antlr-interest/

<*> To unsubscribe from this group, send an email to:
     antlr-interest-unsubscribe at yahoogroups.com

<*> Your use of Yahoo! Groups is subject to:
     http://docs.yahoo.com/info/terms/