[antlr-interest] big XML file support

Scott Stanchfield scott at javadude.com
Mon May 15 15:42:53 PDT 2006


Check out ANTXR (http://javadude.com/tools/antxr). It's an ANTLR
derivative that I wrote to make XML parsing nice and simple.

You can use a SAX or XMLPULL front end for the scanner.
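
As a rough illustration of the streaming, SAX-style approach (this is NOT
ANTXR's own API, just a minimal sketch using Python's standard xml.sax
module), a handler can keep the outline of the tree while only recording
positions for the <d> elements. The names SkeletonHandler and outline are
made up for this example, and the positions are the parser's line/column
values, not byte offsets, so mapping them to file offsets is left as an
assumption.

```python
import xml.sax


class SkeletonHandler(xml.sax.ContentHandler):
    """Keeps an outline of the tree; skipped elements only leave a position.

    Hypothetical helper for illustration, not part of ANTXR or ANTLR.
    """

    def __init__(self, skip=("d",)):
        super().__init__()
        self.skip = set(skip)
        self.locator = None
        self.root = None
        self.stack = []      # currently open, non-skipped elements
        self.skip_depth = 0  # > 0 while inside a skipped element

    def setDocumentLocator(self, locator):
        # The parser hands over a locator that reports the current position.
        self.locator = locator

    def _position(self):
        return (self.locator.getLineNumber(), self.locator.getColumnNumber())

    def startElement(self, name, attrs):
        if self.skip_depth:
            # Nested inside a skipped element: ignore it entirely.
            self.skip_depth += 1
            return
        if name in self.skip and self.stack:
            # Record where the element starts instead of building a node.
            self.stack[-1]["skipped"].append((name, self._position()))
            self.skip_depth = 1
            return
        node = {
            "name": name,
            "attrs": dict(attrs.items()),
            "pos": self._position(),
            "children": [],
            "skipped": [],
        }
        if self.stack:
            self.stack[-1]["children"].append(node)
        else:
            self.root = node
        self.stack.append(node)

    def endElement(self, name):
        if self.skip_depth:
            self.skip_depth -= 1
            return
        self.stack.pop()

    # characters() is deliberately not overridden, so text and CDATA
    # content is discarded rather than held in memory.


def outline(xml_bytes):
    """Parse XML bytes and return the outline built by SkeletonHandler."""
    handler = SkeletonHandler()
    xml.sax.parseString(xml_bytes, handler)
    return handler.root
```

For a multi-GB file, xml.sax.parse(path, handler) would be the streaming
entry point instead of parseString; the handler itself stays the same.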

Let me know if you have any questions,
-- Scott

> Suppose I have a file that looks like this:
>
> <a>
>   <b>
>     <c>
>       <d /> <d /> <d /> ... For a few GB worth
>     </c>
>     <c binary="true">
> <![CDATA[ about 10GB of binary data ]]>
>     </c>
>   </b>
> </a>
>
> I need a parser to go through and build up a structure with the tree but
> without any <d> or binary data. Instead, I just want to record the file
> locations for those and I'll go pull them from the file when I need them.
> Is ANTLR a good tool to do that or am I better off parsing by hand? Or
> should I use Xerces? Or, the real question, does ANTLR have some ability
> to do XML-type structures easily? What are the largest files you've parsed
> using ANTLR? I'm using C++. Thanks for your time.
>
> ______________________________
> Brannon King
>



