[antlr-interest] Re: Local lookahead depth
Oliver Zeigermann
oliver at zeigermann.de
Mon Nov 10 11:44:53 PST 2003
lgcraymer wrote:
> --- In antlr-interest at yahoogroups.com, "Oliver Zeigermann"
> <oliver at z...> wrote:
>
>
>>>>because of the memory issue. As a very practical exmaple I
>
> have
>
>>>parsing
>>>
>>>>of the AMM (Aircraft Maintenance Manual) which is available in
>>>
>>>SGML
>>>
>>>>(very hard to parse, really). I parsed this a few years using
>>>
>>>ANTLR, but
>>>
>>>>its size normally is around 100MB. A few years ago my machine
>
> had
>
>>>128MB
>>>
>>>>of RAM! You see what I mean?
>>>
>>>And how much disk space did you have? On a UNIX box, mmap() is
>
> a
>
>>>good way of automating file I/O, but even on systems without
>
> virtual
>
>>>memory, you can fake it. Performance is not an issue--with a
>
> problem
>
>>>of this size, nothing stays in the processor cache, and the
>
> overhead
>
>>>of the disk writes will be only a few percent.
>>>
>>>--Loring
>>
>>
>>Loring,
>>
>>are you really serious about this? Have a look at the DOM vs. SAX
>>discussion in the XML area...
>
>
> Of course. Large memory machines are a recent luxury, and it is not
> hard to use disks efficiently.
You know, when you have large amounts of data parsed, it *does* make a
difference if it takes one or ten hours per run.
Oliver
Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/
More information about the antlr-interest
mailing list