Hi everybody,
Is there any alternative to Parser.XML, accessible from Pike? Modules etc? Or are there any plans to rewrite it in C, possibly using something like libxml2? Perhaps someone may ask - what is wrong with Parser.XML?.. Well, it takes approximaly 20M RAM and 6 seconds to parse (only) 1M XML file (it contains approx. 80000 tags). This is a way too much, IMHO (libxml2 takes approx. 0.5s and only 4M RAM for same file - while this is not perfect too but more acceptable). Any ideas? Suggestions? Comments? Flames? :) Regards, /Al
If you don't need a true XML parser, you could probably use Parser.HTML. I don't think it get very much faster then that, of course, depending on what you do.
/ Mirar
Previous text:
2003-02-28 16:49: Subject: Parser.XML - The slow, the fat and the hungry :)
Hi everybody,
Is there any alternative to Parser.XML, accessible from Pike? Modules etc? Or are there any plans to rewrite it in C, possibly using something like libxml2?
Perhaps someone may ask - what is wrong with Parser.XML?.. Well, it takes approximaly 20M RAM and 6 seconds to parse (only) 1M XML file (it contains approx. 80000 tags). This is a way too much, IMHO (libxml2 takes approx. 0.5s and only 4M RAM for same file - while this is not perfect too but more acceptable).
Any ideas? Suggestions? Comments? Flames? :)
Regards, /Al
/ Brevbäraren
marek is working on proper libxml glue. check out his work, he might even enjoy getting your help :-)
greetings, martin.
On Fri, Feb 28, 2003 at 04:48:50PM +0100, Alexander Demenshin scribbled:
Hi everybody,
Is there any alternative to Parser.XML, accessible from Pike? Modules etc? Or are there any plans to rewrite it in C, possibly using something like libxml2?
check out the pexts from the caudium.net CVS. I have put an XML2 glue there recently. It's not nearly finished and I'm changing the callback API right now, but it's usable.
marek
Or are there any plans to rewrite it in C, possibly using something like libxml2?
Parser.XML is already written in C. When it was implemented there wasn't any good alternatives (as often is the case when we decide to do things our own way), but now it would make sense to replace it with a SAX-enabled parser. This was discussed on the last Pike summit and Martin Bähr was appointed coordinator for this effort.
/ Martin Nilsson (har bott i google)
Previous text:
2003-02-28 16:49: Subject: Parser.XML - The slow, the fat and the hungry :)
Hi everybody,
Is there any alternative to Parser.XML, accessible from Pike? Modules etc? Or are there any plans to rewrite it in C, possibly using something like libxml2?
Perhaps someone may ask - what is wrong with Parser.XML?.. Well, it takes approximaly 20M RAM and 6 seconds to parse (only) 1M XML file (it contains approx. 80000 tags). This is a way too much, IMHO (libxml2 takes approx. 0.5s and only 4M RAM for same file - while this is not perfect too but more acceptable).
Any ideas? Suggestions? Comments? Flames? :)
Regards, /Al
/ Brevbäraren
pike-devel@lists.lysator.liu.se