Re: XML parsing - huge file strategy?


Don't ask to ask, just ask
	PerlMonks

Re: XML parsing - huge file strategy?

by pc88mxer (Vicar)

on Jul 18, 2008 at 14:43 UTC ( [id://698636]=note: print w/replies, xml )

Need Help??

in reply to XML parsing - huge file strategy?

Could I read it in a line at a time, look for the appropriate start/stop tags and parse it that way?

If your XML is formatted in a manner which is conducive to line-by-line parsing then it shouldn't be a problem.

Alternatively, if your database dump is just a sequence of one kind of element, this should also work:

open(XML, "<", ...);
{
  local($/) = '</record>';
  while (<XML>) { ...process one <record> element... }
}
[download]

With regard to why your experiment on the 400M record table is taking so long... are you grabbing 1M records at a time by using a LIMIT clause (i.e. LIMIT n, 1000000)? If you have any index on that table (say on column colX), adding ORDER BY colX should help speed things along.

Comment on Re: XML parsing - huge file strategy? Select or Download Code

In Section Seekers of Perl Wisdom

Domain Nodelet^?

www.com | www.net | www.org

Node Status^?

node history
Node Type: note [id://698636]
help

Chatterbox^?

How do I use this? • Last hour • Other CB clients

Other Users^?

Others chanting in the Monastery: (4)

As of 2024-04-26 08:01 GMT

Sections^?

Information^?

Find Nodes^?

Leftovers^?

Today I Learned

Voting Booth^?

No recent polls found