Forum Moderators: coopster & phranque

Message Too Old, No Replies

Reliable RSS Parser?

         

Brett_Tabke

1:36 pm on Feb 20, 2007 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



Well, the XML RSS parser is bascially junk. It can't handle feeds with even the slightest of errors. On todays junkified rss scape it just don't cut it. I tried DIRTY RSS for awhile, but it too has left a lot to be lacking.

Anyone have any off-the-shelf solutions to parsing the wide range of rss/xml?

Birdman

2:38 pm on Feb 20, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I haven't tried this, but it appears to be what you're after.

Feed::Parser [turner.mikomi.org] - Perl extension to reliably parse RSS and Atom feeds

This is designed to be universal feed parser much like the Python one here: [feedparser.org...]
It differs from other feed parsers on the CPAN by being very robust and error tolerant. It also supports a wider variety of extensions then the other parsers.

Brett_Tabke

2:46 pm on Feb 20, 2007 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



thanks... will give it a try!

Birdman

2:57 pm on Feb 20, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Quite welcome! I'm interested to hear how it handles errors.

wruppert

5:14 pm on Feb 20, 2007 (gmt 0)

10+ Year Member



Thanks for this thread. I've been needing a better RSS parser too.

Brett_Tabke

5:37 pm on Feb 20, 2007 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



all things considered, I have went with xml::rss::parser

Nifty article about rss:
[diveintomark.org...]

Brett_Tabke

6:52 pm on Feb 26, 2007 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



All things considered - xml::rss is still a poor parser... ugh.

I took a hard look at Feed::Parser Birdman, but the prereg modules are far to steep. I couldn't run alot of them under windows.

Sooooo, back to toying with dirty rss and hacking my own.