The SitePoint Forums have moved.

You can now find them here.
This forum is now closed to new posts, but you can browse existing content.
You can find out more information about the move and how to open a new account (if necessary) here.
If you get stuck you can get support by emailing forums@sitepoint.com

If this is your first visit, be sure to
check out the FAQ by clicking the
link above. You may have to register
before you can post: click the register link above to proceed. To start viewing messages,
select the forum that you want to visit from the selection below.

Developing a blog feed aggregator

Hi,

I have the task of developing a blog feed aggregator. We have a website where the members have their own personal blogs. I need to design a script that gets their blog entries and puts in our database every couple of hours.

I've pretty much managed to do this and it works for basic wordpress style rss feeds, however what happens if the feed is in a different format? I had kind of assumed that RSS was RSS and one parser would suit all, but after doing a little testing this is not turning out to be the case.

Can anyone tell me how many different type of RSS / XML feeds I need to account and code for?

Is this the best way in your opinions of creating a feed aggregator site?

Any thoughts or opinions appreciated, i'm not sure if i'm barking up the wrong tree completely with this one.

Cool thanks, will try out those leads now. At the moment i'm using some kind of custom built thing I got off a friend, but whatever approach I take it doesn't sem to be able to deal with the Atom feed. I should have thought it ought to auto detect it, after all I won't necessarily know what kind of feed it is before I parse it. I thought the parser would be able to recognise the file format and parse accordingly.