Python Programming Language

How to parse Usenet URLs?

I'm trying to parse newsgroup messages, and I need to follow URLs in this format: news://some.server. I can paste them into a newsreader with no problem, but I want to do it programmatically.

I can't figure out how to follow these links - anyone have any ideas?

In article <1180573018.786188.220@h2g2000hsg.googlegroups.com>,

"snewma@gmail.com" <snewma@gmail.com> wrote:

> I'm trying to parse newsgroup messages, and I need to follow URLs in
> this format: news://some.server. I can paste them into a newsreader
> with no problem, but I want to do it programmatically.

> I can't figure out how to follow these links - anyone have any ideas?

> How can I take the message link
> 'news://newsclip.ap.org/D8PE2G@news.ap.org' and follow it?

OK, gotcha. I misunderstood your original question. Perhaps "news:" is just a synonym for "nntp:"? This sounds like a dangerous assumption, and hopefully someone more knowledgeable will come along and shoot me down. =) But when I fire up Ethereal and paste that news: URL into my browser, Firefox launches my newsreader client, and Ethereal reports that the client connects to the remote server on the NNTP port (119) and sends an NNTP LIST command. Ethereal identifies the subsequent conversation as NNTP.

If I were you I'd try handling news: URLs with nntplib. I bet it will work.
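Something along these lines might be a starting point. It is only a rough sketch, not tested against the AP server. It assumes the part after the host name is a message-ID (so it gets wrapped in angle brackets for the NNTP ARTICLE command), that the server accepts anonymous connections on port 119, and that you are on Python 3 with nntplib available (it was dropped from the standard library in Python 3.13). The names parse_news_url and fetch_article are made up for this example.

```python
from urllib.parse import urlsplit, unquote

NNTP_PORT = 119  # the port the newsreader was seen connecting to


def parse_news_url(url):
    """Split news://host[:port]/target into (host, port, target)."""
    parts = urlsplit(url)
    if parts.scheme not in ("news", "nntp"):
        raise ValueError("not a news: or nntp: URL: %r" % url)
    if not parts.hostname:
        raise ValueError("no server in URL: %r" % url)
    # Whatever follows the host; empty for a bare news://some.server URL.
    target = unquote(parts.path.lstrip("/"))
    return parts.hostname, parts.port or NNTP_PORT, target


def fetch_article(url):
    """Fetch the article a news: URL points at; returns its lines as text."""
    # Imported here so the parsing helper still works where nntplib is missing.
    import nntplib

    host, port, target = parse_news_url(url)
    if "@" not in target:
        # Assumption: only message-ID style URLs are handled in this sketch.
        raise ValueError("URL does not contain a message-ID: %r" % url)
    message_id = "<%s>" % target  # NNTP wants the ID in angle brackets
    with nntplib.NNTP(host, port) as server:
        _resp, info = server.article(message_id)
    # nntplib returns raw bytes; latin-1 decoding never fails.
    return [line.decode("latin-1") for line in info.lines]
```

You would then call fetch_article('news://newsclip.ap.org/D8PE2G@news.ap.org') and print the lines it returns. If the server wants a login, nntplib.NNTP also takes user and password arguments.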