The world needs a phishing corpus. Basically, how hard is it to gather up a few hundred phishing emails and assemble them into an unmolested mailbox for study? Oddly, it's easy to do yet no one has done it.

I collect my spam, I always have ... it's a compulsion by now. In this time period (nov 2004 - june 2005) I collected over 32,000 spams, yet only about 415 phishing emails. This corpus is hand selected from these messages and contains nothing but good old fashioned phishing emails.

To the best of my knowledge this is the first such phishing corpus publicly available.