To address the problem of accumulating large collections of facts, we are developing KnowItAll — a domain-independent system that extracts massive amounts of information from the Web in an autonomous, scalable manner.

More about KnowItAll here (a paper from the WWW2005 conference; PDF) and here (from WWW20004; PDF.