Thursday, July 19, 2007

Lucene Tag Cloud Generator

I build the cloud from reading the lucene index and pruning it down. It is pruned down by a junk words file which can be used to control how it gets pruned down.Once I build the list I run a javascriipt file passing in the results, and then the javascript outputs the cloud.There are a few files to all of this....

JavaSourceCodeThe source code requires lucene. Though I wrote it as a Nutch plugin, it does not depend on Nutch.

JunkWordsFileThe junk word file contains terms, and some options.The options are baked into the code.