Hey everyone,
Been super busy since last week's meeting, but started
reading up on k-means clustering and expectation-maximization,
in the hopes that I can use one of these techniques to start
clustering the KDD data.
Tonight I'm finally getting around to using Weka's built-in
clustering to see if it works with the KDD data:
http://weka.wikispaces.com/Using+cluster+algorithms
Can't promise anything in terms of results, but tomorrow I'd
be happy to give a (very) brief overview of k-means clustering
and expectation maximization, and hopefully some preliminary
results with a subset of the KDD data.
Perhaps some of us could work together to implement a clustering algorithm
in map-reduce form to run on an Elastic MapReduce cluster!
Looking forward to seeing everyone tomorrow,
mike