I hereby propose to work on "Text mining add-on for Orange" (as listed at http://orange.biolab.si/trac/wiki/GSoC/Ideas) in GSoC 2012. As my resume (linked above) states, I'm comfortable with Python, MATLAB/Octave and C, so I believe my work on source code refactoring (for compliance with the Orange 2.5 development guidelines), documentation, unit tests and PyPI-supported installation would be very useful to the Orange project (in my humble opinion). In addition, since I have been taking Stanford's Artificial Intelligence, Machine Learning, Probabilistic Graphical Models and Natural Language Processing online classes and Caltech's Learning from Data online class, I believe the knowledge I gain from these courses will facilitate the comparison between the existing text pre-processing techniques in Orange and state-of-the-art algorithms (and their reimplementation, if needed), as stated on the ideas page.

With the knowledge gained by working on this project, I intend to continue contributing to the Orange project even after GSoC is completed (my primary academic/research interests are Computational Intelligence, Machine Learning and Data Mining).

I read on the ideas page that a possible mentor for this project is Črt. I'm posting this proposal here so that the development team is notified of it. Feel free to contact me here (or privately) for any details.