Algorithm for text processing

Hello friends, I want to found some algorithms for text processing.
I have a lots of entries in the database and now I want to split by category (news, history, sport, business etc...) but I don't know none algorithm(s) for text processing.

So, my question is, what is the most popular algorithms for text processing (split by categories, find most similar items etc...) ?