Text (word) analysis and tokenized text modeling always give a chill air around ears, specially when you are new to machine learning. Thanks to Python and its extended libraries for its warm support around text analytics and machine learning. Scikit-learn is a savior and excellent support in text processing when you also understand some of the concept like "Bag of word", "Clustering" and "vectorization". Vectorization is must-to-know technique for all machine leaning learners, text miner…

I just created a cloud at home. Suppose I start on a project aiming to create a computer game. I purchase 4 servers and some software. After a couple of weeks I realize that in order to complete the project I'll need 6 more servers, but I have run out of money. I decide to write an…

Did you know? Organizations classified as 'Fact Finders' — described as more analytically oriented — are 20% more likely to be among leaders within their industry. While business analytics has been around for decades,…

The Balanced Scorecard (BSC) is a new buzz-word that stands for the performance management magic pill. Many books on management praise this business concept and report an impressive adoption rate by Fortune 1000 companies. When it comes to practice it appears that only top management in the company knows about the Balanced Scorecard and it is used at a minimum of its potential.

Over the last several months, as I looked at addressing the business needs across various industries as someone leading a team of Data Scientists, the question of domain expertise invariably cropped up.

Attending one meeting with a Pharmaceutical company, I was posed with the question of, "Have you done work in the areas of Rare Signal detection?" In a similar vein, while preparing for a meeting with an Auto finance major, the question was in the area of using Auto…

Google recently replaced its AdWords MySql Database with a Database that they built in-house namely F1 Database. AdWords serves thousand of users, " which all share a database over 100TB serving up hundreds of thousands of requests per second, and runs SQL queries that scan…

The pivot role of a wild savant woman in the midst of revolutions and enlightenment: Emilie du Châtelet from my summer book Passionate Minds of David Bodanis, Three Rivers Press N.Y, 2006. I am impressed by Emilie ‘ insights permeated into the scientific mainstream with her unconventional but logic thoughts, so original that her male scientists recycled her ideas even forgot who had originated them. Emilie du Châtelet contributed the square to Einstein concept of energy…

Have you ever done a Google search for mining data? It returns the same results as for data mining. Yet these are two very different keywords: mining data usually means data about mining. And if you search for data about mining you still get the same results anyway.…