Comparing Machine Learning Classifiers for Movie WOM Opinion Mining

Abstract

Nowadays, online word-of-mouth has become a powerful influencer to marketing and sales in business. Opinion mining and sentiment analysis is frequently adopted at market research and business analytics field for analyzing word-of-mouth content. However, there still remain several challengeable areas for 1) sentiment analysis aiming for Korean word-ofmouth content in film market, 2) availability of machine learning models only using linguistic features, 3) effect of the size of the feature set. This study took a sample of 10,000 movie reviews which had posted extremely negative/positive rating in a movie portal site, and conducted sentiment analysis with four machine learning algorithms: na?ve Bayesian, decision tree, neural network, and support vector machines. We found neural network and support vector machine produced better accuracy than na?ve Bayesian and decision tree on every size of the feature set. Besides, the performance of them was boosting with increasing of the feature set size.