"... This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs. A ..."

This paper presents a simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (thumbs down). The classification of a review is predicted by the average semantic orientation of the phrases in the review that contain adjectives or adverbs

"... We study unsupervised classification of text documents into a taxonomy of concepts annotated by only a few keywords. Our central claim is that the structure of the taxonomy encapsulates background knowledge that can be exploited to improve classification accuracy. Under our hierarchical Dirichlet ..."

We study unsupervisedclassification of text documents into a taxonomy of concepts annotated by only a few keywords. Our central claim is that the structure of the taxonomy encapsulates background knowledge that can be exploited to improve classification accuracy. Under our hierarchical Dirichlet

"... HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte p ..."

HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et a ̀ la diffusion de documents scientifiques de niveau recherche, publiés ou non, émanant des établissements d’enseignement et de recherche français ou étrangers, des laboratoires publics ou privés.

"... this paper we are particularly interested in quadratic cluster boundaries, which amounts to selecting the model class of multivariate normal densities. We need to calculate the code length L(X jc) ..."

"... Nevada, tested automatic classification of landforms. Landform classification is based largely on size, shape, orientation, and relief of an area (MacMillan et al. 2007). Also, since most landforms on earth’s surface depend on adjacent bodies, it is difficult to ..."

Nevada, tested automatic classification of landforms. Landform classification is based largely on size, shape, orientation, and relief of an area (MacMillan et al. 2007). Also, since most landforms on earth’s surface depend on adjacent bodies, it is difficult to

"... Abstract: Machine vision systems typically classify images of a flotation froth surface into one of a distinct set of classes. This process typically involves having an experienced operator identify a set of froth classes. After this, a machine vision system is trained to identify these froth classe ..."

classes. Identifying these froth classes is particularly challenging for froths which have “dynamic ” bubble size distributions. Using unsupervised clustering algorithms, it is possible to automatically learn these froth classes without user input. Validation of this technique is done by showing

"... Abstract—Ancient manuscript analysis is to aid historian to classify, annotate and judge the authenticity of larger collections of ancient manuscripts. Previous method was to examine the composition of manuscript such as paper support or inks by destructive sampling and chemical analysis. The aim of ..."

of this paper is to present an image-based non-destructive and non-invasive method to analyze ancient manuscripts. A new unsupervisedclassification algorithm was designed to distinguish different type of paper support without any priori knowledge. The advantage of this classifier comparing with traditional

"... Data acquisition is a major concern in text classification. The excessive human efforts required by conventional methods to build up quality training collection might not always be available to research workers. In this paper, we look into possibilities to automatically collect training data by samp ..."

Data acquisition is a major concern in text classification. The excessive human efforts required by conventional methods to build up quality training collection might not always be available to research workers. In this paper, we look into possibilities to automatically collect training data