Semantic Suffix Net Clustering for Search Results

Executive Summary

Suffix Tree Clustering (STC) uses the suffix tree structure to find a set of snippets that share a common phrase and uses this information to propose clusters. As a result, STC is a fast incremental algorithm for automatic clustering and labeling but it cannot cluster semantically similar snippets. However, the meaning of the words is indeed an important property that relates them to other words, although there may not be a match of text strings per se. In this paper, the authors propose a new semantic search results clustering algorithm, called Semantic Suffix Net Clustering (SSNC). It is based on Semantic Suffix net Structure (SSN).