Clustering of protein models is the task of separating a set of protein models (here a protein dynamics trajectory) into groups (called clusters). The clustering is done in such a way that models are more similar in the same group to each other (here in the sense of RMSD measure), than those in other groups (clusters). CABSfold utilizes classical K-means clustering method.
After clustering is done, each cluster representative is chosen (always the model which average dissimilarity to all models in a cluster is minimal). Predicted protein models, presented in the Predicted structures tab, are each cluster representatives (the clusters and the corresponding models are marked by the same numbers, e.g. Model 1 represents Cluster 1).
The clusters are numbered/ranked according to cluster density values, from the most dense (numbered as a first) to the least dense one.