A Survey of Link Prediction in Social Networks

Abstract

Link prediction is an important task for analying social networks which also has applications in other domains like, information retrieval, bioinformatics and e-commerce. There exist a variety of techniques for link prediction, ranging from feature-based classification and kernel-based method to matrix factorization and probabilistic graphical models. These methods differ from each other with respect to model complexity, prediction performance, scalability, and generalization ability. In this article, we survey some representative link prediction methods by categorizing them by the type of the models. We largely consider three types of models: first, the traditional (non-Bayesian) models which extract a set of features to train a binary classification model. Second, the probabilistic approaches which model the joint-probability among the entities in a network by Bayesian graphical models. And, finally the linear algebraic approach which computes the similarity between the nodes in a network by rank-reduced similarity matrices. We discuss various existing link prediction models that fall in these broad categories and analyze their strength and weakness. We conclude the survey with a discussion on recent developments and future research direction.

Liben-Nowell, David, and Kleinberg, Jon. (2007). The Link Prediction Problem for Social Networks. Journal of the American Society for Information Science and Technology, 58(7):1019-1031.CrossRefGoogle Scholar

Niculescu-Mizil, and Alexandru, and Caruana, Rich. (2005). Predicting Good Probabilities with Supervised Learning. International Conference on Machine Learning.Google Scholar

[43]

Oyama, Satoshi, and Manning, Christopher D., (2004). Using feature conjunctions across examples for learning pairwise classifiers, In The Proc. of European Conference on Machine Learning, pp. 323-333.Google Scholar

Tylenda, Tomasz, and Angelova, Ralitsa, and Bahadur, Srikanta. (2009). Towards time-aware link prediction in evolving social network. SNA-KDD ’09: Proceedings of the third Workshop on Social Network Mining and Analysis.Google Scholar

Xu, Zhao, and Tresp, Volker, and Yu, Shipeng, and Yu, Kai. (2005). Nonparametric Relational Learning for Social Network Analysis. SNA-KDD ’08: In Proceedings of the Second Workshop on Social Network Mining and Analysis.Google Scholar