Category Archives: Big data

In this blog post, I will discuss the data challenge of the Machine Learning for Sport Analytics workshop (MLSA 2018) at PKDD 2018. The challenge consisted of predicting the receivers of football passes (pass prediction). I will first briefly describe … Continue reading →

In this blog post, I will talk about the future of research on pattern mining. I will also discuss some lessons learnt from the decades of research in this field and talk about research opportunities. What is the state of … Continue reading →

Today, the SPMF data mining software website has passed the milestone of 600,000 visitors. In recent years, SPMF has also been used in more than 500 research papers. All this success is thanks to the users and many contributors who have … Continue reading →

As you may know, I am one of the four co-organizers of the first international Workshop on Utility-Driven Mining (UDM2018), which will be held at KDD 2018 in London, England this August. I am quite excited about this workshop, and in … Continue reading →

In this blog post, I will discuss the PAKDD 2018 conference (Pacific-Asia Conference on Knowledge Discovery and Data Mining), held in Melbourne, Australia, from June 3 to June 6, 2018. About the PAKDD conference: PAKDD is an important conference … Continue reading →

This week, I attended the China International Big Data Industry Expo 2018 in Guiyang, China. I will describe this event and some of the key things that I have observed so far. What is the China International Big Data Industry … Continue reading →

In this blog post, I will talk about the vision of the Semantic Web that was proposed in the 2000s, and why it failed. Then, I will talk about how it has been replaced today by the use of … Continue reading →

In this post, I will provide two standard benchmark datasets that can be used for frequent subgraph mining. Moreover, I will provide a set of small graph datasets that I have created for debugging subgraph mining algorithms. The format of … Continue reading →

In this blog post, I will explain why the FSMS algorithm for frequent subgraph mining is incorrect. I am publishing this blog post because I found that the algorithm is incorrect after spending a few days to … Continue reading →