Clustering is the process of grouping the data into classes or clusters so that objects within a cluster have high similarity in comparison to one another, but are very dissimilar to objects in other clusters. Dissimilarities are assessed based on the attribute values describing the objects.
There are a large number of clustering algorithms. The ...

So the event is over. I think I can say for all three organizers, Mladen Prajdić, Matija Lah, and me, that we are tired now. However, we are extremely satisfied. It was a great event. First few numbers and comparison with SQL Saturday #274, the first SQL Saturday Slovenia event that took place last year. ...

I am proud and glad I can announce two top pre-conference seminars at the PASS SQL Saturday #356 Slovenia conference. The speakers and the seminars titles are: Stacia Misner - Power Up Your Data with Excel and Power BI Kevin Boles - Tune Like A Guru! Both seminars will take place on Friday, December 12th, in the classrooms of our sponsor ...

The session Troubleshooting Clusters by Allan Hirt (@SQLHA) has been published on channel SQLPASS TV.
Abstract
Whether you are new to clusters or have years of experience, you may still cross your fingers when implementing a failover cluster instance (FCI) of SQL Server or an availability group (AG). Both require an underlying Windows Server ...

Pure success!
I could simply stop here. However, I want to mention again everybody involved in this, and also some who were unfortunately missing.
First of all, PASS is the organization that defined SQL Saturdays. And apparently the idea works
I have to thank again to all of the speakers. Coming to share your amazing knowledge is something we ...

This is the third part of the fraud detection whitepaper. You can find the first part and the second part in my previous blog posts about this topic. Data Preparation The problem of credit card fraud detection is not trivial. With every transaction processed, only a limited amount of data is available, making it difficult if not impossible to ...

I love to learn about new technology, and I especially love a long deep-dive technical session with a real expert or a well-crafted, inches thick technical book. Even if either one is expensive. Learning is probably my favorite thing to do.
Yet I stand before you with an appeal: Stop “sending people to training.”
Why would I say such a thing? ...

I am proud to announce that my first course for Pluralsight is released. The course title is Logical and Physical Modeling for Analytical Applications. Here is the description of the course.
A bad data model leads to an application that does not perform well. Therefore, when developing an application, you should create a good data model from the ...

This is the second part of the fraud detection whitepaper. You can find the first part in my previous blog post about this topic. My Approach to Data Mining Projects It is impossible to evaluate the time and money needed for a complete fraud detection infrastructure in advance. Personally, I do not know the customer’s data in advance. I don’t ...

Many companies or organizations do regular data cleansing. When you cleanse the data, the data quality goes up to some higher level. The data quality level is determined by the amount of work invested in the cleansing. As time passes, the data quality deteriorates, and you need to repeat the cleansing process. If you spend an equal amount of ...