Articles

Implementing a Fileserver with Nginx and Lua.

With Nginx, it is easy to implement fairly complex file-upload logic, including metadata and authorization support, without any heavy application server. This article presents a basic implementation of such a fileserver using only Nginx and Lua.

An automatic terms extraction for Domain-specific corpora.

Simple frequency-based methods, such as the Domain Specificity method and Domain-Specific TF-IDF, make it possible to automatically extract and score terms for a given domain-specific corpus. In this article, we use Python and its ecosystem to illustrate these methods in action.
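As a rough illustration of the frequency-based idea, the sketch below scores each term by the ratio of its relative frequency in a domain corpus to its relative frequency in a general corpus. This is one common formulation of domain specificity and not necessarily the exact one used in the article. The function name `domain_specificity`, the add-one style smoothing, and the toy token lists are all assumptions made for this example.

```python
from collections import Counter


def domain_specificity(domain_tokens, general_tokens, smoothing=1.0):
    """Score domain terms by domain frequency relative to general frequency."""
    domain_counts = Counter(domain_tokens)
    general_counts = Counter(general_tokens)
    n_domain = len(domain_tokens)
    n_general = len(general_tokens)
    vocab_size = len(set(domain_tokens) | set(general_tokens))

    scores = {}
    for term, count in domain_counts.items():
        p_domain = count / n_domain
        # Smoothing (an assumption of this sketch) keeps terms that never
        # occur in the general corpus from causing a division by zero.
        p_general = (general_counts[term] + smoothing) / (
            n_general + smoothing * vocab_size
        )
        scores[term] = p_domain / p_general
    return scores


# Toy corpora, invented for illustration only.
domain = "bloom filter hash filter bit array hash filter".split()
general = "the cat sat on the mat and the filter was clean".split()

scores = domain_specificity(domain, general)
ranked = sorted(scores, key=scores.get, reverse=True)
```

Terms that are frequent in the domain corpus but rare in general text rise to the top of `ranked`, while a word that also appears in the general corpus is pushed down.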

Probabilistic data structures. Quotient filter.

In this article, we continue our acquaintance with implementations of probabilistic sets and consider a modern successor of the Bloom filter called the Quotient filter. Such data structures work well when we need to handle billions of elements with optimized memory access.
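The core idea behind a quotient filter can be sketched briefly: each element is hashed to a fingerprint of q + r bits, the upper q bits (the quotient) select a slot, and only the lower r bits (the remainder) are stored. The snippet below shows just this split. The helper name `fingerprint`, the use of SHA-256, and the chosen bit widths are assumptions for illustration, and the collision handling with metadata bits that a full quotient filter needs is deliberately left out.

```python
import hashlib


def fingerprint(item, q, r):
    """Split a (q + r)-bit fingerprint of item into (quotient, remainder).

    Assumes q + r <= 64, since only the first 8 bytes of the digest are used.
    """
    digest = hashlib.sha256(item.encode("utf-8")).digest()
    f = int.from_bytes(digest[:8], "big") & ((1 << (q + r)) - 1)
    quotient = f >> r                # upper q bits: index of the home slot
    remainder = f & ((1 << r) - 1)   # lower r bits: value stored in the slot
    return quotient, remainder


q, r = 8, 4
quotient, remainder = fingerprint("apple", q, r)
# The original fingerprint can be rebuilt from the slot index and the stored bits.
rebuilt = (quotient << r) | remainder
```

Because the quotient is implied by the slot position, only r bits per element have to be stored, which is where the space saving comes from.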

A Simple Way to Find Turning points for a Trajectory with Python.

Probabilistic data structures. Bloom filter.

In this article, we consider the Bloom filter, a popular implementation of a probabilistic set that can efficiently determine whether an element belongs to a large set without storing every element or performing many comparisons.
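To make the membership idea concrete, here is a minimal Bloom filter sketch. It is not the implementation from the article: the class name `BloomFilter`, the default sizes, and the choice of deriving several hash positions by salting SHA-256 are assumptions made for this example.

```python
import hashlib


class BloomFilter:
    """A small Bloom filter backed by a bit array."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((size_bits + 7) // 8)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting one hash with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True means "possibly present"; False means "definitely absent".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


bf = BloomFilter()
bf.add("apple")
bf.add("banana")
```

A lookup such as `"apple" in bf` returns `True` for every element that was added. For elements that were never added it usually returns `False`, but it can occasionally return `True`, which is the false-positive trade-off that buys the memory saving.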

Realtime Twitter Sentiment Analysis with Storm and Elasticsearch.

Andrii Gakhov, Ph.D.

I'm a mathematician and software engineer holding a Ph.D. in mathematical modeling and numerical methods. For a number of years I taught in the School of Computer Science at V. Karazin Kharkov National University, Ukraine, and I currently work as a software practitioner for ferret go GmbH, the leading community moderation, automation, and analytics company in Germany.