I try to take part in some Kaggle competitions, with mixed results. But recently I have been forced to reinstall my system quite a few times, and setting it up with all the libraries takes a while. So I have decided to dockerize everything. NVIDIA has released a wrapper on top of Docker that allows access to ...

Investigating network problems within Docker can be tricky. Let’s see how to get more detail on the communication between Docker and the host. If you are familiar with Wireshark and Linux, then you have probably also heard of tcpdump. If not, it is a tool we can use to see what is happening ...

Not so long ago I obtained SSL certificates for all of my domains, including thedatawise.com, so from now on it will be safer to browse my blog. All HTTP traffic is also redirected to HTTPS, so there is no way to use the unsafe version of the website. The certificates were issued by Let’s ...

I just got a new camera and lens. As my old Lightroom 4 cannot deal with the AWR raw files, I decided to try something else. I did not want to switch my workflow to Linux, so I decided to try Capture One 9. On one of the very first pictures I tried to process ...

This post will be short, only to mention that I joined the Amazon Affiliate Program. Basically, if you want to buy something on Amazon (.com or .co.uk), you can go to the website through my link, and if you buy something that way I may earn a small commission. As nobody has bought anything yet ...

If you have an AMD GPU and you want to do some Deep Learning, there is not much choice. You have to pick something based on OpenCL until AMD starts to support CUDA. The recently announced Boltzmann Initiative will bring CUDA and C++ support to AMD’s GPUs, but we have to wait for that ...

A couple of years ago I had a ‘funny’ case with Network Attached Storage (NAS) by Cisco/Linksys (NSS 4000 and NSS 6000). The idea was to create a backup solution for a company, where all important documents from desktops and laptops would be backed up. Dropbox had started about a year earlier and it was not a ...

I have stumbled upon a B-Tree template library written in C++. The library was written by someone from Google and can be downloaded from http://code.Google.com/p/cpp-btree/. According to the authors, its advantage over the standard library implementations of the map and set data structures is lower memory overhead. Usually those are implemented using Red-Black trees, which ...

If you are looking for a truly in-memory open source database, then VoltDB might be the only choice available at the moment. Unfortunately for me, there are no binaries available for Windows, as a big part of the code was written in C/C++, which is hard to port from Linux to Windows. But there is a way ...

Some time ago I wrote a post, “Financial Times and Google”, describing how to read articles from behind the paywall at ft.com. That method does not work anymore, but there is another way, even simpler. Just go to www.google.com and paste a link to the article you want to read into the text box, e.g. http://www.ft.com/cms/s/0/a8573532-65bf-11e5-97e9-7f0bf5e7177b.html ...

I decided to redesign my CV a little bit. The old LaTeX template, while giving neat output, is not flexible. Every small change takes a lot of time, just to make sure that everything is in the right place. So, as I got my hands on MS Office 2016, I could start ...

Not the latest news, but I am more than happy to see it. MS Office 2016 will feature a dark-themed user interface, which apparently was “the most requested feature for Office 2013”, see theverge.com. If you want, you can check it yourself. There is a public preview version available for free from Microsoft’s website. ...

I knew a day like that would come, but I did not know I would last so long before it did. I ran out of space on my free Dropbox. I have some more on Google Drive, but I do not want to use that, as it has a much worse UX than Dropbox. So I started ...

Databases on GPGPU are still at a very early stage of development. I would even say that, unless it is top secret, it is mostly a field of academic research at the moment, and that is where we can find the most information about it. Recently I got back to looking for something ...

In my spare time I am creating a web scraping environment. Why an environment and not a script or something like that? Because those pretty much already exist. The aim of my project is to have a list of websites to monitor, scrape them every now and then, and finally process the data. One of ...

If you have just started your adventure with databases, you may not pay too much attention to data types. Often reading in the data from a flat file is so troublesome that you define every column as VARCHAR(100) just to get everything read in. Maybe after having the data read you convert some ...

Conditional formatting in Excel is one of the things every analyst should know well. When you have to compare loads of data, a good way is to plot them, but it is even better to color-code the values. That is pretty easy in Excel, as long as you can use conditional formatting (highlight a range, then conditional ...

I was preparing a post about what can slow down a query, but while reading a lot of different material online to make sure what I write is accurate, I encountered several interesting papers and presentations that I think are worth reading. The first of them is a series of presentations regarding Data Processing on ...