Downloads

This challenge will be evaluated on multiple freely available datasets, including the following demo dataset. Each dataset consists of a set of files, each holding the data for one trajectory. The files are in a space-separated value format. The first line should be ignored (it usually holds the column names). The first two columns are always the X and Y coordinates. A sample file begins like this:

For this challenge, we ignore the curvature of the earth and map projections, so you should expect X and Y values of any magnitude and treat them as plain Euclidean coordinates. You should also ignore the contents of the first line of each file, since it might look different during evaluation.
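The file format described above can be read with a few lines of code. The following is a minimal sketch in Python, not a reference implementation: the function name `load_trajectory` is hypothetical, and skipping blank or too-short lines is an assumption that the format description does not state.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def load_trajectory(path: str) -> List[Point]:
    """Read one trajectory file and return its (x, y) points in file order."""
    points: List[Point] = []
    with open(path, "r", encoding="utf-8") as handle:
        # The first line usually holds column names and must be ignored.
        next(handle, None)
        for line in handle:
            fields = line.split()  # splits on any run of whitespace
            if len(fields) < 2:
                # Assumption: blank or malformed lines are skipped silently.
                continue
            # Only the first two columns (X and Y) are guaranteed to exist.
            points.append((float(fields[0]), float(fields[1])))
    return points
```

The coordinates are kept as plain floats and no projection is applied, in line with treating them as Euclidean coordinates.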

A dataset is specified as a single file with one file name per line. In this way, we can create multiple datasets of varying sizes from the same set of trajectory files. Such a file for the sample dataset begins like this:

Note that the file names shall be used as unique keys in the output file format below, so you have to load these into memory. If you want to create new datasets, it is best to use find, sort and head on Linux systems like this:

> find files -type f | sort -R | head -n 500 > dataset.txt

which generates a random list of 500 files in the directory files. Using find instead of ls makes each output line contain the path to the file, including the directory prefix. For Geolife, for example, you could use find -name '*.plt' to generate a similar file (the quotes keep the shell from expanding the pattern).
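A dataset file of this kind (one trajectory file name per line, with the names serving as unique keys) could be loaded as sketched below. This is only an illustration under stated assumptions: the names `read_points` and `load_dataset` are hypothetical, empty lines in the dataset file are skipped, and each listed name is assumed to be a path that can be opened from the current working directory.

```python
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def read_points(path: str) -> List[Point]:
    """Read the X and Y columns of one trajectory file, skipping the first line."""
    points: List[Point] = []
    with open(path, "r", encoding="utf-8") as handle:
        next(handle, None)  # header line is ignored
        for line in handle:
            fields = line.split()
            if len(fields) >= 2:
                points.append((float(fields[0]), float(fields[1])))
    return points


def load_dataset(dataset_path: str) -> Dict[str, List[Point]]:
    """Map each file name listed in the dataset file to its trajectory."""
    trajectories: Dict[str, List[Point]] = {}
    with open(dataset_path, "r", encoding="utf-8") as handle:
        for line in handle:
            name = line.rstrip("\r\n")
            if not name:
                continue  # assumption: empty lines are not entries
            # The name is stored exactly as written, since it is the key
            # that the output format refers to.
            trajectories[name] = read_points(name)
    return trajectories
```

Keeping the names unmodified (no normalization of the path) means they can be written back to the output file exactly as they appeared in the dataset file.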

The problem itself is specified as a file queries.txt containing one query per line. The first line will be numbered zero. Such a problem file looks similar to