Question: Why do we need to rebuild the collection.spec file correctly (see Terrier/LiveDoc/TrecExample)? Is the file collection.spec created with the script trec_setup.sh?

VassilisPlachouras: The trec_setup.sh script uses the find utility on Unix/Linux/Mac OS X systems, and the trec_setup.bat script uses the class FileFind on Windows, to obtain the absolute paths of all the files under a given directory.

If the directory you specify for the trec_setup script contains only the collection files, then it is not necessary to create the collection.spec file manually. If the directory where the collection files are stored contains other files as well, then you should check that the automatically generated collection.spec file lists only the collection files, and otherwise either edit it or create it manually.
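
If you prefer to build the file yourself, the following is a minimal Python sketch of the same idea and not part of Terrier. It assumes that collection.spec lists one absolute path per line, and that the collection files can be recognised by a filename suffix (".gz" here is only an example). The function names are hypothetical.

```python
import os


def list_collection_files(root, suffixes=(".gz",)):
    """Return the sorted absolute paths of files under root whose names
    end with one of the given suffixes (case-sensitive match)."""
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            # Assumption: a suffix is enough to tell collection files
            # apart from other files stored in the same directory.
            if name.endswith(tuple(suffixes)):
                paths.append(os.path.abspath(os.path.join(dirpath, name)))
    return sorted(paths)


def write_collection_spec(paths, spec_path):
    """Write one path per line to spec_path (assumed collection.spec layout)."""
    with open(spec_path, "w", encoding="utf-8") as out:
        for path in paths:
            out.write(path + "\n")


# Example usage (both paths are placeholders):
# files = list_collection_files("/path/to/collection")
# write_collection_spec(files, "/path/to/terrier/etc/collection.spec")
```

Filtering by suffix avoids having to edit the generated file by hand when the directory also holds README files, checksums or similar. Adjust the suffixes to match your collection.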

Has anyone used the distributed version of Terrier with the TREC terabyte track to compare efficiency between a language model approach and the DFR models, or conventional models?

More information: I do not have the data of the terabyte track, and I would like to know whether assigning non-zero probabilities to terms increases the complexity of retrieval, especially with long queries or QE.

Question: Terabyte Track

More information: I am browsing the results of the terabyte track. FUB did not participate in this track. I do not have the results. However, I understood that Terrier was first on both long and short queries. I see that another group is claiming that they were first on the title-only run. I also saw somebody complaining that the results are not yet available. It seems that an official summary table from the organizers is still missing. Could someone from GU clarify this point for me?