Reptile is a program written in C++ for correcting sequencing errors in short reads from next-generation sequencing platforms. It has several favorable properties:

Memory efficiency. Reptile can process input data larger than main memory. For instance, processing a 160x-coverage (3.8 GB) Illumina dataset for E. coli requires only ~1 GB of memory, which is readily available on a desktop computer.

High speed. Processing Illumina data for a microbe typically takes 0.5 to 2 hours, depending on the number and quality of the reads.

Note: the default values of the program parameters are dataset dependent, i.e., they vary from one dataset to another and hence are not “fixed” or “standard”.
The calculation of these parameters can be automated, but currently many of them need to be set manually using the method explained in the paper (no information about the reference genome is assumed). In general, the default parameters are chosen based on histograms of the dataset under consideration, such as those of quality scores and tile occurrences.
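
To make the idea of histogram-based parameter selection concrete, here is a minimal Python sketch that builds the two kinds of histogram mentioned above. It is not Reptile's code and does not reproduce the procedure from the paper. The function names, the Phred+33 quality encoding, the use of plain length-k substrings as a stand-in for tiles, and the quantile rule for picking a quality cutoff are all assumptions made for illustration.

```python
from collections import Counter


def quality_histogram(quality_strings, offset=33):
    """Count how often each quality score occurs across all reads.

    Assumes ASCII-encoded scores with the given offset (Phred+33 here).
    """
    hist = Counter()
    for qual in quality_strings:
        hist.update(ord(ch) - offset for ch in qual)
    return hist


def tile_histogram(reads, k):
    """Map an occurrence count to the number of distinct length-k substrings
    seen that many times. Plain k-mers stand in for tiles (an assumption).
    """
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return Counter(counts.values())


def score_at_quantile(hist, fraction):
    """Smallest score q such that at least `fraction` of all bases score <= q.

    A hypothetical rule for choosing a quality cutoff from the histogram.
    """
    total = sum(hist.values())
    if total == 0:
        raise ValueError("empty histogram")
    running = 0
    for score in sorted(hist):
        running += hist[score]
        if running >= fraction * total:
            return score
    return max(hist)
```

A cutoff chosen this way changes with the dataset, which is the point of the note above: the same quantile can yield different values for reads of different quality.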