Benediktsson, Elís Ingi

Abstract [en]

Megasatellites are polymorphic tandem repetitive sequences with repeat-units longer than or equal to 1000 base pairs. The novel algorithm Megasatfinder predicts megasatellites in the human genome. A structured method of analysing the algorithm is developed and conducted. The analysis method consists of six test scenarios. Scripts are created, which execute the algorithm using various parameter settings. Three nucleotide sequences are applied; a real sequence extracted from the human genome and two random sequences, generated using different base probabilities. Usability and accuracy are investigated, providing the user with confidence in the algorithm and its output. The results indicate that Megasatfinder is an excellent tool for the detection of megasatellites and that the generated results are highly reliable. The results of the complete analysis suggest alterations in the default parameter settings, presented as user guidelines, and state that artificially generated sequences are not applicable as models for real DNA in computational simulations.