Abstract

Bioinformatics is a new field of research in which genuine discoveries are being made all the time. This project investigates the application of a mathematical tool, the Rauzy graph, to the problem of classifying DNA sequences. It also aims to learn more about the properties of DNA and the nature of the graphs themselves.
The work involves developing an algorithm that generates Rauzy graphs by computer. Graphs are built from different sets of DNA sequences, and their properties are analysed to investigate how they behave in the biological domain.
The results show that differences between DNA sequences do affect the structure of their Rauzy graphs, and hence that the properties of these graphs could be used to classify the sequences.