A process for analysis of microarray comparative genomics hybridisation studies for bacterial genomes

A process for analysis of microarray comparative genomics hybridisation studies for bacterial genomes - Download this document for free, or read online. Document in PDF available to download.

BMC Genomics

, 9:53

First Online: 29 January 2008Received: 06 March 2007Accepted: 29 January 2008

Abstract

BackgroundMicroarray based comparative genomic hybridisation CGH experiments have been used to study numerous biological problems including understanding genome plasticity in pathogenic bacteria. Typically such experiments produce large data sets that are difficult for biologists to handle. Although there are some programmes available for interpretation of bacterial transcriptomics data and CGH microarray data for looking at genetic stability in oncogenes, there are none specifically to understand the mosaic nature of bacterial genomes. Consequently a bottle neck still persists in accurate processing and mathematical analysis of these data. To address this shortfall we have produced a simple and robust CGH microarray data analysis process that may be automated in the future to understand bacterial genomic diversity.

ResultsThe process involves five steps: cleaning, normalisation, estimating gene presence and absence or divergence, validation, and analysis of data from test against three reference strains simultaneously. Each stage of the process is described and we have compared a number of methods available for characterising bacterial genomic diversity, for calculating the cut-off between gene presence and absence or divergence, and shown that a simple dynamic approach using a kernel density estimator performed better than both established, as well as a more sophisticated mixture modelling technique. We have also shown that current methods commonly used for CGH microarray analysis in tumour and cancer cell lines are not appropriate for analysing our data.

ConclusionAfter carrying out the analysis and validation for three sequenced Escherichia coli strains, CGH microarray data from 19 E. coli O157 pathogenic test strains were used to demonstrate the benefits of applying this simple and robust process to CGH microarray studies using bacterial genomes.

Electronic supplementary materialThe online version of this article doi:10.1186-1471-2164-9-53 contains supplementary material, which is available to authorized users.

* Images in this website could be protected under copyright laws. They are picked automatically from bing.Please use the contact form to claim a copyright infriction and the item will be removed immediatelly.