Boa tarde a todos,
Teremos no dia 5 de Novembro, segunda-feira, pelas 13:00, na sala 0.20,
Pavilhão de Informática II, IST Alameda, a apresentação e discussão da
tese de mestrado da Ana Sofia Correia, "Fast mapping and querying over
large scale typing data".
Abstract: High-Throughput DNA Sequencing (HTS) methods gave rise to a
paradigm shift in microbial typing and genomic population structure
studies. The ability to partially sequence the genomes of hundreds to
thousands of strains created the need for effective ways to represent
relationships between strains. Single Nucleotide Polymorphism (SNP)
analysis and whole or core genome MultiLocus Sequence Typing (wgMLST or
cgMLST), result in profiles that have thousands of loci which can be
used for outbreak investigation, epidemiological surveillance of clones
of interest and bacterial population or evolutionary studies. The first
step to define these profiles is to map reads obtained through genome
sequencing, identify relevant genes, and query existing typing databases
to find if the strain being analyzed has been identified already, or if
it is a new strain. Given the size of existing typing databases, the
data volume resulting from HTS, and the urgency of these analyses,
namely when in presence of outbreaks, the inherent computational problem
of mapping and querying typing data has become a big challenge. To solve
this issue, this work intend to demonstrate and proof a new approach
that relies on Linear Codes, specifically on Reed Muller codes.
Saudações,
Alexandre Francisco