Several types of sequence may also be retrieved from individual CGD Locus pages using options on the ‘Retrieve Sequences’ pull-down menu. The options include the genomic DNA (with introns), the coding sequence (with introns removed), the genomic DNA with 1 kb upstream and downstream flanking regions, and the predicted protein sequence (ORF translation). The sequences in Assembly 20 and Assembly 19 are available from this menu. In cases where the two diploid alleles are known to differ in sequence in Assembly 19, both of these sequences are available. The selected sequence is displayed in a browser window in FASTA format, and may be copied and pasted into anotherapplication.

Sequence information may be retrieved in bulk from the CGD website. The downloadable files are indexed on the CGD Downloads page (http://www.candidagenome.org/DownloadContents.shtml). The raw sequences of C.albicans chromosomes from Assembly 20 are available, as well as an archive of contigs from previous assemblies (Assemblies 4, 5, 6 and 19) from the Stanford Genome Technology Center (10). For Assemblies 19 and 20, files with processed sequences are also available: DNA sequences of all coding regions, with or without introns, and with and without 1 kb flanking regions. The protein sequences derived from ORF translation are available for the Assemblies 20, 19 and 6. The DNA sequence of the mitochondrial genome is also available for download (10), as are the sequences of the tRNA genes predicted at CGD using the tRNAscan-SE algorithm (15).

GBrowse view of data from Assembly 20 of the C.albicans genome sequence. The view may be scrolled along the Assembly 20 chromosome, or zoomed in on a particular region. Selectable tracks of data include ORFs, Contig 19s, centromeric regions, and sequence gap regions.

Comparative sequence analysis at CGD

For sequence analysis, CGD provides a BLAST tool accessible via a link shown in the banner at the top of almostevery page on the CGD website (http://www.candidagenome.org/cgi-bin/nph-blast). The tool allows comparison of any query sequence to one of several C.albicans sequence datasets. The full suite of BLAST programs is available [BLASTN, BLASTP, BLASTX, TBLASTX and TBLASTN (16)]. The sequence databases that can be searched include the complete sequence of Assembly 20 chromosomes, Assembly 19 supercontigs, Assembly 6 contigs or Assembly 4 contigs; the coding sequences of ORFs from Assemblies 20 or 19 with or without introns; the translation products of ORFs from Assemblies 20, 19 or 6; the predicted tRNA genes from Assembly 19, or the mitochondrial genome sequence.

The medical information provided on this website is of a general nature and can not substitute for the advice of a medical professional
(for example, a qualified doctor/physician)! Information from the internet could and should NOT be used to offer or render a medical opinion or otherwise
engage in the practice of medicine.