The goal of the project is to validate predicted genes by computing a confidence score and suggesting possible errors/untrusted regions in the sequence. The results of the prediction validation will make evidence about how the sequencing curation may be done and can be useful in improving or trying new approaches for gene prediction tools. The main target users of this tool are the Biologists who want to validate the data obtained in their own laboratories.

Most of the currated data from Uniprot database (yellow starred) passes the length validation test. All the others remain our challenge.
Other databases I did't touch yet: for all kind of species [1] and plants [2].