Description

The aim of the GENCODE
Genes project (Harrow et al., 2006) is to produce a set of
highly accurate annotations of evidence-based gene features on the human reference genome.
This includes the identification of all protein-coding loci with associated
alternative splice variants, non-coding with transcript evidence in the public
databases (NCBI/EMBL/DDBJ) and pseudogenes. A high quality set of gene
structures is necessary for many research studies such as comparative or
evolutionary analyses, or for experimental design and interpretation of the
results.

The GENCODE Genes tracks display the high-quality manual annotations merged
with evidence-based automated annotations across the entire
human genome. The GENCODE gene set presents a full merge
between HAVANA manual annotation and Ensembl automatic annotation.
Priority is given to the manually curated HAVANA annotation using predicted
Ensembl annotations when there are no corresponding manual annotations. With
each release, there is an increase in the number of annotations that have undergone
manual curation.
This annotation was carried out on the GRCh38 (hg38) genome assembly.

Display Conventions

These are multi-view composite tracks that contain differing data sets
(views). Instructions for configuring multi-view tracks are
here.
Only some subtracks are shown by default. The user can select which subtracks
are displayed via the display controls on the track details pages.
Further details on display conventions and data interpretation are available in the track descriptions.

Data access

GENCODE Genes and its associated tables can be explored interactively using the
REST API, the
Table Browser or the
Data Integrator.
The GENCODE data files for hg38 are available in our
downloads directory as wgEncodeGencode* files in genePred format.
All the tables can also be queried directly from our public MySQL
servers, with instructions on this method available on our
MySQL help page as well as on
our blog.