You are here

Statistics of MetaPhOrs

In total, over 7 million phylogenetic trees from 12 databases for 18,481,015 genes from 2,714 species have been processed in order to compute comprehensive set of 6,883,833,661 orthologs and paralogs.

Repositories

Below, you can find detailed information about repositories used in metaPhOrs (release 201405). Please refer to Methodology section, to get more information how the data have been processed.*different proteome versions of the same genome counted; 1,124 unique genomes

dbid

database

version

proteins

proteomes

trees

1

ensembl vertebrates

75

1,528,634

66

21,124

2

ensembl bacteria

16

934,735

248

37,806

3

ensembl fungi

22

461,950

45

25,851

4

ensembl metazoa

22

1,010,007

52

58,822

5

ensembl pan

22

1,566,810

176

85,300

6

ensembl plants

22

1,717,565

38

51,270

7

ensembl protists

22

414,390

29

111,473

8

phylomedb

201405

16,421,385

1,766*

3,697,286

9

orthomcl

5

1,393,144

150

77,113

10

eggnog

4

14,875,530

3,686

1,325,900

11

hogenom

6

7,251,480

1,464

185,800

12

treefam

8

2,326,692

109

15,322

SUM

49,902,322

7,829

5,693,067

Species

There are 2,714 species present in metaPhOrs. Retrieve statistics of:

Tree of life

Orthologs deposited in metaPhOrs were mapped onto the Tree of Life. The bar charts show the proportion of proteins from each proteome that is represented in the database. Green bars illustrate fraction of proteome with orthologs and/or paralogs indentified. While grey bars represent the fraction of preoteome without any homologs. Species that were used as a seed to reconstructe the phylome (like human, yeast, E.coli) and their close relatives are characterized by high coverage (respectively longer green bars). Similarily, the model organisms, that are well-represented in external trees repositories.