Gene sets
Annotation Summary
In general, annotations may be associated directly with an NCBI GeneID, or they may be associated with a gene product that has an assigned UniProt accession (UniProtKB-AC). All UniProtKB-AC accessions are mapped to UniProtKB entries. The columns in the following tables are defined as follows.
- taxon - matches the taxon summary table at the top of the page
- genes (annots) - the number of unique GeneIDs from NCBI that are associated with a given taxon
- uniprot (annots) - the identifiers from UniProt (non-unique) that are associated with a given taxon
- coding-genes - GeneIDs that have at least one associated UniProt identifier
- combined - UniProt and gene annotations with redundant annotations removed
- coverage - combined divided by genes, expressed as a percentage
The combined number of annotations may be less than the total observed annotations because multiple UniProt IDs may map to a single GeneID.
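The effect of several UniProt identifiers collapsing onto one GeneID can be sketched with plain Python sets. This is only an illustration of the idea, not the code used to build the tables: the identifiers, the mapping, and the helper name `combine_annotations` are all hypothetical.

```python
# Hypothetical identifiers, for illustration only.
# GeneIDs that carry an annotation directly.
gene_annotated = {1001, 1002, 1003}

# Hypothetical UniProt accession -> GeneID mapping.
# Two accessions point at the same gene (1002).
uniprot_to_gene = {
    "X00001": 1002,
    "X00002": 1002,
    "X00003": 1004,
}

# UniProt accessions that carry an annotation.
uniprot_annotated = {"X00001", "X00002", "X00003"}


def combine_annotations(gene_ids, uniprot_ids, mapping):
    """Return the GeneIDs annotated directly or through a mapped UniProt ID."""
    via_uniprot = {mapping[acc] for acc in uniprot_ids if acc in mapping}
    return set(gene_ids) | via_uniprot


combined = combine_annotations(gene_annotated, uniprot_annotated, uniprot_to_gene)
total_observed = len(gene_annotated) + len(uniprot_annotated)

print(f"combined: {len(combined)}, total observed: {total_observed}")
```

In this toy case the combined count is smaller than the total observed count because two accessions map to GeneID 1002, which is also annotated directly.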
| Taxon | Scientific Name | Common Name |
|---|---|---|
| 10090 | Mus musculus | house mouse |
| 9606 | Homo sapiens | human |
| 7227 | Drosophila melanogaster | fruit fly |
| 7955 | Danio rerio | leopard danio |
Including IEA evidence
| Taxon | genes (annots) | uniprot (annots) | coding-genes | combined | coverage (%) |
|---|---|---|---|---|---|
| 10090 | 69347 (18938) | 16671 (15735) | 15619 | 19006 | 27.4071 |
| 9606 | 47736 (18114) | 140289 (100435) | 18921 | 18244 | 38.2185 |
| 7227 | 23387 (11563) | 40464 (26100) | 13543 | 11870 | 50.7547 |
| 7955 | 36579 (14990) | 2926 (2697) | 2669 | 15069 | 41.1958 |
Excluding IEA evidence
| Taxon | genes (annots) | uniprot (annots) | coding-genes | combined | coverage (%) |
|---|---|---|---|---|---|
| 10090 | 69347 (15763) | 16671 (11428) | 15619 | 15842 | 22.8445 |
| 9606 | 47736 (14077) | 140289 (17803) | 18921 | 14263 | 29.8789 |
| 7227 | 23387 (10338) | 40464 (12648) | 13543 | 10500 | 44.8967 |
| 7955 | 36579 (3962) | 2926 (1261) | 2669 | 4006 | 10.9516 |
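The coverage column in the two tables above can be reproduced from the genes and combined columns, reading coverage as combined divided by genes and scaled to a percentage. The following is a minimal sketch; the function name `coverage` and the row layout are assumptions made for illustration, while the numbers are copied from the tables.

```python
def coverage(combined: int, genes: int) -> float:
    """Coverage as a percentage: combined annotated genes over all genes."""
    if genes <= 0:
        raise ValueError("genes must be a positive count")
    return 100.0 * combined / genes


# (taxon, genes, combined, reported coverage), copied from the tables above.
including_iea = [
    (10090, 69347, 19006, 27.4071),
    (9606, 47736, 18244, 38.2185),
    (7227, 23387, 11870, 50.7547),
    (7955, 36579, 15069, 41.1958),
]
excluding_iea = [
    (10090, 69347, 15842, 22.8445),
    (9606, 47736, 14263, 29.8789),
    (7227, 23387, 10500, 44.8967),
    (7955, 36579, 4006, 10.9516),
]

for label, rows in (("including IEA", including_iea), ("excluding IEA", excluding_iea)):
    for taxon, genes, combined, reported in rows:
        computed = coverage(combined, genes)
        print(f"{label} {taxon}: computed {computed:.4f}, reported {reported}")
```

Comparing the computed and reported values row by row is a quick way to catch transcription errors when the tables are regenerated.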
Other organisms
| ncbi_id | name | total_genes | total_annotations |
|---|---|---|---|
| 352472 | Dictyostelium discoideum AX4 | 13893 | 46815 |
| 243164 | Dehalococcoides ethenogenes 195 | 1643 | 3903 |
| 214684 | Cryptococcus neoformans | 6618 | 12688 |
| 211586 | Shewanella oneidensis | 4591 | 10351 |
| 227321 | Aspergillus nidulans FGSC A4 | 9596 | 31811 |
| 3702 | Arabidopsis thaliana | 33584 | 150462 |
| 7955 | Danio rerio | 36644 | 91414 |
| 9031 | Gallus gallus | 25998 | 15334 |
| 205920 | Ehrlichia chaffeensis | 1159 | 2810 |
| 7227 | Drosophila melanogaster | 23387 | 80109 |
| 176299 | Agrobacterium fabrum | 5465 | 142 |
| 246194 | Carboxydothermus hydrogenoformans | 2708 | 6282 |
| 195099 | Campylobacter jejuni RM1221 | 1941 | 4557 |
| 9913 | Bos taurus | 43990 | 7634 |
| 36329 | Plasmodium falciparum 3D7 | 5510 | 3448 |
| 4536 | Oryza nivara | 165 | 97 |
| 511145 | Escherichia coli | 4498 | 5292 |
| 265669 | Listeria monocytogenes | 2935 | 6977 |
| 40149 | Oryza meridionalis | 124 | 4 |
| 227377 | Coxiella burnetii | 2096 | 4449 |
| 9606 | Homo sapiens (human) | 46263 | 204932 |
| 243233 | Methylococcus capsulatus | 3053 | 7166 |
| 243231 | Geobacter sulfurreducens PCA | 3714 | 7689 |
| 39946 | Oryza sativa Indica Group | 161 | 160 |
| 39947 | Oryza sativa Japonica Group | 30535 | 4740 |
| 559292 | Saccharomyces cerevisiae S288c | 6350 | 75240 |
| 284812 | Schizosaccharomyces pombe | 6954 | 35088 |
| 246200 | Ruegeria pomeroyi | 4349 | 10631 |
| 10116 | Rattus norvegicus | 44939 | 243527 |
| 10090 | Mus musculus | 57997 | 267454 |
| 212042 | Anaplasma phagocytophilum str. HZ | 1412 | 3438 |
| 4529 | Oryza rufipogon | 343 | 2 |
| 222891 | Neorickettsia sennetsu | 974 | 2392 |
| 234826 | Anaplasma marginale | 1005 | 196 |
| 999953 | Trypanosoma brucei brucei | 10193 | 1820 |
| 198094 | Bacillus anthracis str. Ames | 5450 | 12398 |
| 6239 | Caenorhabditis elegans (nematode) | 45727 | 65615 |
| 223283 | Pseudomonas syringae pv. | 5843 | 9962 |