genomes
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- taxonomy_id: INTEGER (primary key), unique identifier for each species
- species: TEXT, binomial name of the species
- family: TEXT, taxonomic family of the species
- order: TEXT, taxonomic order of the species
- phylum: TEXT, taxonomic phylum of the species
- accession: TEXT, accession number of the genome assembly
- n_minor_introns: INTEGER, total number of minor introns in the genome
- n_major_introns: INTEGER, total number of major introns in the genome
- percent_minor_introns: REAL, minor introns as a percentage of all introns (minor plus major) in the genome
- busco_score: REAL, BUSCO score assessing genome assembly completeness (vs. eukaryota_odb10)
- minor_snRNAs: TEXT, JSON array of the minor snRNAs found in the annotated transcriptome
- genome_version: TEXT, version of the genome assembly
- source_url: TEXT, URL for the source genome/annotation files
- source_metadata: TEXT, additional semicolon-delimited metadata from the original data source
- minor_intron+: INTEGER, indicates whether the species is inferred to contain real minor introns (1) or not (0)
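The relationship between the three intron columns can be sketched as follows. The formula is an inference from the rows shown on this page (minor introns divided by minor plus major introns, times 100), not something the source states explicitly, and the helper name `percent_minor` is hypothetical.

```python
def percent_minor(n_minor: int, n_major: int) -> float:
    """Minor introns as a percentage of all introns (assumed formula)."""
    total = n_minor + n_major
    if total == 0:
        # Guard against genomes with no annotated introns at all.
        return 0.0
    return 100.0 * n_minor / total


# Counts copied from the Toxostoma redivivum row on this page.
toxostoma_percent = percent_minor(317, 110132)

# Stored percent_minor_introns value for the same row.
toxostoma_stored = 0.2870102943439959

# True when the assumed formula reproduces the stored value.
matches_stored = abs(toxostoma_percent - toxostoma_stored) < 1e-6
```

If `matches_stored` is true for a row, the assumed formula is at least consistent with how that row's percentage was derived.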
5 rows where minor_snRNAs contains "u4atac" and n_minor_introns = 317 sorted by percent_minor_introns descending
| taxonomy_id | species | family | order | phylum | accession | n_minor_introns | n_major_introns | percent_minor_introns | busco_score | minor_snRNAs | genome_version | source_url | source_metadata | minor_intron+ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 99882 | Toxostoma redivivum | Mimidae | Passeriformes | Chordata | GCA_013397375.1 | 317 | 110132 | 0.2870102943439959 | 86.7 | ["u11", "u12", "u4atac", "u6atac"] | ASM1339737v1 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/013/397/375/GCA_013397375.1_ASM1339737v1 | GCA_013397375.1;PRJNA545868;SAMN12253839;VXBI00000000.1;representative genome;99882;99882;Toxostoma redivivum;;B10K-DU-002-15;latest;Scaffold;Major;Full;2020/07/10;ASM1339737v1;B10K Consortium;na;na;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/013/397/375/GCA_013397375.1_ASM1339737v1;;;na | 1 |
| 227184 | Rynchops niger | Laridae | Charadriiformes | Chordata | GCA_013400035.1 | 317 | 113734 | 0.2779458312509316 | 88.6 | ["u11", "u12", "u4atac", "u6atac"] | ASM1340003v1 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/013/400/035/GCA_013400035.1_ASM1340003v1 | GCA_013400035.1;PRJNA545868;SAMN12253840;VXBH00000000.1;representative genome;227184;227184;Rynchops niger;;B10K-DU-002-16;latest;Scaffold;Major;Full;2020/07/10;ASM1340003v1;B10K Consortium;na;na;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/013/400/035/GCA_013400035.1_ASM1340003v1;;;na | 1 |
| 87088 | Vigna umbellata | Fabaceae | Fabales | Streptophyta | GCF_018835915.1 | 317 | 121990 | 0.2591838570155428 | 89.0 | ["u11", "u12", "u4atac", "u6atac"] | ASM1883591v1 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/018/835/915/GCF_018835915.1_ASM1883591v1 | GCF_018835915.1;PRJNA819033;SAMN11230903;SPEB00000000.1;representative genome;87088;87088;Vigna umbellata;cultivar=VRB-3 Himshakti;;latest;Scaffold;Major;Full;2021/06/10;ASM1883591v1;ICGEB;GCA_018835915.1;identical;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/018/835/915/GCF_018835915.1_ASM1883591v1;;;na | 1 |
| 9678 | Crocuta crocuta | Hyaenidae | Carnivora | Chordata | GCA_008692635.1 | 317 | 123814 | 0.2553753695692454 | 88.6 | ["u11", "u12", "u4atac", "u6atac"] | BGI_CrCroc_1.0 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/008/692/635/GCA_008692635.1_BGI_CrCroc_1.0 | GCA_008692635.1;PRJNA554753;SAMN12283067;VOAJ00000000.1;representative genome;9678;9678;Crocuta crocuta;;KB4526;latest;Scaffold;Major;Full;2019/10/07;BGI_CrCroc_1.0;BGI;na;na;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/008/692/635/GCA_008692635.1_BGI_CrCroc_1.0;;;na | 1 |
| 6087 | Hydra vulgaris | Hydridae | Anthoathecata | Cnidaria | GCF_022113875.1 | 317 | 141713 | 0.2231922833204252 | 98.4 | ["u11", "u12", "u4atac", "u6atac"] | Hydra_105_v3 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/022/113/875/GCF_022113875.1_Hydra_105_v3 | GCF_022113875.1;PRJNA814716;SAMN18321928;JAGKSS000000000.1;representative genome;6087;6087;Hydra vulgaris;;105;latest;Chromosome;Major;Full;2022/02/14;Hydra_105_v3;University of Vienna;GCA_022113875.1;different;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/022/113/875/GCF_022113875.1_Hydra_105_v3;;;na | 1 |
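The filter and sort that produced these five rows can be approximated with Python's built-in `sqlite3` module. This is a minimal sketch under two assumptions: that the "contains" filter corresponds to a SQL `LIKE '%u4atac%'` substring match, and that a four-column in-memory table is an adequate stand-in for the full `genomes` table. The values are copied from the rows above and inserted in a different order so the `ORDER BY` has something to do.

```python
import json
import sqlite3

SNRNAS = '["u11", "u12", "u4atac", "u6atac"]'

# (species, n_minor_introns, percent_minor_introns, minor_snRNAs),
# copied from the table above and deliberately shuffled.
ROWS = [
    ("Hydra vulgaris", 317, 0.2231922833204252, SNRNAS),
    ("Vigna umbellata", 317, 0.2591838570155428, SNRNAS),
    ("Toxostoma redivivum", 317, 0.2870102943439959, SNRNAS),
    ("Crocuta crocuta", 317, 0.2553753695692454, SNRNAS),
    ("Rynchops niger", 317, 0.2779458312509316, SNRNAS),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE genomes ("
    "species TEXT, n_minor_introns INTEGER, "
    "percent_minor_introns REAL, minor_snRNAs TEXT)"
)
conn.executemany("INSERT INTO genomes VALUES (?, ?, ?, ?)", ROWS)

# Assumed SQL equivalent of the page's filter description.
QUERY = """
SELECT species, percent_minor_introns, minor_snRNAs
FROM genomes
WHERE minor_snRNAs LIKE :pattern
  AND n_minor_introns = :n_minor
ORDER BY percent_minor_introns DESC
"""
results = conn.execute(QUERY, {"pattern": "%u4atac%", "n_minor": 317}).fetchall()
species_in_order = [row[0] for row in results]

# LIKE is only a substring match; decoding the JSON array gives an
# exact membership test on the snRNA names.
exact_matches = [sp for sp, _, raw in results if "u4atac" in json.loads(raw)]
```

Because `minor_snRNAs` is stored as JSON text, a substring match would also accept any longer name that merely contains `u4atac`; decoding the array first avoids that ambiguity.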
CREATE TABLE "genomes" (
    "taxonomy_id" INTEGER,
    "species" TEXT,
    "family" TEXT,
    "order" TEXT,
    "phylum" TEXT,
    "accession" TEXT,
    "n_minor_introns" INTEGER,
    "n_major_introns" INTEGER,
    "percent_minor_introns" REAL,
    "busco_score" REAL,
    "minor_snRNAs" TEXT,
    "genome_version" TEXT,
    "source_url" TEXT,
    "source_metadata" TEXT,
    "minor_intron+" INTEGER,
    PRIMARY KEY ([taxonomy_id])
);
CREATE INDEX [idx_genomes_phylum]
ON [genomes] ([phylum]);
CREATE INDEX [idx_genomes_order]
ON [genomes] ([order]);
CREATE INDEX [idx_genomes_family]
ON [genomes] ([family]);
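To check that the schema above loads cleanly and that the taxonomic indexes are usable, it can be run against an in-memory SQLite database. The sketch below assumes Python's built-in `sqlite3` module. The lookup on `phylum` is an illustrative query, not one taken from the source, and the exact wording of the query plan varies between SQLite versions, so the code only collects the plan text without asserting a specific format.

```python
import sqlite3

# Schema as published on this page (whitespace aside).
SCHEMA = """
CREATE TABLE "genomes" (
    "taxonomy_id" INTEGER,
    "species" TEXT,
    "family" TEXT,
    "order" TEXT,
    "phylum" TEXT,
    "accession" TEXT,
    "n_minor_introns" INTEGER,
    "n_major_introns" INTEGER,
    "percent_minor_introns" REAL,
    "busco_score" REAL,
    "minor_snRNAs" TEXT,
    "genome_version" TEXT,
    "source_url" TEXT,
    "source_metadata" TEXT,
    "minor_intron+" INTEGER,
    PRIMARY KEY ([taxonomy_id])
);
CREATE INDEX [idx_genomes_phylum] ON [genomes] ([phylum]);
CREATE INDEX [idx_genomes_order] ON [genomes] ([order]);
CREATE INDEX [idx_genomes_family] ON [genomes] ([family]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Explicitly created indexes have their SQL recorded in sqlite_master.
index_names = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'index' AND sql IS NOT NULL ORDER BY name"
    )
]

# Ask the planner how it would run an equality lookup on phylum.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT species FROM genomes WHERE phylum = ?",
    ("Cnidaria",),
).fetchall()
plan_details = [row[3] for row in plan]  # fourth column is the description
uses_phylum_index = any("idx_genomes_phylum" in d for d in plan_details)
```

Note that `order` is a reserved word in SQL, which is why the schema quotes that column name; any hand-written query that touches it needs the same quoting.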