genomes
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- taxonomy_id
  - INTEGER (primary key), unique identifier for each species
- species
  - TEXT, binomial name of the species
- family
  - TEXT, taxonomic family of the species
- order
  - TEXT, taxonomic order of the species
- phylum
  - TEXT, taxonomic phylum of the species
- accession
  - TEXT, accession number of the genome assembly
- n_minor_introns
  - INTEGER, total number of minor introns in the genome
- n_major_introns
  - INTEGER, total number of major introns in the genome
- percent_minor_introns
  - REAL, percentage of minor introns in the genome
- busco_score
  - REAL, BUSCO score assessing the completeness of the genome assembly (vs. eukaryota_odb10)
- minor_snRNAs
  - TEXT, minor snRNAs found in the annotated transcriptome
- genome_version
  - TEXT, version of the genome assembly
- source_url
  - TEXT, URL for the source genome/annotation files
- source_metadata
  - TEXT, additional metadata from the original data source
- minor_intron+
  - INTEGER, indicates whether the species is inferred to contain real minor introns (1) or not (0)
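
The column descriptions do not say how percent_minor_introns is derived from the two count columns. The values in the rows on this page are consistent with 100 × n_minor_introns / (n_minor_introns + n_major_introns), so the sketch below assumes that formula. The function name `percent_minor` is hypothetical and not part of the database.

```python
def percent_minor(n_minor: int, n_major: int) -> float:
    """Minor introns as a percentage of all introns (assumed formula)."""
    total = n_minor + n_major
    if total == 0:
        # No introns at all: report 0.0 rather than dividing by zero.
        return 0.0
    return 100.0 * n_minor / total


# Intron counts copied from the two rows shown on this page.
trichoplax = percent_minor(211, 93076)    # Trichoplax adhaerens
amphimedon = percent_minor(263, 149226)   # Amphimedon queenslandica
print(trichoplax, amphimedon)
```

Both results agree with the stored percent_minor_introns values to well below the precision shown, which supports the assumed formula for these two rows but does not confirm it for the whole table.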
 
2 rows where busco_score = "94.5" and genome_version = "v1.0" sorted by percent_minor_introns descending
| taxonomy_id | species | family | order | phylum | accession | n_minor_introns | n_major_introns | percent_minor_introns | busco_score | minor_snRNAs | genome_version | source_url | source_metadata | minor_intron+ |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10228 | Trichoplax adhaerens | Trichoplacidae | | Placozoa | GCF_000150275.1 | 211 | 93076 | 0.2261837126287693 | 94.5 | ["u11", "u12", "u4atac", "u6atac"] | v1.0 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/150/275/GCF_000150275.1_v1.0 | GCF_000150275.1;PRJNA30931;SAMN02953699;ABGP00000000.1;representative genome;10228;10228;Trichoplax adhaerens;strain=Grell-BS-1999;;latest;Scaffold;Major;Full;2008/06/17;v1.0;US DOE Joint Genome Institute (JGI-PGF);GCA_000150275.1;identical;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/150/275/GCF_000150275.1_v1.0;;;na | 1 |
| 400682 | Amphimedon queenslandica | Niphatidae | Haplosclerida | Porifera | GCF_000090795.1 | 263 | 149226 | 0.1759326773207393 | 94.5 | ["u11", "u12", "u4atac", "u6atac"] | v1.0 | https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/090/795/GCF_000090795.1_v1.0 | GCF_000090795.1;PRJNA66531;SAMN02743868;ACUQ00000000.1;representative genome;400682;400682;Amphimedon queenslandica;;;latest;Scaffold;Major;Full;2010/05/28;v1.0;US DOE Joint Genome Institute (JGI-PGF);GCA_000090795.1;different;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/090/795/GCF_000090795.1_v1.0;;;na | 1 | 
CREATE TABLE "genomes" (
  "taxonomy_id" INTEGER,
  "species" TEXT,
  "family" TEXT,
  "order" TEXT,
  "phylum" TEXT,
  "accession" TEXT,
  "n_minor_introns" INTEGER,
  "n_major_introns" INTEGER,
  "percent_minor_introns" REAL,
  "busco_score" REAL,
  "minor_snRNAs" TEXT,
  "genome_version" TEXT,
  "source_url" TEXT,
  "source_metadata" TEXT,
  "minor_intron+" INTEGER,
  PRIMARY KEY ([taxonomy_id])
);
CREATE INDEX [idx_genomes_phylum]
    ON [genomes] ([phylum]);
CREATE INDEX [idx_genomes_order]
    ON [genomes] ([order]);
CREATE INDEX [idx_genomes_family]
    ON [genomes] ([family]);
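
The filter described above (busco_score = 94.5, genome_version = v1.0, sorted by percent_minor_introns descending) can be reproduced against this schema with Python's standard `sqlite3` module. What follows is a minimal sketch, not the site's own query: it assumes an in-memory database, a reduced copy of the table holding only the columns it touches, and two rows re-entered by hand from the table on this page. Note that `"order"` (a reserved word) and `"minor_intron+"` (contains `+`) must be quoted, and that minor_snRNAs is stored as TEXT holding a JSON array, so it needs decoding after the query.

```python
import json
import sqlite3

# In-memory stand-in for the real database file (assumption for illustration).
conn = sqlite3.connect(":memory:")

# Reduced copy of the schema: only the columns this sketch uses.
conn.execute(
    """
    CREATE TABLE "genomes" (
        "taxonomy_id" INTEGER,
        "species" TEXT,
        "order" TEXT,
        "percent_minor_introns" REAL,
        "busco_score" REAL,
        "minor_snRNAs" TEXT,
        "genome_version" TEXT,
        "minor_intron+" INTEGER,
        PRIMARY KEY ("taxonomy_id")
    )
    """
)

# Values copied from the two rows on this page. No order value appears
# for Trichoplax adhaerens, so it is left as NULL here.
sample_rows = [
    (10228, "Trichoplax adhaerens", None, 0.2261837126287693, 94.5,
     '["u11", "u12", "u4atac", "u6atac"]', "v1.0", 1),
    (400682, "Amphimedon queenslandica", "Haplosclerida", 0.1759326773207393, 94.5,
     '["u11", "u12", "u4atac", "u6atac"]', "v1.0", 1),
]
conn.executemany("INSERT INTO genomes VALUES (?, ?, ?, ?, ?, ?, ?, ?)", sample_rows)

# Same filter and sort order as the page; parameters are bound, not interpolated.
query = """
    SELECT species, "order", percent_minor_introns, minor_snRNAs, "minor_intron+"
    FROM genomes
    WHERE busco_score = ? AND genome_version = ?
    ORDER BY percent_minor_introns DESC
"""
results = conn.execute(query, (94.5, "v1.0")).fetchall()

for species, order, pct, snrnas_json, has_minor in results:
    snrnas = json.loads(snrnas_json)  # decode the JSON array stored as TEXT
    print(species, order, round(pct, 4), snrnas, bool(has_minor))
```

Against the full database, the same query string should work unchanged once `sqlite3.connect` is pointed at the actual file, whose path is not given here.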