home / WtMTA

genomes

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

taxonomy_id
INTEGER (primary key), unique identifier for each species
species
TEXT, binomial name of the species
family
TEXT, taxonomic family of the species
order
TEXT, taxonomic order of the species
phylum
TEXT, taxonomic phylum of the species
accession
TEXT, accession number of the genome assembly
n_minor_introns
INTEGER, total number of minor introns in the genome
n_major_introns
INTEGER, total number of major introns in the genome
percent_minor_introns
REAL, percentage of minor introns in the genome
busco_score
REAL, BUSCO score assessing the genome assembly completeness (vs. eukaryota_odb10)
minor_snRNAs
TEXT, minor snRNAs found in the annotated transcriptome
genome_version
TEXT, version of the genome assembly
source_url
TEXT, URL for the source genome/annotation files
source_metadata
TEXT, additional metadata from the original data source
minor_intron+
INTEGER, indicates if the species is inferred to contain real minor introns (1) or not (0)

2 rows where family = "Pipridae", minor_snRNAs = "["u12", "u4atac", "u6atac"]" and minor_snRNAs contains "u12" sorted by percent_minor_introns descending

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: minor_snRNAs (array)

taxonomy_id species family order phylum accession n_minor_introns n_major_introns percent_minor_introns ▲ busco_score minor_snRNAs genome_version source_url source_metadata minor_intron+
328815 Manacus vitellinus Pipridae Passeriformes Chordata GCF_001715985.3 519 149246 0.3465429172370046 87.5 ["u12", "u4atac", "u6atac"] ASM171598v3 https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/715/985/GCF_001715985.3_ASM171598v3 GCF_001715985.3;PRJNA341382;SAMN02299332;MCBO00000000.3;representative genome;328815;328815;Manacus vitellinus;;BGI_N305;latest;Scaffold;Major;Full;2019/07/03;ASM171598v3;Smithsonian Institution National Museum of Natural History;GCA_001715985.3;identical;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/715/985/GCF_001715985.3_ASM171598v3;;;na 1
321398 Lepidothrix coronata Pipridae Passeriformes Chordata GCF_001604755.1 554 162800 0.3391407617811623 94.9 ["u12", "u4atac", "u6atac"] Lepidothrix_coronata-1.0 https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/604/755/GCF_001604755.1_Lepidothrix_coronata-1.0 GCF_001604755.1;PRJNA338288;SAMN04274560;LVWP00000000.1;representative genome;321398;321398;Lepidothrix coronata;;B3197;latest;Scaffold;Major;Full;2016/03/31;Lepidothrix_coronata-1.0;McDonnell Genome Institute;GCA_001604755.1;identical;https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/001/604/755/GCF_001604755.1_Lepidothrix_coronata-1.0;;;na 1

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "genomes" (
"taxonomy_id" INTEGER,
  "species" TEXT,
  "family" TEXT,
  "order" TEXT,
  "phylum" TEXT,
  "accession" TEXT,
  "n_minor_introns" INTEGER,
  "n_major_introns" INTEGER,
  "percent_minor_introns" REAL,
  "busco_score" REAL,
  "minor_snRNAs" TEXT,
  "genome_version" TEXT,
  "source_url" TEXT,
  "source_metadata" TEXT,
  "minor_intron+" INTEGER
  ,PRIMARY KEY ([taxonomy_id])
);
CREATE INDEX [idx_genomes_phylum]
    ON [genomes] ([phylum]);
CREATE INDEX [idx_genomes_order]
    ON [genomes] ([order]);
CREATE INDEX [idx_genomes_family]
    ON [genomes] ([family]);
Powered by Datasette · Queries took 27.528ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)