introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)
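
The relationship between start, end, and length is not spelled out in the column descriptions. The rows listed for transcript 35515751 appear consistent with 1-based inclusive coordinates, i.e. length = end - start + 1. The sketch below checks that reading against the first three rows; the inclusive-interval interpretation is inferred from the data and is an assumption, and `intron_length` is a hypothetical helper name.

```python
# (start, end, length) copied from the first three introns of transcript 35515751
rows = [
    (533, 842, 310),
    (938, 1089, 152),
    (1149, 1577, 429),
]


def intron_length(start: int, end: int) -> int:
    """Length in bp of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Collect any row whose stored length disagrees with the inclusive formula
mismatches = [r for r in rows if intron_length(r[0], r[1]) != r[2]]
print(mismatches)
```

An empty list means every checked row agrees with the inclusive formula.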

16 rows where transcript_id = 35515751
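
The filter above corresponds to a plain `WHERE transcript_id = ...` query. A minimal sketch using Python's standard `sqlite3` module follows. It builds a small in-memory table with a subset of the columns rather than opening the real database, whose file name is not given here. The first two rows are copied from this page; the third row, with transcript_id 999, is hypothetical and only there to show that the filter excludes it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced version of the introns table: only the columns needed for the filter
conn.execute(
    """
    CREATE TABLE introns (
        "id" INTEGER PRIMARY KEY,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER
    )
    """
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (199620747, 35515751, 1, 533, 842),   # from this page
        (199620748, 35515751, 2, 938, 1089),  # from this page
        (1, 999, 1, 10, 20),                  # hypothetical row for another transcript
    ],
)

# Parameterized equivalent of "rows where transcript_id = 35515751"
result = conn.execute(
    "SELECT id, ordinal_index FROM introns WHERE transcript_id = ? ORDER BY id",
    (35515751,),
).fetchall()
print(result)
```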


id dinucleotide_pair is_minor score length transcript_id (label, id) ordinal_index start end taxonomy_id (species, id) scored_motifs phase in_cds relative_position
199620747 GT-AG 0 1.000000099473604e-05 310 rna-XM_010219115.1 35515751 1 533 842 Tinamus guttatus 94827 CAG|GTAGGACCAT...TCATCTTTCACC/TCATCTTTCACC...TGCAG|CCA 2 1 3.462
199620748 GT-AG 0 1.000000099473604e-05 152 rna-XM_010219115.1 35515751 2 938 1089 Tinamus guttatus 94827 TAG|GTGCGTTTCA...TGTGTCTGAATC/CTGTGTCTGAAT...TCCAG|CTT 1 1 8.029
199620749 GT-AG 0 1.000000099473604e-05 429 rna-XM_010219115.1 35515751 3 1149 1577 Tinamus guttatus 94827 CAG|GTAGGTCTCT...AAAGCCTGAGCA/ACTGGACTCACT...CACAG|GCT 0 1 10.865
199620750 GT-AG 0 1.000000099473604e-05 84 rna-XM_010219115.1 35515751 4 1626 1709 Tinamus guttatus 94827 CAG|GTGAGGGACA...TGGTCCTGAACA/CTGGTCCTGAAC...CACAG|ACC 0 1 13.173
199620751 GT-AG 0 1.000000099473604e-05 224 rna-XM_010219115.1 35515751 5 1845 2068 Tinamus guttatus 94827 CAG|GTGAGACCCG...GTGTTCCCAATG/CGTGTTCCCAAT...CACAG|GTC 0 1 19.663
199620752 GT-AG 0 3.609202834716017e-05 415 rna-XM_010219115.1 35515751 6 2170 2584 Tinamus guttatus 94827 CAA|GTATGGCTCC...CCCTCCTGGTTT/GGTATCCCCACG...CACAG|GAG 2 1 24.519
199620753 GT-AG 0 1.000000099473604e-05 118 rna-XM_010219115.1 35515751 7 2792 2909 Tinamus guttatus 94827 GAG|GTGAGCGGGC...AGTGGCTTGAGG/ATGAAACGAACT...CACAG|GAC 2 1 34.471
199620754 GT-AG 0 0.0023437057899975 316 rna-XM_010219115.1 35515751 8 3018 3333 Tinamus guttatus 94827 CAG|GTACCCGTGG...AGACTCAGAGCA/GAAAGACTCAGA...TGCAG|GTC 2 1 39.663
199620755 GT-AG 0 1.0552541421800198e-05 371 rna-XM_010219115.1 35515751 9 3474 3844 Tinamus guttatus 94827 AGG|GTAGGTCTGC...CAGCCCTTCTCT/GCTCGTGTCAGC...TGCAG|TGC 1 1 46.394
199620756 GT-AG 0 1.000000099473604e-05 141 rna-XM_010219115.1 35515751 10 3964 4104 Tinamus guttatus 94827 CAG|GTGAGCCCAC...CCAGCTCTGACC/CCAGCTCTGACC...CCCAG|GAG 0 1 52.115
199620757 GT-AG 0 1.000000099473604e-05 380 rna-XM_010219115.1 35515751 11 4228 4607 Tinamus guttatus 94827 AAG|GTGAGACACG...TCTCTCTTGCCT/TCTTGCCTCACT...CCCAG|ATG 0 1 58.029
199620758 GT-AG 0 1.705358529892508e-05 445 rna-XM_010219115.1 35515751 12 4794 5238 Tinamus guttatus 94827 AAG|GTACCGCGGC...CATCTCTCACAA/ACATCTCTCACA...CCTAG|GAG 0 1 66.971
199620759 GT-AG 0 1.000000099473604e-05 409 rna-XM_010219115.1 35515751 13 5416 5824 Tinamus guttatus 94827 AAG|GTAAGCGGGA...GCCCTTTTGCCT/GTCTCCCCCATG...TGCAG|CTC 0 1 75.481
199620760 GC-AG 0 1.000000099473604e-05 106 rna-XM_010219115.1 35515751 14 5963 6068 Tinamus guttatus 94827 CAG|GCAAAGCCCC...TGAACCTTTTCC/CCTTTTCCAAGG...GGCAG|GAG 0 1 82.115
199620761 GT-AG 0 1.000000099473604e-05 327 rna-XM_010219115.1 35515751 15 6202 6528 Tinamus guttatus 94827 TGT|GTAAGGGCTC...GGTGCTCCAATG/TGTCTGTCCACG...GGCAG|CGT 1 1 88.51
199620762 GT-AG 0 1.000000099473604e-05 418 rna-XM_010219115.1 35515751 16 6684 7101 Tinamus guttatus 94827 CAG|GTGAGCGGCC...GCACCCTTCGCA/CGCATCCTGACA...CTCAG|GAT 0 1 95.962
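
The distribution of dinucleotide_pair and phase across these 16 rows can be tallied directly. The sketch below transcribes the two columns from the rows above, in ordinal_index order, and counts them with `collections.Counter`. It is only an illustration of the faceting idea, not the query the site itself runs.

```python
from collections import Counter

# (dinucleotide_pair, phase) for introns 1..16 of transcript 35515751
rows = [
    ("GT-AG", 2), ("GT-AG", 1), ("GT-AG", 0), ("GT-AG", 0),
    ("GT-AG", 0), ("GT-AG", 2), ("GT-AG", 2), ("GT-AG", 2),
    ("GT-AG", 1), ("GT-AG", 0), ("GT-AG", 0), ("GT-AG", 0),
    ("GT-AG", 0), ("GC-AG", 0), ("GT-AG", 1), ("GT-AG", 0),
]

pair_counts = Counter(pair for pair, _ in rows)
phase_counts = Counter(phase for _, phase in rows)

print(dict(pair_counts))
print(dict(phase_counts))
```

For this transcript the tally comes to 15 GT-AG and 1 GC-AG introns, with 9 in phase 0, 3 in phase 1, and 4 in phase 2.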


CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
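
To see how the index on transcript_id relates to a per-transcript lookup, the schema can be loaded into an empty in-memory SQLite database and inspected with `EXPLAIN QUERY PLAN`. This is a sketch under assumptions: it recreates only the table and one index from the DDL above, the referenced transcripts and genomes tables are not created (SQLite does not require them to exist when the table is defined), and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

DDL = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(DDL)

# Ask SQLite how it would execute the per-transcript filter
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (35515751,),
).fetchall()

# The human-readable description is the fourth column of each plan row
details = [row[3] for row in plan]
print(details)
```

If the plan text mentions `idx_introns_transcript_id`, the lookup is served by an index search rather than a full table scan.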