home / WtMTA

introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

16 rows where transcript_id = 14424033

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: score, length, phase

id ▼ dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position
77101545 GT-AG 0 1.000000099473604e-05 217 rna-XM_006411589.2 14424033 1 161942 162158 Eutrema salsugineum 72664 CAG|GTTGGCTATC...GTATTCTAAATT/ATTTTGCTCACG...AACAG|TCT 0 1 15.491
77101546 GT-AG 0 1.000000099473604e-05 77 rna-XM_006411589.2 14424033 2 162270 162346 Eutrema salsugineum 72664 CAG|GTGGGTAATT...TATGTTTTATCT/TTATGTTTTATC...TGTAG|CGT 0 1 17.454
77101547 GT-AG 0 1.000000099473604e-05 90 rna-XM_006411589.2 14424033 3 162464 162553 Eutrema salsugineum 72664 AAG|GTAATATACT...TCCTCCATAATT/TTTGTTCGTATG...GGCAG|GTT 0 1 19.523
77101548 GT-AG 0 0.0001099776872261 73 rna-XM_006411589.2 14424033 4 162725 162797 Eutrema salsugineum 72664 ACG|GTATGAGTAA...TTATCTTTATTC/TCTTTATTCACA...CACAG|GGG 0 1 22.546
77101549 GT-AG 0 1.000000099473604e-05 416 rna-XM_006411589.2 14424033 5 162960 163375 Eutrema salsugineum 72664 CAG|GTACTGAAAA...ATATTCTCAATT/AATATTCTCAAT...CTTAG|ATT 0 1 25.411
77101550 GT-AG 0 0.0004352626329155 81 rna-XM_006411589.2 14424033 6 163613 163693 Eutrema salsugineum 72664 CAG|GTAATTTTTT...CCGTCCTTACTT/CTTACTTTCATT...AACAG|ATT 0 1 29.602
77101551 GT-AG 0 3.334101635065554e-05 77 rna-XM_006411589.2 14424033 7 163816 163892 Eutrema salsugineum 72664 GAG|GTTCGTTATT...AACTTCTTATCT/CAACTTCTTATC...TGTAG|CTT 2 1 31.76
77101552 GT-AG 0 0.0180486287196632 107 rna-XM_006411589.2 14424033 8 165310 165416 Eutrema salsugineum 72664 GAG|GTATGCTATT...ACTATCTTACTT/ATCTTACTTATA...TGCAG|GAT 0 1 56.817
77101553 GT-AG 0 3.5466428860253295e-05 286 rna-XM_006411589.2 14424033 9 165513 165798 Eutrema salsugineum 72664 GAT|GTAAGTATTC...GCTTCTATAATA/AATAACTTCACA...TCCAG|GTG 0 1 58.515
77101554 GT-AG 0 1.000000099473604e-05 114 rna-XM_006411589.2 14424033 10 167350 167463 Eutrema salsugineum 72664 CAA|GTGAGTTTTT...TTGTTTTTAATT/CTTTTCCTCACT...CGCAG|GAG 0 1 85.942
77101555 GT-AG 0 0.0011914852253043 84 rna-XM_006411589.2 14424033 11 167689 167772 Eutrema salsugineum 72664 ATT|GTACTGTCTT...TATTCTTTTACT/TATTCTTTTACT...TACAG|GTT 0 1 89.92
77101556 GT-AG 0 0.0013405968470832 184 rna-XM_006411589.2 14424033 12 167878 168061 Eutrema salsugineum 72664 CAG|GTTTGCTGTT...TCATCCTTATAA/TGCATTTTCATC...CTCAG|GTG 0 1 91.777
77101557 GT-AG 0 1.000000099473604e-05 90 rna-XM_006411589.2 14424033 13 168143 168232 Eutrema salsugineum 72664 GAG|GTGATTTTTC...GAAACCTTGTTT/GCATTACTGAAA...TGCAG|GCG 0 1 93.21
77101558 GT-AG 0 0.0189952282978414 190 rna-XM_006411589.2 14424033 14 168302 168491 Eutrema salsugineum 72664 AAT|GTATGTTGTT...CTGACCCTGATT/TGTTGATTCATT...TGTAG|TTA 0 1 94.43
77101559 GT-AG 0 1.000000099473604e-05 242 rna-XM_006411589.2 14424033 15 168621 168862 Eutrema salsugineum 72664 AAG|GTAAGAAATC...GGATTCTAAAAC/ACTGTTTTTATG...TGCAG|CAC 0 1 96.711
77101560 GT-AG 0 1.000000099473604e-05 91 rna-XM_006411589.2 14424033 16 168983 169073 Eutrema salsugineum 72664 AAG|GTGAGTCATA...TTATTTGTACCT/TTGTTTGTAAAG...AACAG|GAA 0 1 98.833

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
Powered by Datasette · Queries took 28.74ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)