introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

15 rows where transcript_id = 12801876

id,dinucleotide_pair,is_minor,score,length,transcript_id,ordinal_index,start,end,taxonomy_id,scored_motifs,phase,in_cds,relative_position
68110917,GT-AG,0,4.9711035241992205e-05,771,rna-XM_029510336.1 (12801876),1,14129318,14130088,Echeneis naucrates (173247),AAG|GTACCGCAAA...TTCACTTTTGCT/TTGTGTTTCACT...TTCAG|TTG,1,1,4.303
68110918,GT-AG,0,0.0001877958155602,681,rna-XM_029510336.1 (12801876),2,14127607,14128287,Echeneis naucrates (173247),AAA|GTAAACATGT...CATCACTTAAAA/CAATCCATCACT...TGTAG|AGC,2,1,23.409
68110919,GT-AG,0,1.000000099473604e-05,182,rna-XM_029510336.1 (12801876),3,14127300,14127481,Echeneis naucrates (173247),AAG|GTTATTAAAC...TCTCTCTTACAA/GTCTCTCTTACA...CATAG|CTA,1,1,25.728
68110920,GT-AG,0,1.000000099473604e-05,410,rna-XM_029510336.1 (12801876),4,14126801,14127210,Echeneis naucrates (173247),CAG|GTTAGTCAGC...ATTGTCATATCT/CCTATTGTCATA...TCTAG|GCA,0,1,27.379
68110921,GT-AG,0,1.000000099473604e-05,129,rna-XM_029510336.1 (12801876),5,14126595,14126723,Echeneis naucrates (173247),GAG|GTAAGAGAGA...AATCTATTAATT/AATCTATTAATT...TGCAG|ACG,2,1,28.807
68110922,GT-AG,0,1.000000099473604e-05,362,rna-XM_029510336.1 (12801876),6,14126092,14126453,Echeneis naucrates (173247),AAA|GTGAGTTTAC...TTGGCTTCAGCC/GCCATTCTAACT...TCTAG|GCT,2,1,31.423
68110923,GT-AG,0,1.000000099473604e-05,272,rna-XM_029510336.1 (12801876),7,14123238,14123509,Echeneis naucrates (173247),CAG|GTAGTTTGCA...GCTTCTGTGAAC/CTCTGGTTTATT...TTAAG|GTA,1,1,79.317
68110924,GT-AG,0,1.000000099473604e-05,108,rna-XM_029510336.1 (12801876),8,14123044,14123151,Echeneis naucrates (173247),CAA|GTGAGCTCTT...TTTCATTTGATG/GTGGATTTCATT...TACAG|GAA,0,1,80.913
68110925,GT-AG,0,1.0797764371699584e-05,253,rna-XM_029510336.1 (12801876),9,14122745,14122997,Echeneis naucrates (173247),AAA|GTAAGTTCAC...TAAATGTTGACT/TTGTGTTTCACC...TCAAG|GTT,1,1,81.766
68110926,GT-AG,0,0.0002853301359774,117,rna-XM_029510336.1 (12801876),10,14122410,14122526,Echeneis naucrates (173247),AAG|GTACCACGCA...CTGTACTTATTT/GCTGTACTTATT...TTTAG|ACT,0,1,85.81
68110927,GT-AG,0,1.000000099473604e-05,257,rna-XM_029510336.1 (12801876),11,14122028,14122284,Echeneis naucrates (173247),CAA|GTGAGACAAA...CATGTCTTATTT/TTATGTCTGATC...TGCAG|TTA,2,1,88.128
68110928,GT-AG,0,0.001654523987545,359,rna-XM_029510336.1 (12801876),12,14121542,14121900,Echeneis naucrates (173247),AAG|GTATTCACCT...ATACCGTTACTC/CCGTTACTCAAT...TTCAG|AGG,0,1,90.484
68110929,GC-AG,0,1.000000099473604e-05,229,rna-XM_029510336.1 (12801876),13,14121139,14121367,Echeneis naucrates (173247),CAG|GCAAGAAGCT...TTCTTCTGAAGA/AGTAAATTCATT...CTTAG|GGA,0,1,93.712
68110930,GT-AG,0,1.5466592323165025e-05,678,rna-XM_029510336.1 (12801876),14,14120292,14120969,Echeneis naucrates (173247),CAG|GTATTAAACA...CTCCTGTTAACA/GTGTGTTTCATC...TGTAG|AGT,1,1,96.847
68110931,GT-AG,0,0.0197350036046008,83,rna-XM_029510336.1 (12801876),15,14120085,14120167,Echeneis naucrates (173247),CAG|GTATTCCCAC...TTTCTCTTATCT/ATTTCTCTTATC...GTCAG|GTT,2,1,99.147
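
The rows above suggest two relationships between columns that the column descriptions do not state outright: length appears to equal end - start + 1, and dinucleotide_pair appears to match the two bases just inside each "|" boundary of scored_motifs. The sketch below checks both on two rows copied from the table. The helper names span_length and terminal_dinucleotides are hypothetical, and the reading of the scored_motifs layout (exon|intron start ... intron end|exon) is an assumption inferred from these rows, not documented behaviour of the database.

```python
# Two rows copied from the table above:
# (id, dinucleotide_pair, length, start, end, scored_motifs)
ROWS = [
    (68110917, "GT-AG", 771, 14129318, 14130088,
     "AAG|GTACCGCAAA...TTCACTTTTGCT/TTGTGTTTCACT...TTCAG|TTG"),
    (68110929, "GC-AG", 229, 14121139, 14121367,
     "CAG|GCAAGAAGCT...TTCTTCTGAAGA/AGTAAATTCATT...CTTAG|GGA"),
]


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed inclusive on both ends)."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Assumed layout: exon|intron start ... intron end|exon.

    Returns the two bases after the first '|' and the two bases
    before the last '|', joined with a hyphen.
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    donor = scored_motifs[first + 1:first + 3]
    acceptor = scored_motifs[last - 2:last]
    return f"{donor}-{acceptor}"


for intron_id, pair, length, start, end, motifs in ROWS:
    # Both checks hold for the sampled rows; other rows are not verified here.
    assert span_length(start, end) == length, intron_id
    assert terminal_dinucleotides(motifs) == pair, intron_id
```

Only two of the fifteen rows are checked here, so the relationships should be treated as observations about this sample rather than guarantees about the whole table.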

CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
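
The filter shown on this page (transcript_id = 12801876) is an equality lookup on a column that has its own index, idx_introns_transcript_id. The following sketch reproduces that lookup with Python's standard sqlite3 module against an in-memory table. It assumes a reduced copy of the schema (four columns, no foreign keys) and includes one made-up row for a different transcript; it is an illustration, not the database's actual contents or the query Datasette issues.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced, illustrative copy of the schema: a few columns only, no foreign keys.
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    dinucleotide_pair TEXT,
    transcript_id INTEGER,
    ordinal_index INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (68110917, "GT-AG", 12801876, 1),   # copied from the table on this page
        (68110929, "GC-AG", 12801876, 13),  # copied from the table on this page
        (1, "GT-AG", 999, 1),               # made-up row for another transcript
    ],
)

query = (
    "SELECT id, dinucleotide_pair FROM introns "
    "WHERE transcript_id = ? ORDER BY id"
)
rows = conn.execute(query, (12801876,)).fetchall()

# EXPLAIN QUERY PLAN returns rows whose last field describes each plan step.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (12801876,)).fetchall()
plan_text = " ".join(str(step[-1]) for step in plan)

print(rows)
print(plan_text)
```

The exact wording of the query plan varies between SQLite versions, so the sketch only prints it; the point of interest is whether the index name appears in the plan.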