home / WtMTA

introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

27 rows where transcript_id = 14424091

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: dinucleotide_pair, is_minor, score, length, phase

id ▼ dinucleotide_pair is_minor score length transcript_id ordinal_index start end taxonomy_id scored_motifs phase in_cds relative_position
77102332 GT-AG 0 0.003180332936462 97 rna-XM_024148944.1 14424091 1 6451495 6451591 Eutrema salsugineum 72664 AAG|GTTTCTGATC...TTTCCCTTTTCA/TCCGGTTTGAAT...TTCAG|TTA 0 1 3.654
77102333 GT-AG 0 1.000000099473604e-05 125 rna-XM_024148944.1 14424091 2 6451788 6451912 Eutrema salsugineum 72664 AAG|GTTCGTCACT...TTCTTCTTTGTT/TTGAATGTCATT...TCTAG|GCT 1 1 9.081
77102334 GT-AG 0 1.000000099473604e-05 98 rna-XM_024148944.1 14424091 3 6452029 6452126 Eutrema salsugineum 72664 CAG|GTTGATTAAA...TTGTCTTTATCC/TTTGTCTTTATC...TTCAG|CAT 0 1 12.292
77102335 GT-AG 0 0.0361191557467274 212 rna-XM_024148944.1 14424091 4 6452244 6452455 Eutrema salsugineum 72664 CAG|GTATTTTTTC...TTCTCATTATTT/TGTATTCTCATT...TTCAG|CCC 0 1 15.532
77102336 GT-AG 1 99.9986295496562 87 rna-XM_024148944.1 14424091 5 6452603 6452689 Eutrema salsugineum 72664 TTC|GTATCCTTAT...TTCCCCTTAATT/TTGTTTGTCATG...TCCAG|ATA 0 1 19.601
77102337 GT-AG 0 1.000000099473604e-05 76 rna-XM_024148944.1 14424091 6 6453038 6453113 Eutrema salsugineum 72664 GAG|GTAATTACTC...ATAATATTAACA/ATAATATTAACA...ATCAG|ATA 0 1 29.236
77102338 GT-AG 0 2.651497894850896e-05 313 rna-XM_024148944.1 14424091 7 6453269 6453581 Eutrema salsugineum 72664 GAG|GTATAAGACC...ATTGTTTTATTT/GATTGTTTTATT...TGCAG|TGA 2 1 33.527
77102339 GT-AG 0 1.000000099473604e-05 81 rna-XM_024148944.1 14424091 8 6453704 6453784 Eutrema salsugineum 72664 CAG|GTTTGTGGTA...TAATTTTTGCTA/ATTTTTGCTATT...TGCAG|GAT 1 1 36.905
77102340 GT-AG 0 1.000000099473604e-05 204 rna-XM_024148944.1 14424091 9 6453877 6454080 Eutrema salsugineum 72664 GGG|GTAAGATGTT...GCATCCGTTATG/CCAGAAGTAATT...TGCAG|GCT 0 1 39.452
77102341 GC-AG 0 1.7175320273838524e-05 93 rna-XM_024148944.1 14424091 10 6454325 6454417 Eutrema salsugineum 72664 AAG|GCATGCCCTT...ATACTCTTAGAT/TGATAATTCATA...TGCAG|GTG 1 1 46.207
77102342 GT-AG 0 0.012387743583884 306 rna-XM_024148944.1 14424091 11 6454473 6454778 Eutrema salsugineum 72664 TAG|GTTTGCTTCT...CACTTTTTAACT/CACTTTTTAACT...TTCAG|GGG 2 1 47.73
77102343 GT-AG 0 0.0003113675946738 84 rna-XM_024148944.1 14424091 12 6454931 6455014 Eutrema salsugineum 72664 ATG|GTACTTCTTG...TAGCTCTTGTCT/TCTTGTCTCACA...TATAG|CTG 1 1 51.938
77102344 GT-AG 0 0.0004310313323445 83 rna-XM_024148944.1 14424091 13 6455086 6455168 Eutrema salsugineum 72664 GAG|GTACTTGTTC...ATATCTTTATCT/TATATCTTTATC...TCCAG|GTT 0 1 53.904
77102345 GT-AG 0 0.0845594456606758 506 rna-XM_024148944.1 14424091 14 6455343 6455848 Eutrema salsugineum 72664 AAG|GTATCTACTG...GAATCCATAGCT/TTTATTTTCATC...TGCAG|GCT 0 1 58.721
77102346 GT-AG 0 4.718938795003046e-05 375 rna-XM_024148944.1 14424091 15 6455921 6456295 Eutrema salsugineum 72664 CCA|GTAAATCAAC...CTTCTCTTATTT/CCTTCTCTTATT...CCCAG|GAC 0 1 60.714
77102347 GT-AG 0 1.000000099473604e-05 87 rna-XM_024148944.1 14424091 16 6456395 6456481 Eutrema salsugineum 72664 CAG|GTTTGTAACT...ATAGACTTAATG/TCTCGTCTGATA...CCCAG|GAT 0 1 63.455
77102348 GT-AG 0 0.0005758290690893 416 rna-XM_024148944.1 14424091 17 6456572 6456987 Eutrema salsugineum 72664 AAG|GTATGTCTTT...AAGTTTTTAAGA/AAGTTTTTAAGA...TTCAG|GTG 0 1 65.947
77102349 GT-AG 0 1.3480895792469194 96 rna-XM_024148944.1 14424091 18 6457064 6457159 Eutrema salsugineum 72664 ACA|GTATGTTTTC...TTGACTTTGACT/TTGACTTTGACT...TTCAG|GTG 1 1 68.051
77102350 GT-AG 0 1.000000099473604e-05 78 rna-XM_024148944.1 14424091 19 6457254 6457331 Eutrema salsugineum 72664 TAG|GTAATTGGTA...CTGCTCTTATAT/ACTGCTCTTATA...TGCAG|CAT 2 1 70.653
77102351 GT-AG 0 1.000000099473604e-05 106 rna-XM_024148944.1 14424091 20 6457402 6457507 Eutrema salsugineum 72664 AAG|GTTAGCAAAT...GAATTTTTCTCT/CACCTGTTGAAT...TTCAG|GAC 0 1 72.591
77102352 GT-AG 0 1.000000099473604e-05 294 rna-XM_024148944.1 14424091 21 6457688 6457981 Eutrema salsugineum 72664 AAG|GTGGTTCTCT...GTTTCCTAATCA/GGTTTCCTAATC...TGCAG|CCT 0 1 77.575
77102353 GT-AG 0 1.000000099473604e-05 81 rna-XM_024148944.1 14424091 22 6458105 6458185 Eutrema salsugineum 72664 AAG|GTAACAGAAT...GTGTTCTTCATT/GTGTTCTTCATT...TGCAG|TGG 0 1 80.98
77102354 GT-AG 0 0.0006672322764353 84 rna-XM_024148944.1 14424091 23 6458270 6458353 Eutrema salsugineum 72664 GAG|GTACTTTTTC...GCAATCTTAAGA/TTGCTTGTAATC...TGTAG|GTG 0 1 83.306
77102355 GT-AG 0 1.000000099473604e-05 95 rna-XM_024148944.1 14424091 24 6458539 6458633 Eutrema salsugineum 72664 TAG|GTAATTTTAA...ATTGACATGACG/TAGGTTGTGAAA...TGCAG|GTT 2 1 88.427
77102356 GT-AG 0 1.000000099473604e-05 105 rna-XM_024148944.1 14424091 25 6458773 6458877 Eutrema salsugineum 72664 AAG|GTAAATCCCT...TATTCCTTGTCA/ATAATACTCATA...CGCAG|AAT 0 1 92.276
77102357 GT-AG 0 1.000000099473604e-05 90 rna-XM_024148944.1 14424091 26 6458938 6459027 Eutrema salsugineum 72664 AAT|GTGAGTTTTC...TTCACTGTAATA/GTAAACTTCACT...GGTAG|GAG 0 1 93.937
77102358 GT-AG 0 1.000000099473604e-05 93 rna-XM_024148944.1 14424091 27 6459133 6459225 Eutrema salsugineum 72664 CAG|GTAAGATTTA...TGTCCCTTGGTG/CTCAGTCTCAGA...ATCAG|GTG 0 1 96.844

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
Powered by Datasette · Queries took 70.61ms · Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)