introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
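The `phase` and `in_cds` descriptions imply a simple consistency rule: an intron inside the coding sequence has a phase of 0, 1, or 2, and an intron outside it has no phase. A minimal Python sketch of that check follows. The function name `phase_is_consistent` is an illustrative assumption, and the rule is inferred from the column descriptions rather than declared as a database constraint.

```python
from typing import Optional


def phase_is_consistent(phase: Optional[int], in_cds: int) -> bool:
    """Return True when phase and in_cds agree with the column descriptions.

    Assumed rule (inferred, not a declared constraint):
    in_cds = 1 requires phase in {0, 1, 2};
    in_cds = 0 requires phase to be NULL (None in Python).
    """
    if in_cds == 1:
        return phase in (0, 1, 2)
    if in_cds == 0:
        return phase is None
    return False  # in_cds is documented as 0 or 1 only


# First entry uses values from intron 82387572; the second is a
# hypothetical UTR intron used only to exercise the NULL-phase branch.
examples = [
    {"label": "coding intron", "phase": 2, "in_cds": 1},
    {"label": "hypothetical UTR intron", "phase": None, "in_cds": 0},
]
for ex in examples:
    print(ex["label"], phase_is_consistent(ex["phase"], ex["in_cds"]))
```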
33 rows where transcript_id = 15214378
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82387572 | GT-AG | 0 | 1.000000099473604e-05 | 12603 | rna-XM_033919629.1 15214378 | 3 | 435863058 | 435875660 | Geotrypetes seraphini 260995 | TGG|GTAAGGAAGT...CTCCCCTTTGTT/GTGCAGCTCACT...TGCAG|AGT | 2 | 1 | 9.144 | 
| 82387573 | GT-AG | 0 | 3.48909784122105e-05 | 1317 | rna-XM_033919629.1 15214378 | 4 | 435861488 | 435862804 | Geotrypetes seraphini 260995 | GTG|GTAAGCACTA...TGGTTTTTATTT/TTTTATTTCATT...TGCAG|TGG | 0 | 1 | 13.283 | 
| 82387574 | GT-AG | 0 | 1.000000099473604e-05 | 14068 | rna-XM_033919629.1 15214378 | 5 | 435847350 | 435861417 | Geotrypetes seraphini 260995 | CTG|GTAAGTAAGC...GCTACTGTGATG/CTGTGATGTATT...TCCAG|GTT | 1 | 1 | 14.428 | 
| 82387575 | GT-AG | 0 | 1.000000099473604e-05 | 823 | rna-XM_033919629.1 15214378 | 6 | 435846357 | 435847179 | Geotrypetes seraphini 260995 | AAG|GTACATGAAA...TACATATTATTC/CTACATATTATT...TGCAG|GTC | 0 | 1 | 17.209 | 
| 82387576 | GT-AG | 0 | 1.000000099473604e-05 | 4943 | rna-XM_033919629.1 15214378 | 7 | 435841172 | 435846114 | Geotrypetes seraphini 260995 | AAA|GTGAGTGTAG...TGTGTTTTTGTT/AACTTGCTAAGT...TTCAG|GCA | 2 | 1 | 21.168 | 
| 82387577 | GT-AG | 0 | 1.5722305409244213e-05 | 788 | rna-XM_033919629.1 15214378 | 8 | 435840317 | 435841104 | Geotrypetes seraphini 260995 | CAG|GTAACAAGAA...TTTTTCTTCTTT/TTAATATTTATA...ACCAG|CTG | 0 | 1 | 22.264 | 
| 82387578 | GT-AG | 0 | 1.000000099473604e-05 | 16991 | rna-XM_033919629.1 15214378 | 9 | 435823267 | 435840257 | Geotrypetes seraphini 260995 | CAG|GTGAGTTTAA...AAGTCTTTAAGA/AAGTCTTTAAGA...TTTAG|TGG | 2 | 1 | 23.229 | 
| 82387579 | GT-AG | 0 | 0.0008804404604384 | 17084 | rna-XM_033919629.1 15214378 | 10 | 435806038 | 435823121 | Geotrypetes seraphini 260995 | GCT|GTAAGCTCAA...GATTTTTAAGCA/TGTCTGGTAATT...TACAG|ATG | 0 | 1 | 25.601 | 
| 82387580 | GT-AG | 0 | 0.0001121467590146 | 6504 | rna-XM_033919629.1 15214378 | 11 | 435799358 | 435805861 | Geotrypetes seraphini 260995 | TGT|GTAAGTGTTC...TGGTTTTTATTG/GTGGTTTTTATT...TGCAG|ATC | 2 | 1 | 28.48 | 
| 82387581 | GT-AG | 0 | 1.000000099473604e-05 | 9931 | rna-XM_033919629.1 15214378 | 12 | 435789326 | 435799256 | Geotrypetes seraphini 260995 | AAG|GTAAATAATG...CTCCCCCTACTT/GTTTGTCTGATG...CTCAG|TGC | 1 | 1 | 30.133 | 
| 82387582 | GT-AG | 0 | 5.395938315656748e-05 | 4571 | rna-XM_033919629.1 15214378 | 13 | 435784536 | 435789106 | Geotrypetes seraphini 260995 | AAG|GTAACTGGAA...AAGTTTTTACTT/GAAGTTTTTACT...TCCAG|GAT | 1 | 1 | 33.715 | 
| 82387583 | GT-AG | 0 | 1.000000099473604e-05 | 5993 | rna-XM_033919629.1 15214378 | 14 | 435778397 | 435784389 | Geotrypetes seraphini 260995 | AAG|GTAAAAACAA...TCCTCCTTCCTG/AAGCATCTAAAT...TCTAG|CTG | 0 | 1 | 36.103 | 
| 82387584 | GT-AG | 0 | 0.0001228768376561 | 458 | rna-XM_033919629.1 15214378 | 15 | 435777903 | 435778360 | Geotrypetes seraphini 260995 | AAG|GTATTTGTAT...GATGCCTGAAAC/ATTTCGCTGATG...TTTAG|GTC | 0 | 1 | 36.692 | 
| 82387585 | GT-AG | 0 | 1.000000099473604e-05 | 3925 | rna-XM_033919629.1 15214378 | 16 | 435773847 | 435777771 | Geotrypetes seraphini 260995 | CTG|GTACAGTAAC...GAAAACTTGATT/CTTATTTTTACT...TTTAG|TTA | 2 | 1 | 38.835 | 
| 82387586 | GT-AG | 0 | 1.000000099473604e-05 | 706 | rna-XM_033919629.1 15214378 | 17 | 435773008 | 435773713 | Geotrypetes seraphini 260995 | CAG|GTAGGAACTG...TTATTTTTATAT/TTTATTTTTATA...TTCAG|GAG | 0 | 1 | 41.011 | 
| 82387587 | GT-AG | 0 | 1.000000099473604e-05 | 991 | rna-XM_033919629.1 15214378 | 18 | 435771810 | 435772800 | Geotrypetes seraphini 260995 | GAG|GTGAGATGTG...TGATATTTAATA/CTGTTGCTGATA...CTCAG|GTC | 0 | 1 | 44.397 | 
| 82387588 | GT-AG | 0 | 2.162890706941423e-05 | 6324 | rna-XM_033919629.1 15214378 | 19 | 435765357 | 435771680 | Geotrypetes seraphini 260995 | AAG|GTAGGTATGA...TTTTCTTTACCT/ATTTTCTTTACC...TGCAG|GTT | 0 | 1 | 46.507 | 
| 82387589 | GT-AG | 0 | 0.0002980171976122 | 5794 | rna-XM_033919629.1 15214378 | 20 | 435759418 | 435765211 | Geotrypetes seraphini 260995 | CAG|GTAACATTAG...TATTCTTTCTCC/ACTATACAGAAT...TCTAG|CTC | 1 | 1 | 48.879 | 
| 82387590 | GT-AG | 0 | 2.739854305823761e-05 | 23278 | rna-XM_033919629.1 15214378 | 21 | 435735993 | 435759270 | Geotrypetes seraphini 260995 | CAG|GTATTGCTGG...TGTGCTTTTCTT/ACCCTGTTCATG...CACAG|AAG | 1 | 1 | 51.284 | 
| 82387591 | GT-AG | 0 | 1.000000099473604e-05 | 2723 | rna-XM_033919629.1 15214378 | 22 | 435733144 | 435735866 | Geotrypetes seraphini 260995 | GAG|GTTATGTGCT...TAGTTCTTGATC/TAGTTCTTGATC...TACAG|GTA | 1 | 1 | 53.345 | 
| 82387592 | GT-AG | 0 | 2.597711991974616e-05 | 2279 | rna-XM_033919629.1 15214378 | 23 | 435730724 | 435733002 | Geotrypetes seraphini 260995 | GTG|GTAAGTTGCT...ACTGTCTTGATG/TGTGTGCTCACT...TCCAG|GAA | 1 | 1 | 55.652 | 
| 82387593 | GC-AG | 0 | 1.000000099473604e-05 | 11992 | rna-XM_033919629.1 15214378 | 24 | 435717572 | 435729563 | Geotrypetes seraphini 260995 | GAG|GCAAGTATGA...TAAATATTGATT/TAAATATTGATT...GACAG|GAA | 0 | 1 | 74.628 | 
| 82387594 | GC-AG | 0 | 1.1426894981463972e-05 | 1788 | rna-XM_033919629.1 15214378 | 25 | 435715606 | 435717393 | Geotrypetes seraphini 260995 | CAG|GCATGCACCA...GTCTCCTTATAG/TGTCTCCTTATA...TGCAG|GTG | 1 | 1 | 77.54 | 
| 82387595 | GT-AG | 0 | 1.000000099473604e-05 | 17219 | rna-XM_033919629.1 15214378 | 26 | 435698286 | 435715504 | Geotrypetes seraphini 260995 | GAG|GTAAAAGTAG...CCCCTTTTAACG/CTTTTTCTCACC...GGCAG|GTG | 0 | 1 | 79.192 | 
| 82387596 | GT-AG | 0 | 1.000000099473604e-05 | 14597 | rna-XM_033919629.1 15214378 | 27 | 435683513 | 435698109 | Geotrypetes seraphini 260995 | CAG|GTGAGTAGCA...TTTTTCTTATCT/TTTTTTCTGAAT...TTCAG|TTT | 2 | 1 | 82.071 | 
| 82387597 | GT-AG | 0 | 8.720325411110725e-05 | 18417 | rna-XM_033919629.1 15214378 | 28 | 435664913 | 435683329 | Geotrypetes seraphini 260995 | GAA|GTAAGCAACG...ATTTTCTTTTTT/CTATAACTGAGA...ACTAG|TAA | 2 | 1 | 85.065 | 
| 82387598 | GT-AG | 0 | 1.000000099473604e-05 | 7801 | rna-XM_033919629.1 15214378 | 29 | 435656919 | 435664719 | Geotrypetes seraphini 260995 | GTG|GTGAGTGTAT...AAATCTTTGCTA/GTGTTTCCAACC...TGCAG|GAT | 0 | 1 | 88.222 | 
| 82387599 | GT-AG | 0 | 0.0014311638712148 | 1091 | rna-XM_033919629.1 15214378 | 30 | 435655582 | 435656672 | Geotrypetes seraphini 260995 | CAG|GTATACAACA...CTTTTCTCACCT/ACTTTTCTCACC...TTTAG|CTT | 0 | 1 | 92.246 | 
| 82387600 | GT-AG | 0 | 9.208734042124592e-05 | 5990 | rna-XM_033919629.1 15214378 | 31 | 435649474 | 435655463 | Geotrypetes seraphini 260995 | TCA|GTAAGTGGTG...GAGTCCTTAGTA/TATATTTTGATT...GCCAG|ACT | 1 | 1 | 94.176 | 
| 82387601 | GT-AG | 0 | 1.000000099473604e-05 | 4473 | rna-XM_033919629.1 15214378 | 32 | 435644847 | 435649319 | Geotrypetes seraphini 260995 | GAG|GTGAGCAGGG...CATTTGTTGAAA/GTTGAAATCATT...TCTAG|GAG | 2 | 1 | 96.696 | 
| 82387602 | GT-AG | 0 | 1.000000099473604e-05 | 475 | rna-XM_033919629.1 15214378 | 33 | 435644307 | 435644781 | Geotrypetes seraphini 260995 | TTG|GTAAGTTGAA...TGTTCCTTCTCC/TATTTGCTCAGC...TTCAG|AGG | 1 | 1 | 97.759 | 
| 82403206 | GT-AG | 0 | 1.000000099473604e-05 | 8250 | rna-XM_033919629.1 15214378 | 1 | 435950650 | 435958899 | Geotrypetes seraphini 260995 | GAA|GTAAGTGAAA...AAACCTTTATTT/TCAGTTCTAAGT...CTTAG|TGC |  | 0 | 2.356 |
| 82403207 | GT-AG | 0 | 1.000000099473604e-05 | 74560 | rna-XM_033919629.1 15214378 | 2 | 435875980 | 435950539 | Geotrypetes seraphini 260995 | CAG|GTGATTTATC...ATTTTCTTTTCT/TCTTTTCTGATT...TGTAG|GGC |  | 0 | 4.155 |
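In the rows sampled below, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The following sketch checks this on three rows copied from the table. The helper name `span_length` is an assumption for illustration, and the inclusive-coordinate reading is inferred from these rows rather than documented.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed (inclusive) interval [start, end]."""
    return end - start + 1


# (id, start, end, length) copied from three rows of the table above.
sample = [
    (82387572, 435863058, 435875660, 12603),
    (82387573, 435861488, 435862804, 1317),
    (82387602, 435644307, 435644781, 475),
]

for intron_id, start, end, length in sample:
    # Compare the stored length with the one implied by the coordinates.
    assert span_length(start, end) == length, intron_id

print("all sampled lengths match end - start + 1")
```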
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
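To see how the schema and the `idx_introns_transcript_id` index serve a filter such as `transcript_id = 15214378`, the sketch below rebuilds the table in an in-memory SQLite database with Python's standard `sqlite3` module. Several things here are assumptions: that the database is SQLite (the bracket-quoted identifiers suggest it), that recreating only one index is enough for illustration, and that the referenced `transcripts` and `genomes` tables can be omitted, which works only because foreign-key enforcement is switched off explicitly.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
# The parent tables are not created here, so keep enforcement off.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)

# Two rows from the listing; scored_motifs is left NULL for brevity.
conn.executemany(
    'INSERT INTO introns (id, dinucleotide_pair, is_minor, score, length, '
    'transcript_id, ordinal_index, "start", "end", taxonomy_id, phase, '
    'in_cds, relative_position) VALUES (?,?,?,?,?,?,?,?,?,?,?,?,?)',
    [
        (82387573, "GT-AG", 0, 3.48909784122105e-05, 1317, 15214378, 4,
         435861488, 435862804, 260995, 0, 1, 13.283),
        (82387572, "GT-AG", 0, 1.000000099473604e-05, 12603, 15214378, 3,
         435863058, 435875660, 260995, 2, 1, 9.144),
    ],
)

QUERY = ('SELECT id, ordinal_index, length FROM introns '
         'WHERE transcript_id = ? ORDER BY ordinal_index')

# Introns of one transcript, in transcript order.
rows = conn.execute(QUERY, (15214378,)).fetchall()
print(rows)

# Ask SQLite how it would run the query; the last column is the plan text.
plan = [r[-1] for r in conn.execute("EXPLAIN QUERY PLAN " + QUERY, (15214378,))]
print(plan)
```

The exact wording of the plan text varies between SQLite versions, so it is best read for which index is named rather than matched literally.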