introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotides of the intron (e.g., GT-AG)
- is_minor: INTEGER, whether the intron is a minor intron (1) or not (0)
- score: REAL, estimated probability, as a percentage (0-100), that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier of the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
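The relationship between the length, start, and end columns can be checked with a small sketch. It assumes coordinates are 1-based and inclusive, so that length = end - start + 1; this convention is inferred from the rows on this page and is not stated by the database. The in-memory table below is a hypothetical subset of the real schema, loaded with two rows copied from the table on this page.

```python
import sqlite3

# Two rows copied from this page: (id, length, ordinal_index, start, end).
rows = [
    (346193, 1688, 2, 5647345, 5649032),
    (346194, 11437, 3, 5635841, 5647277),
]

conn = sqlite3.connect(":memory:")
# Hypothetical subset of the introns schema; "end" is quoted because END is an SQL keyword.
conn.execute(
    'CREATE TABLE introns (id INTEGER PRIMARY KEY, length INTEGER, '
    'ordinal_index INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

# Assumption: inclusive coordinates, so length should equal end - start + 1.
mismatches = conn.execute(
    'SELECT id FROM introns WHERE length != "end" - "start" + 1'
).fetchall()
print(mismatches)  # [] for the two sample rows
```

An empty result means every sampled row is consistent with the inclusive-coordinate assumption; it does not prove the convention holds for the whole database.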
34 rows where transcript_id = 76437
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 346193 | GT-AG | 0 | 4.499445877552415e-05 | 1688 | rna-XM_022233881.1 76437 | 2 | 5647345 | 5649032 | Acanthaster planci 133434 | CAG\|GTAAATTTTT...GGCTCTTTCACT/CTGTTTTTTATG...TACAG\|GAA | 0 | 1 | 4.799 |
| 346194 | GT-AG | 0 | 0.0036443526363187 | 11437 | rna-XM_022233881.1 76437 | 3 | 5635841 | 5647277 | Acanthaster planci 133434 | CAG\|GTACCAGTCC...TTTTTTTTATTT/TTTTTTTTTATT...TTCAG\|ATA | 1 | 1 | 5.924 |
| 346195 | GT-AG | 0 | 1.000000099473604e-05 | 673 | rna-XM_022233881.1 76437 | 4 | 5635036 | 5635708 | Acanthaster planci 133434 | TAG\|GTAGGAAGTG...GCCCCCTTGATT/TTGATTCTCAAA...ATCAG\|ACA | 1 | 1 | 8.139 |
| 346196 | GT-AG | 0 | 1.000000099473604e-05 | 1182 | rna-XM_022233881.1 76437 | 5 | 5633735 | 5634916 | Acanthaster planci 133434 | CAG\|GTAATTTTGC...CATTTCTGATTC/TCATTTCTGATT...CACAG\|GTT | 0 | 1 | 10.136 |
| 346197 | GT-AG | 0 | 1.000000099473604e-05 | 1117 | rna-XM_022233881.1 76437 | 6 | 5632509 | 5633625 | Acanthaster planci 133434 | GTG\|GTGAGTGCCA...AGTTCCATACTG/AGATTAGTAACA...TTCAG\|AGC | 1 | 1 | 11.965 |
| 346198 | GT-AG | 0 | 0.0001440001895749 | 478 | rna-XM_022233881.1 76437 | 7 | 5631927 | 5632404 | Acanthaster planci 133434 | AAG\|GTACTCGTGA...TCATTTTTGAGA/TTTAGGCTCATT...TGCAG\|TTA | 0 | 1 | 13.71 |
| 346199 | GT-AG | 0 | 1.000000099473604e-05 | 1513 | rna-XM_022233881.1 76437 | 8 | 5629922 | 5631434 | Acanthaster planci 133434 | AAG\|GTGGGAGGCC...TATATCATGAAA/ACATATATCATG...AACAG\|AAG | 0 | 1 | 21.967 |
| 346200 | GT-AG | 0 | 1.000000099473604e-05 | 574 | rna-XM_022233881.1 76437 | 9 | 5629176 | 5629749 | Acanthaster planci 133434 | ATG\|GTAAGGATAC...TCTTTTTTGCTA/GCTACACTCAGT...CACAG\|GCT | 1 | 1 | 24.853 |
| 346201 | GT-AG | 0 | 1.000000099473604e-05 | 607 | rna-XM_022233881.1 76437 | 10 | 5628484 | 5629090 | Acanthaster planci 133434 | CAG\|GTAGGTGTAA...TGTGTTTCATTT/TTGTGTTTCATT...CTCAG\|GCA | 2 | 1 | 26.28 |
| 346202 | GT-AG | 0 | 0.0023802398475836 | 190 | rna-XM_022233881.1 76437 | 11 | 5628158 | 5628347 | Acanthaster planci 133434 | GCT\|GTAAGTTTCA...AATCACTTAATT/AGTTGTCTAATC...GACAG\|CTC | 0 | 1 | 28.562 |
| 346203 | GT-AG | 0 | 1.000000099473604e-05 | 642 | rna-XM_022233881.1 76437 | 12 | 5627324 | 5627965 | Acanthaster planci 133434 | CAG\|GTGGGCAGGG...AATTCCTTAACC/CTTTGAATAATT...TACAG\|CTG | 0 | 1 | 31.784 |
| 346204 | GT-AG | 0 | 1.000000099473604e-05 | 811 | rna-XM_022233881.1 76437 | 13 | 5626411 | 5627221 | Acanthaster planci 133434 | AAG\|GTCAGTCTTG...TCTCTCTTACCA/GTCTCTCTTACC...GGCAG\|AAT | 0 | 1 | 33.496 |
| 346205 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_022233881.1 76437 | 14 | 5625670 | 5626061 | Acanthaster planci 133434 | CAA\|GTAAGTAGCA...TACCCCTTGGCA/GGGTTTGTAACT...TGCAG\|GCA | 1 | 1 | 39.352 |
| 346206 | GT-AG | 0 | 1.000000099473604e-05 | 655 | rna-XM_022233881.1 76437 | 15 | 5624893 | 5625547 | Acanthaster planci 133434 | AAG\|GTTGGTTCTT...TTGGTCCTATCT/TTGGATCTCAGA...TGCAG\|ATG | 0 | 1 | 41.4 |
| 346207 | GT-AG | 0 | 1.000000099473604e-05 | 474 | rna-XM_022233881.1 76437 | 16 | 5624216 | 5624689 | Acanthaster planci 133434 | GAA\|GTGAGTTTGA...ACTTTTTGGAAA/AAAGATTTTATG...CACAG\|GTT | 2 | 1 | 44.806 |
| 346208 | GT-AG | 0 | 1.1799910386736654e-05 | 553 | rna-XM_022233881.1 76437 | 17 | 5623539 | 5624091 | Acanthaster planci 133434 | CAT\|GTAAGTCCTA...CTGCCCTTCATT/AATATTTTCATT...TCTAG\|AAC | 0 | 1 | 46.887 |
| 346209 | GT-AG | 0 | 1.000000099473604e-05 | 405 | rna-XM_022233881.1 76437 | 18 | 5623008 | 5623412 | Acanthaster planci 133434 | GAG\|GTAAGCACAG...TATCCCTTGTCC/TATGTTTTCATT...TGCAG\|CAA | 0 | 1 | 49.002 |
| 346210 | GT-AG | 0 | 1.000000099473604e-05 | 563 | rna-XM_022233881.1 76437 | 19 | 5622362 | 5622924 | Acanthaster planci 133434 | CAA\|GTAAGTGTAA...TAGTCATTTATT/TAGTCATTTATT...TTCAG\|GAG | 2 | 1 | 50.394 |
| 346211 | GT-AG | 0 | 1.000000099473604e-05 | 336 | rna-XM_022233881.1 76437 | 20 | 5621789 | 5622124 | Acanthaster planci 133434 | CAG\|GTACAGGATT...CAGGCTTTGCTG/GCTTTGCTGATC...TGTAG\|GAA | 2 | 1 | 54.372 |
| 346212 | GT-AG | 0 | 1.000000099473604e-05 | 237 | rna-XM_022233881.1 76437 | 21 | 5621452 | 5621688 | Acanthaster planci 133434 | GAG\|GTAAAGCAGG...TTTGCTTTACCT/TTGTATTTCATG...AAAAG\|ACC | 0 | 1 | 56.05 |
| 346213 | GT-AG | 0 | 1.000000099473604e-05 | 678 | rna-XM_022233881.1 76437 | 22 | 5620532 | 5621209 | Acanthaster planci 133434 | CAA\|GTAAGTCATC...TGTTTCTTGCTT/AAGTGGTTTATG...CTCAG\|GTC | 2 | 1 | 60.111 |
| 346214 | GT-AG | 0 | 1.000000099473604e-05 | 722 | rna-XM_022233881.1 76437 | 23 | 5619689 | 5620410 | Acanthaster planci 133434 | GAG\|GTCAGTCGTT...CCCCACTTGACC/CAGAGTCTGACT...GCCAG\|GAT | 0 | 1 | 62.141 |
| 346215 | GT-AG | 0 | 1.000000099473604e-05 | 1044 | rna-XM_022233881.1 76437 | 24 | 5618457 | 5619500 | Acanthaster planci 133434 | CAG\|GTACAAGATG...TTGTCCTCTGCG/ATTTGTATAAGT...GGCAG\|GGA | 2 | 1 | 65.296 |
| 346216 | GT-AG | 0 | 1.000000099473604e-05 | 555 | rna-XM_022233881.1 76437 | 25 | 5617748 | 5618302 | Acanthaster planci 133434 | CAG\|GTGAGTCTGG...TATGTTTGAATT/TTGAAGTTCACA...TGCAG\|GTC | 0 | 1 | 67.881 |
| 346217 | GT-AG | 0 | 1.000000099473604e-05 | 740 | rna-XM_022233881.1 76437 | 26 | 5616776 | 5617515 | Acanthaster planci 133434 | CAG\|GTGGGCACAC...TGGTCTTTGTTT/TTTTTTTCCATT...TGTAG\|GCA | 1 | 1 | 71.774 |
| 346218 | GT-AG | 0 | 1.000000099473604e-05 | 2105 | rna-XM_022233881.1 76437 | 27 | 5614516 | 5616620 | Acanthaster planci 133434 | GTG\|GTGAGTCTTA...AGCTTCTTAACA/AGCTTCTTAACA...GTCAG\|GTG | 0 | 1 | 74.375 |
| 346219 | GT-AG | 0 | 1.000000099473604e-05 | 379 | rna-XM_022233881.1 76437 | 28 | 5613899 | 5614277 | Acanthaster planci 133434 | GGG\|GTAGGTGCTG...TTTCTTGTGATG/TTTCTTGTGATG...TACAG\|GGA | 1 | 1 | 78.369 |
| 346220 | GT-AG | 0 | 1.000000099473604e-05 | 388 | rna-XM_022233881.1 76437 | 29 | 5613311 | 5613698 | Acanthaster planci 133434 | CAG\|GTGGGTCATT...AATTTTTTAATC/AATTTTTTAATC...ATTAG\|TTA | 0 | 1 | 81.725 |
| 346221 | GT-AG | 0 | 1.000000099473604e-05 | 819 | rna-XM_022233881.1 76437 | 30 | 5612344 | 5613162 | Acanthaster planci 133434 | AAG\|GTTGTTGTCT...TAGAACTTAATG/TCCTCTCTCACT...GGCAG\|GCA | 1 | 1 | 84.209 |
| 346222 | GT-AG | 0 | 1.000000099473604e-05 | 445 | rna-XM_022233881.1 76437 | 31 | 5611666 | 5612110 | Acanthaster planci 133434 | AAG\|GTGAGTGGGT...CTCTCCCCAACA/ACAAATTTCAGT...CACAG\|GTA | 0 | 1 | 88.119 |
| 346223 | GT-AG | 0 | 0.0008967900562281 | 425 | rna-XM_022233881.1 76437 | 32 | 5611127 | 5611551 | Acanthaster planci 133434 | CTT\|GTAAGCTAAA...CTTTCCTTGTTT/GTAATGCTAACT...TACAG\|TAT | 0 | 1 | 90.032 |
| 346224 | GT-AG | 0 | 1.000000099473604e-05 | 225 | rna-XM_022233881.1 76437 | 33 | 5610721 | 5610945 | Acanthaster planci 133434 | CAG\|GTAGGTTGGG...TATTTATTATCT/AGTGTATTTATT...TCTAG\|TTG | 1 | 1 | 93.069 |
| 346225 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-XM_022233881.1 76437 | 34 | 5610468 | 5610612 | Acanthaster planci 133434 | AAG\|GTAGGTCAAC...TCATCATTACCT/GACATCATCATT...CATAG\|AAG | 1 | 1 | 94.882 |
| 348219 | GT-AG | 0 | 1.000000099473604e-05 | 3286 | rna-XM_022233881.1 76437 | 1 | 5649310 | 5652595 | Acanthaster planci 133434 | AAG\|GTAGGACTTC...TTCTTTTTATCT/ATATTCCTCATT...TTCAG\|CTA | | 0 | 2.635 |
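The table above is sorted by id, which places the first intron of the transcript (id 348219) last. A minimal sketch of the same filter, re-sorted by ordinal_index, is shown here. It uses a hypothetical in-memory table holding only three of the columns, loaded with three rows from this page plus one made-up row from another transcript to show the filter at work.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical three-column subset of the introns table.
conn.execute(
    "CREATE TABLE introns (id INTEGER PRIMARY KEY, "
    "transcript_id INTEGER, ordinal_index INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?)",
    [
        (346193, 76437, 2),  # from this page
        (346194, 76437, 3),  # from this page
        (348219, 76437, 1),  # from this page
        (1, 99999, 1),       # made-up row belonging to a different transcript
    ],
)

# Same filter as the page (transcript_id = 76437), ordered by position in the transcript.
ordered = conn.execute(
    "SELECT id, ordinal_index FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (76437,),
).fetchall()
print(ordered)  # [(348219, 1), (346193, 2), (346194, 3)]
```

Binding the transcript id as a parameter, rather than formatting it into the SQL string, keeps the query reusable for other transcripts.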
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
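To illustrate what the indexes above are for, the following sketch builds a reduced copy of the table in an in-memory SQLite database and asks the query planner how it would run a transcript_id filter. The reduced column set is an assumption made for brevity; only the index name and definition are taken from the schema above. The exact wording of the plan varies between SQLite versions, so the sketch only checks that the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema: only the columns needed for the demonstration.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "taxonomy_id" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id]
      ON [introns] ([transcript_id]);
    """
)

# Each plan row is a 4-tuple; the last element is the human-readable detail.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (76437,),
).fetchall()
details = " ".join(row[3] for row in plan)
uses_index = "idx_introns_transcript_id" in details
print(details)
print(uses_index)
```

If the plan mentions idx_introns_transcript_id, the filter is answered by an index search instead of a full scan of the introns table.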